How NVIDIA Uses TensorFlow
444 engineering articles about TensorFlow from NVIDIA's engineering team
Other NVIDIA Technologies
Other Companies Using TensorFlow
Articles
Filter:
The article discusses the relevance of the Common Vulnerabilities and Exposures (CVE) system in relation to AI models, arguing that CVEs should be focused on the frameworks and applications that ut...
Rich Harang
7 min read
Has Summary
--
The article discusses the impending shortage of healthcare workers and how AI-enabled robotic systems, powered by NVIDIA Isaac for Healthcare, can address these challenges.
Ansley Dunn
6 min read
Includes Code
Has Summary
--
The article discusses the introduction of cuda-cccl, a Python library that provides high-level building blocks for NVIDIA CUDA kernel fusion, enabling developers to write efficient algorithms witho...
Ashwin Srinath
5 min read
Includes Code
Has Summary
--
The article discusses how to accelerate Deep Learning (DL) and Large Language Model (LLM) inference using Apache Spark in cloud environments.
ApacheApache SparkAWSAzureDeep LearningDockerJSONNumPyPythonPyTorchSemantic SearchTensorFlowTransformers
Rishi Chandra
9 min read
Includes Code
Has Summary
--
The article discusses optimizing transformer-based diffusion models for video generation using NVIDIA TensorRT, highlighting significant reductions in latency and total cost of ownership (TCO) achi...
Maximilian Mรผller
7 min read
Has Summary
--
A new AI-powered tool developed by researchers at the University of Florida and medical centers aims to improve the diagnosis of Parkinson's disease using standard MRI scans.
Michelle Horton
3 min read
Has Summary
--
NVIDIA has open-sourced the KAI Scheduler, a Kubernetes-native GPU scheduling solution under the Apache 2. 0 license, originally developed for the Run:ai platform.
Ronen Dar
9 min read
Has Summary
--
NVIDIA Dynamo is a newly released low-latency distributed inference framework designed to enhance the deployment of generative AI and reasoning models in large-scale environments.
Amr Elmeleegy
12 min read
Has Summary
--
NVIDIA has announced world-record inference performance for the DeepSeek-R1 model using the Blackwell architecture, achieving over 250 tokens per second per user and a maximum throughput of over 30...
Ashraf Eassa
13 min read
Has Summary
--
The article discusses an advanced deep-learning model designed to automate X-ray analysis for spinal health diagnostics, enhancing speed and accuracy in assessing conditions like scoliosis and kyph...
Michelle Horton
4 min read
Has Summary
--
The article discusses key insights from the NVIDIA 6G Developer Day 2024, highlighting the integration of AI into 6G infrastructure and the significance of AI-RAN.
Emeka Obiodu
10 min read
Has Summary
--
The article discusses the advancements in AI agents facilitated by NVIDIA AI Enterprise, emphasizing enhanced security, streamlined deployment, and management of AI pipelines.
Charu Chaubal
5 min read
Has Summary
--
The article discusses how to scale Large Language Models (LLMs) using NVIDIA Triton and NVIDIA TensorRT-LLM in a Kubernetes environment.
AWSAzureDockerGenerative AIGPTGrafanaHelmHugging FaceKubernetesNGINXPrometheusPythonPyTorchTensorFlowTraefik
Maggie Zhang
16 min read
Includes Code
Has Summary
--
The article discusses the Brain-Machine Interactive Neuromodulation Research Tool (BMINT), which utilizes closed-loop neuromodulation techniques to treat brain diseases like epilepsy and Parkinson'...
Shouyan Wang
4 min read
Has Summary
--
The article discusses the development and deployment of real-time neural receivers (NRX) in 5G New Radio (5G NR) systems, highlighting their potential to enhance wireless communication through AI-d...
Sebastian Cammerer
10 min read
Has Summary
--
The article discusses the release of NVIDIA TAO 5. 5, a framework that simplifies AI model development and deployment.
Monika Jhuria
12 min read
Includes Code
Has Summary
--
The article discusses the impressive performance of the NVIDIA Triton Inference Server in the MLPerf Inference v4.
Amr Elmeleegy
8 min read
Includes Code
Has Summary
--
The article discusses NVIDIA's fVDB, a deep-learning framework designed to build spatial intelligence from real-world 3D data.
Ken Museth
6 min read
Has Summary
--
The article discusses NVIDIA Metropolis, a platform for real-time vision AI that streamlines deployment through microservices and workflows.
Monika Jhuria
11 min read
Has Summary
--
The article discusses how to build a zero-copy AI sensor processing pipeline using OpenCV within the NVIDIA Holoscan SDK.
Meiran Peng
7 min read
Includes Code
Has Summary
--
The article discusses the enhancements made in NVIDIA's cuDNN 9 library, focusing on the acceleration of Transformers through the implementation of Scaled Dot Product Attention (SDPA).
Matthew Nicely
11 min read
Includes Code
Has Summary
--
The article discusses how Snap's ML engineering team enhanced the apparel shopping experience using AI, specifically through the Screenshop service integrated into Snapchat.
Amr Elmeleegy
7 min read
Has Summary
--
The article provides a comprehensive guide on building a Retrieval-Augmented Generation (RAG) pipeline using NVIDIA AI LangChain AI Endpoints.
Amit Bleiweiss
13 min read
Includes Code
Has Summary
--
The article discusses NVIDIA AI Enterprise IGX, a software solution designed for mission-critical AI applications at the edge.
Suhas Hariharapura Sheshadri
5 min read
Has Summary
--
The article discusses the NVIDIA 6G Developer Program, which aims to accelerate the development of 6G technology by providing access to AI/ML tools, simulation environments, and software-defined ne...
Kuntal Chowdhury
9 min read
Has Summary
--
The article discusses NVIDIA cuTENSOR 2. 0, highlighting its applications, performance improvements, and usage from Python and Julia.
Paul Springer
8 min read
Includes Code
Has Summary
--
cuTENSOR 2. 0 is an advanced CUDA math library designed to accelerate tensor computations, offering optimized implementations for dense, multi-dimensional arrays.
Paul Springer
17 min read
Includes Code
Has Summary
--
The article discusses how to generate stunning images using Stable Diffusion XL on the NVIDIA AI Inference Platform, highlighting the challenges of deploying diffusion models at scale and how NVIDI...
Amr Elmeleegy
13 min read
Includes Code
Has Summary
--
The article discusses an innovative edge computing and video analytics solution developed to detect plastic bag contamination in waste collection trucks.
Umair Iqbal
8 min read
Has Summary
--
The article discusses how enterprises can build enterprise-grade AI applications using NVIDIA AI Software, focusing on the importance of optimized software for various stages of AI development.
Nirmal Kumar Juluru
5 min read
Has Summary
--
The article discusses the significance of robust scene text detection and recognition (STDR) in various applications, emphasizing the challenges faced in recognizing text from natural scenes.
Vishal Chavan
8 min read
Has Summary
--
The article discusses how to accelerate computer vision deployments using NVIDIA DeepStream and Edge Impulse, highlighting their integration for building and deploying AI-based applications.
Peter Ing
11 min read
Includes Code
Has Summary
--
The article discusses how NVIDIA TAO and Vision AI models can transform industrial defect detection, emphasizing the financial impact of defects in manufacturing.
Nirmal Kumar Juluru
10 min read
Includes Code
Has Summary
--
The article discusses the collaboration between NVIDIA and Microsoft to enhance enterprise generative AI application development using NVIDIA AI on Azure Machine Learning.
Abhishek Sawarkar
5 min read
Has Summary
--
NVIDIA's Differentiable Slang is a new shading language designed to unify real-time, inverse, and differentiable rendering, enabling seamless integration of machine learning with graphics programmi...
This article introduces Graph Neural Networks (GNNs) and how to utilize cuGraph-DGL, a GPU-accelerated library for graph computations.
Vibhu Jawa
7 min read
Includes Code
Has Summary
--
The article discusses the NVIDIA AI Workbench, a unified toolkit designed to simplify the development and deployment of scalable generative AI models.
Tyler Whitehouse
10 min read
Has Summary
--
The article discusses how organizations can streamline AI model training and deployment across various cloud platforms using NVIDIA's Cloud Native Stack and Run:ai.
Guy Salton
7 min read
Includes Code
Has Summary
--
The article discusses the integration of distributed deep learning with Apache Spark 3. 4, highlighting new built-in APIs for both distributed model training and inference.
Lee Yang
6 min read
Includes Code
Has Summary
--
The article discusses how to create high-quality computer vision applications using the Superb AI Suite and NVIDIA TAO Toolkit.
Tyler McKean
14 min read
Includes Code
Has Summary
--
This article discusses the use of time-series models, specifically autoregressive recursive neural networks and XGBoost, for predicting credit defaults.
Jiwei Liu
11 min read
Includes Code
Has Summary
--
The article discusses how NVIDIA AI Enterprise can be effectively utilized on Microsoft Azure Machine Learning to streamline the implementation of AI and machine learning solutions.
Michael Balint
7 min read
Has Summary
--
This article discusses the development of a high-fidelity multi-robot simulation environment using NVIDIA Isaac Sim, ROS, and Nimbus.
Yakir Ari
6 min read
Includes Code
Has Summary
--
The article discusses the training workflow and best practices for implementing sparsity in INT8 models using NVIDIA TensorRT.
Gwena Cunha Sergio
11 min read
Includes Code
Has Summary
--
The article discusses how to efficiently scale large language model (LLM) training across a large GPU cluster using the open-source frameworks Alpa and Ray.
Jiao Dong
14 min read
Includes Code
Has Summary
--
The article discusses the importance of automatic augmentation in deep learning, emphasizing its role in enhancing model accuracy by diversifying training datasets.
Kamil Tokarski
12 min read
Includes Code
Has Summary
--
The article discusses the growing demand for AI-based computer vision applications and the associated increase in compute costs.
Sagar Singh
10 min read
Has Summary
--
The article discusses the optimization of Kakao Brain's KoGPT large language model using NVIDIA FasterTransformer, highlighting the significant improvements in inference speed and performance.
Daemyung Jang
5 min read
Has Summary
--
The article discusses the integration of Dataiku and NVIDIA technologies for deep learning applications, particularly in image classification and topic modeling.
Shashank Gaur
9 min read
Includes Code
Has Summary
--
The article discusses the Bird@Edge project, an innovative system developed by researchers at the University of Marburg to identify bird species by sound using the NVIDIA Jetson Nano Developer Kit.
Jason Black
6 min read
Has Summary
--