How NVIDIA Uses Neural Networks

96 engineering articles about Neural Networks from NVIDIA's engineering team

Other NVIDIA Technologies

Python(740)PyTorch(566)Deep Learning(505)TensorFlow(444)Docker(292)Kubernetes(251)

Other Companies Using Neural Networks

Articles

Filter:

NVIDIA

Advanced

Reimagining LLM Memory: Using Context as Training Data Unlocks Models That Learn at Test-Time

The article discusses the limitations of current large language models (LLMs) in handling long contexts and introduces Test-Time Training with an end-to-end formulation (TTT-E2E) as a solution.

Neural NetworksRecurrent Neural NetworksTransformerTransformers

Yu Sun

6 min read

Has Summary

NVIDIA

Intermediate

Using AI Physics for Technology Computer-Aided Design Simulations

The article discusses the integration of AI Physics into Technology Computer-Aided Design (TCAD) simulations, highlighting its significance in semiconductor manufacturing.

Graph Neural NetworksHugging FaceNeural NetworksPythonPyTorch

Ram Cherukuri

7 min read

Has Summary

NVIDIA

Intermediate

Supercharging Fraud Detection in Financial Services with Graph Neural Networks (Updated)

The article discusses the application of Graph Neural Networks (GNNs) in enhancing fraud detection within financial services.

ApacheApache SparkDockerGraph Neural NetworksJSONKubernetesNeural NetworksXGBoost

Naim

10 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Data-Efficient Knowledge Distillation for Supervised Fine-Tuning with NVIDIA NeMo-Aligner

The article discusses the implementation of data-efficient knowledge distillation using NVIDIA NeMo-Aligner during supervised fine-tuning (SFT).

CachingLarge Language ModelsNeural Networks

Anna Shors

5 min read

Has Summary

NVIDIA

Advanced

Introducing Tile-Based Programming in Warp 1.5.0

The article introduces tile-based programming in Warp 1. 5. 0, highlighting new Python primitives that enhance GPU programming efficiency.

Neural NetworksNumPyPythonPyTorchWarp

Miles Macklin

13 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Deep Learning Institute Releases New Data Science Teaching Kit for Educators

The NVIDIA Deep Learning Institute has launched the Accelerated Data Science Teaching Kit, aimed at educators to enhance data science education.

DaskDeep LearningMachine LearningNetworkXNeural NetworksPolarsPython

Joe Bungo

3 min read

Has Summary

NVIDIA

Intermediate

Fusing Epilog Operations with Matrix Multiplication Using nvmath-python

The article discusses the nvmath-python library, which allows Python programmers to perform high-performance mathematical operations using NVIDIA's CUDA-X math libraries.

Neural NetworksPythonPyTorch

Szymon Karpiński

6 min read

Includes Code

Has Summary

NVIDIA

Advanced

Using Graph Neural Networks for Additive Manufacturing

The article discusses the application of Graph Neural Networks (GNNs) in optimizing the design and simulation of lattice structures in additive manufacturing.

ApacheGraph Neural NetworksNeural Networks

Ayush Jain

6 min read

Has Summary

NVIDIA

Advanced

Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 2

This article explores the optimization of memory and retrieval processes for large-scale Graph Neural Networks (GNNs) using WholeGraph, a feature of the RAPIDS cuGraph library.

EmbeddingGraph Neural NetworksNeural Networks

Dongxu Yang

5 min read

Has Summary

NVIDIA

Intermediate

Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 1

The article discusses WholeGraph, a feature in the RAPIDS cuGraph library designed to optimize memory and retrieval for Graph Neural Networks (GNNs).

DGLEmbeddingGraph Neural NetworksNeural NetworksNumPyPythonPyTorch

Dongxu Yang

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

Turning Machine Learning to Federated Learning in Minutes with NVIDIA FLARE 2.4

The article discusses the rapid adoption of federated learning (FL) and the new features introduced in NVIDIA FLARE 2. 4.

AWSAzureFederated LearningGPTGraph Neural NetworksgRPCHugging FaceMachine LearningNeural NetworksPyTorchXGBoost

Chester Chen

15 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Mastering LLM Techniques: Training

The article discusses the intricacies of training Large Language Models (LLMs) using transformer networks, focusing on model architectures, attention mechanisms, and embedding techniques.

Attention MechanismBERTEmbeddingGPTLarge Language ModelsNeural NetworksRecurrent Neural NetworksSelf-AttentionTransformerTransformersV

Anjali Shah

14 min read

Has Summary

NVIDIA

Advanced

Enabling Greater Patient-Specific Cardiovascular Care with AI Surrogates

A Stanford University team is revolutionizing cardiovascular care through AI-driven simulations that provide patient-specific blood flow visualizations.

ApacheGraph Neural NetworksNeural Networks

Harpreet Sethi

8 min read

Has Summary

NVIDIA

Intermediate

Power Your Business with NVIDIA AI Enterprise 4.0 for Production-Ready Generative AI

The article discusses NVIDIA AI Enterprise 4. 0, a comprehensive solution designed to support enterprises in developing and deploying generative AI applications.

AWSAzureGenerative AIGoogle CloudGraph Neural NetworksKubernetesMachine LearningNeural NetworksPythonRetrieval Augmented Generation

Phoebe Lee

4 min read

Has Summary

NVIDIA

Intermediate

Introduction to Graph Neural Networks with NVIDIA cuGraph-DGL

This article introduces Graph Neural Networks (GNNs) and how to utilize cuGraph-DGL, a GPU-accelerated library for graph computations.

DGLGraph Neural NetworksNeural NetworksPythonPyTorchTensorFlow

Vibhu Jawa

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Designing Deep Networks to Process Other Deep Networks

The article discusses the design of deep neural networks (DNNs) that can process the weights of other DNNs, focusing on architectures that leverage the symmetries of weight spaces.

Deep LearningGraph Neural NetworksNeural NetworksTransformerTransformersV

Haggai Maron

14 min read

Has Summary

NVIDIA

Intermediate

Structured Sparsity in the NVIDIA Ampere Architecture and Applications in Search Engines

The article discusses the structured sparsity feature in the NVIDIA Ampere architecture, particularly focusing on its implementation in deep learning and applications in search engines.

BERTMachine LearningNeural NetworksPythonSelf-AttentionTransformers

Hongxiao Bai

12 min read

Includes Code

Has Summary

NVIDIA

Beginner

Predicting Credit Defaults Using Time-Series Models with Recursive Neural Networks and XGBoost

This article discusses the use of time-series models, specifically autoregressive recursive neural networks and XGBoost, for predicting credit defaults.

LightGBMNeural NetworksPyTorchscikit-learnTensorFlowXGBoost

Jiwei Liu

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Develop Physics-Informed Machine Learning Models with Graph Neural Networks

The article discusses NVIDIA PhysicsNeMo, a framework for developing physics-informed machine learning models, with a focus on the latest update that introduces support for Graph Neural Networks (G...

ApacheDeep LearningGraph Neural NetworksMachine LearningNeural NetworksPyTorch

Bhoomi Gadhia

5 min read

Has Summary

NVIDIA

Intermediate

Sparsity in INT8: Training Workflow and Best Practices for NVIDIA TensorRT Acceleration

The article discusses the training workflow and best practices for implementing sparsity in INT8 models using NVIDIA TensorRT.

Deep LearningNeural NetworksPythonPyTorchResNetTensorFlowtorchvision

Gwena Cunha Sergio

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Power-Up Your Skills and Credentials at NVIDIA GTC 2023

The article discusses the NVIDIA GTC 2023 conference, highlighting its extensive training opportunities in AI, HPC, and the metaverse.

Computer VisionDeep LearningNatural Language ProcessingNeural NetworksPythonTransformer

Ann Sheridan

5 min read

Has Summary

NVIDIA

Advanced

Benchmarking Deep Neural Networks for Low-Latency Trading and Rapid Backtesting on NVIDIA GPUs

The article discusses the benchmarking of deep neural networks, specifically Long Short-Term Memory (LSTM) models, for low-latency trading and rapid backtesting using NVIDIA GPUs.

LSTMNeural NetworksPyTorchTensorFlow

Martin Marciniszyn Mehringer

7 min read

Has Summary

NVIDIA

Advanced

NVIDIA Grace Hopper Superchip Architecture In-Depth

The NVIDIA Grace Hopper Superchip Architecture represents a significant advancement in heterogeneous computing, combining NVIDIA Grace CPUs and Hopper GPUs to optimize performance for AI and high-p...

Deep LearningEmbeddingFortranGPTGraph Neural NetworksNatural Language ProcessingNeural NetworksPythonRenderTransformer

Jonathon Evans

15 min read

Has Summary

NVIDIA

Intermediate

Optimizing Fraud Detection in Financial Services with Graph Neural Networks and NVIDIA GPUs

The article discusses how Graph Neural Networks (GNNs) and NVIDIA GPUs can optimize fraud detection in financial services.

AWSDaskDGLGraph Neural NetworksNeural NetworksPythonPyTorchPyTorch GeometricXGBoost

Ashish Sardana

21 min read

Includes Code

Has Summary

NVIDIA

Advanced

New SDKs Accelerating AI Research, Computer Vision, Data Science, and More

NVIDIA has announced significant updates to its AI software suite, including JAX, NVIDIA CV-CUDA, and NVIDIA RAPIDS, aimed at accelerating AI research, computer vision, and data science.

ApacheApache SparkComputer VisionDaskDeep LearningDGLGoogle CloudGPTJAXKubernetesNeural NetworksNumPyPyTorchPyTorch GeometricSQL

Siddharth Sharma

7 min read

Has Summary

NVIDIA

Intermediate

Get Hands-on Training from NVIDIA Experts at GTC

The article discusses hands-on training opportunities provided by the NVIDIA Deep Learning Institute (DLI) at the upcoming GPU Technical Conference (GTC).

Deep LearningNeural Networks

Ann Sheridan

5 min read

Has Summary

NVIDIA

Advanced

Deploying GPT-J and T5 with NVIDIA Triton Inference Server

This article provides a comprehensive guide on deploying large transformer models like GPT-J and T5 using NVIDIA's Triton Inference Server and FasterTransformer library.

BERTDockerGPTHugging FaceNeural NetworksPythonPyTorchT5TensorFlowTransformer

Denis Timonin

15 min read

Includes Code

Has Summary

NVIDIA

Advanced

NVIDIA AI Platform Delivers Big Gains for Large Language Models

NVIDIA has announced significant updates to the NeMo framework, enhancing the training speed of large language models (LLMs) by up to 30%.

AzureDeep LearningGPTHugging FaceLarge Language ModelsNeural NetworksTransformerV

Markel Ausin

6 min read

Has Summary

NVIDIA

Advanced

Accelerating GPU Applications with NVIDIA Math Libraries

The article discusses how to accelerate GPU applications using NVIDIA Math Libraries, highlighting three main approaches: compiler directives, programming languages, and preprogrammed libraries.

Deep LearningFortranNeural NetworksPython

Aastha Jhunjhunwala

12 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Transformers4Rec: Building Session-Based Recommendations with an NVIDIA Merlin Library

The article introduces Transformers4Rec, a library from NVIDIA Merlin designed for building session-based recommendation systems using state-of-the-art Transformer architectures.

Hugging FaceKerasNeural NetworksPyTorchRecurrent Neural NetworksTensorFlowTransformerTransformers

Ronay AK

7 min read

Includes Code

Has Summary

NVIDIA

Beginner

NVIDIA Experts Explore Robotics, GNNs, and NLP Advancements at the WeAreDevelopers World Congress

NVIDIA experts will present advancements in robotics, Graph Neural Networks (GNNs), and Natural Language Processing (NLP) at the WeAreDevelopers World Congress in Berlin.

Deep LearningGPTGraph Neural NetworksJavaScriptNatural Language ProcessingNeural NetworksPHPPython

Marjut Dieringer

4 min read

Has Summary

NVIDIA

Advanced

A Data Scientist’s Guide to Gradient Descent and Backpropagation Algorithms

This article serves as a guide for Data Scientists to understand the fundamental concepts of gradient descent and backpropagation algorithms, which are essential for training Artificial Neural Netw...

Deep LearningNeural NetworksPyTorchscikit-learnTensorFlow

Richmond Alake

9 min read

Has Summary

NVIDIA

Intermediate

NVIDIA GTC: A Complete Overview of Nsight Developer Tools

The article provides a comprehensive overview of NVIDIA's Nsight Developer Tools, which are designed to optimize computational applications across various architectures.

Deep LearningHTMLMachine LearningNeural NetworksPyTorch

Chaitrali Joshi

6 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Announces PhysicsNeMo: A Framework for Developing Physics ML Models for Digital Twins

NVIDIA has launched PhysicsNeMo, a framework for training neural networks that integrates governing physics equations with observed or simulated data, aimed at enhancing the development of digital ...

AWSNeural NetworksPythonPyTorchTensorFlow

Jay Gould

2 min read

Has Summary

NVIDIA

Intermediate

CUDA-X Accelerated DGL Containers for Large Graph Neural Networks

NVIDIA has introduced GPU-accelerated Deep Graph Library (DGL) containers to assist developers, researchers, and data scientists in working with Graph Neural Networks (GNN) on large heterogeneous g...

DGLGraph Neural NetworksMachine LearningNeural NetworksPyTorchTransformer

Gordana Neskovic

3 min read

Has Summary

NVIDIA

Intermediate

NVIDIA Launches Updated Teaching Kit for Edge AI and Robotics Educators

NVIDIA has released an updated Edge AI and Robotics Teaching Kit aimed at university educators, developed in collaboration with experts from the University of Oxford and the University of Maryland,...

Deep LearningNeural NetworksPyTorchReinforcement LearningSpring

Jason Black

3 min read

Has Summary

NVIDIA

Intermediate

Accelerating Product Development with Physics-Informed Neural Networks and NVIDIA PhysicsNeMo

The article discusses NVIDIA PhysicsNeMo, an AI toolkit that leverages physics-informed neural networks (PINNs) to enhance product development by solving complex nonlinear physics problems.

Neural NetworksPythonTensorBoard

Michael Eidell

9 min read

Has Summary

NVIDIA

Intermediate

Learn from NVIDIA Experts at GTC DRIVE Developer Day

The article discusses the upcoming NVIDIA DRIVE Developer Day at NVIDIA GTC, where developers can learn about the latest features in autonomous vehicle technology from NVIDIA experts.

Neural Networks

Katie Washabaugh

1 min read

Has Summary

NVIDIA

Intermediate

Using Hybrid Physics-Informed Neural Networks for Digital Twins in Prognosis and Health Management

The article discusses the use of NVIDIA PhysicsNeMo, a physics-informed neural network toolkit, for creating digital twins in prognosis and health management.

Neural NetworksPythonTensorFlow

Felipe Viana

10 min read

Has Summary

NVIDIA

Advanced

Accelerating Inference with Sparsity Using the NVIDIA Ampere Architecture and NVIDIA TensorRT

This article discusses how the NVIDIA Ampere Architecture and TensorRT 8. 0 leverage sparsity to accelerate neural network inference.

BERTDeep LearningDockerNeural NetworksPythonPyTorchResNettorchvisionTransformer

Jeff Pool

8 min read

Includes Code

Has Summary

NVIDIA

Advanced

Using Neural Networks for Your Recommender System

This article discusses the application of deep learning techniques in recommender systems, highlighting the advantages of using neural networks over traditional methods.

Deep LearningEmbeddingGRULSTMNeural NetworksPyTorchTensorFlowTransformer

Benedikt Schifferer

8 min read

Has Summary

NVIDIA

Advanced

Fighting Disease-Carrying Mosquitoes with Neural Networks

A new study leverages deep learning to identify disease-carrying tiger mosquitoes with high accuracy, utilizing images submitted by citizen scientists through the Mosquito Alert app.

Artificial IntelligenceNeural NetworksPyTorch

Michelle Horton

3 min read

Has Summary

NVIDIA

Advanced

Applying Natural Language Processing Across the World’s Languages

The article discusses the advancements and challenges in applying Natural Language Processing (NLP) across various languages, emphasizing the need for large-scale models and the engineering efforts...

BERTDeep LearningGPTNatural Language ProcessingNeural NetworksT5TransformersYAML

Adam Grzywaczewski

14 min read

Has Summary

NVIDIA

Advanced

NVIDIA PhysicsNeMo v21.06 Released for General Availability

NVIDIA PhysicsNeMo v21. 06 has been released for general availability, enhancing physics simulations through a Physics-Informed Neural Networks (PINNs) toolkit.

Deep LearningNeural Networks

Rekha Mukund

6 min read

Has Summary

NVIDIA

Intermediate

Accelerating Conversational AI Research with New Cutting-Edge Neural Networks and Features from NeMo 1.0

The article discusses the NVIDIA NeMo toolkit, a conversational AI framework designed to enhance research in automatic speech recognition (ASR), natural language processing (NLP), and text-to-speec...

BERTHugging FaceNeural NetworksPyTorch

Oleksii Kuchaiev

8 min read

Includes Code

Has Summary

NVIDIA

Beginner

Take a Deep Dive into Ray Tracing, Machine Learning and Neural Networks Through SIGGRAPH Frontiers

The article discusses the upcoming SIGGRAPH Frontiers webinars starting May 24, 2021, focusing on ray tracing, machine learning, and neural networks.

Machine LearningNeural Networks

Ike Nnoli

2 min read

Has Summary

NVIDIA

Intermediate

How to Build a Deep Learning Powered Recommender System, Part 2

This article is the second part of a series on building deep learning-powered recommender systems, focusing on the application of deep learning techniques to enhance recommendation quality.

ApacheApache ArrowBERTConvolutional Neural NetworksDeep LearningGRULSTMNeural NetworksPyTorchscikit-learnSpringTensorFlowTransformerTransformersVariational Autoencoders

Carol McDonald

14 min read

Has Summary

NVIDIA

Intermediate

How to Accelerate Signal Processing in Python

This article introduces cuSignal, a library within the RAPIDS ecosystem designed for signal processing using NVIDIA GPUs, which significantly accelerates computations compared to traditional method...

Convolutional Neural NetworksDeep LearningMachine LearningNeural NetworksPythonscikit-learnSQL

Tom Drabas

7 min read

Includes Code

Has Summary

NVIDIA

Advanced

Meet the Researcher: Lokman Abbas Turki, Applying HPC to Computationally Complex Mathematical Finance Problems

The article features Lokman Abbas Turki, a researcher at Sorbonne University, who applies high performance computing (HPC) to complex mathematical finance problems and cryptography.

ClaudeNeural NetworksPython

Brad Nemire

7 min read

Has Summary

NVIDIA

Intermediate

Accelerating AI Training with NVIDIA TF32 Tensor Cores

The article discusses the introduction of TensorFloat32 (TF32) in NVIDIA's Ampere GPU architecture, which accelerates AI training by providing significant performance improvements for single-precis...

Computer VisionDeep LearningNeural NetworksPyTorchTensorFlowTransformer

Dusan Stosic

9 min read

Includes Code

Has Summary