How NVIDIA Uses Neural Networks
96 engineering articles about Neural Networks from NVIDIA's engineering team
Other NVIDIA Technologies
Other Companies Using Neural Networks
Articles
Filter:
The article discusses the limitations of current large language models (LLMs) in handling long contexts and introduces Test-Time Training with an end-to-end formulation (TTT-E2E) as a solution.
Yu Sun
6 min read
Has Summary
--
The article discusses the integration of AI Physics into Technology Computer-Aided Design (TCAD) simulations, highlighting its significance in semiconductor manufacturing.
Ram Cherukuri
7 min read
Has Summary
--
The article discusses the application of Graph Neural Networks (GNNs) in enhancing fraud detection within financial services.
Naim
10 min read
Includes Code
Has Summary
--
The article discusses the implementation of data-efficient knowledge distillation using NVIDIA NeMo-Aligner during supervised fine-tuning (SFT).
Anna Shors
5 min read
Has Summary
--
The article introduces tile-based programming in Warp 1. 5. 0, highlighting new Python primitives that enhance GPU programming efficiency.
Miles Macklin
13 min read
Includes Code
Has Summary
--
The NVIDIA Deep Learning Institute has launched the Accelerated Data Science Teaching Kit, aimed at educators to enhance data science education.
Joe Bungo
3 min read
Has Summary
--
The article discusses the nvmath-python library, which allows Python programmers to perform high-performance mathematical operations using NVIDIA's CUDA-X math libraries.
Szymon Karpiński
6 min read
Includes Code
Has Summary
--
The article discusses the application of Graph Neural Networks (GNNs) in optimizing the design and simulation of lattice structures in additive manufacturing.
Ayush Jain
6 min read
Has Summary
--
This article explores the optimization of memory and retrieval processes for large-scale Graph Neural Networks (GNNs) using WholeGraph, a feature of the RAPIDS cuGraph library.
Dongxu Yang
5 min read
Has Summary
--
The article discusses WholeGraph, a feature in the RAPIDS cuGraph library designed to optimize memory and retrieval for Graph Neural Networks (GNNs).
Dongxu Yang
9 min read
Includes Code
Has Summary
--
The article discusses the rapid adoption of federated learning (FL) and the new features introduced in NVIDIA FLARE 2. 4.
AWSAzureFederated LearningGPTGraph Neural NetworksgRPCHugging FaceMachine LearningNeural NetworksPyTorchXGBoost
Chester Chen
15 min read
Includes Code
Has Summary
--
The article discusses the intricacies of training Large Language Models (LLMs) using transformer networks, focusing on model architectures, attention mechanisms, and embedding techniques.
Attention MechanismBERTEmbeddingGPTLarge Language ModelsNeural NetworksRecurrent Neural NetworksSelf-AttentionTransformerTransformersV
Anjali Shah
14 min read
Has Summary
--
A Stanford University team is revolutionizing cardiovascular care through AI-driven simulations that provide patient-specific blood flow visualizations.
Harpreet Sethi
8 min read
Has Summary
--
The article discusses NVIDIA AI Enterprise 4. 0, a comprehensive solution designed to support enterprises in developing and deploying generative AI applications.
AWSAzureGenerative AIGoogle CloudGraph Neural NetworksKubernetesMachine LearningNeural NetworksPythonRetrieval Augmented Generation
Phoebe Lee
4 min read
Has Summary
--
This article introduces Graph Neural Networks (GNNs) and how to utilize cuGraph-DGL, a GPU-accelerated library for graph computations.
Vibhu Jawa
7 min read
Includes Code
Has Summary
--
The article discusses the design of deep neural networks (DNNs) that can process the weights of other DNNs, focusing on architectures that leverage the symmetries of weight spaces.
Haggai Maron
14 min read
Has Summary
--
The article discusses the structured sparsity feature in the NVIDIA Ampere architecture, particularly focusing on its implementation in deep learning and applications in search engines.
Hongxiao Bai
12 min read
Includes Code
Has Summary
--
This article discusses the use of time-series models, specifically autoregressive recursive neural networks and XGBoost, for predicting credit defaults.
Jiwei Liu
11 min read
Includes Code
Has Summary
--
The article discusses NVIDIA PhysicsNeMo, a framework for developing physics-informed machine learning models, with a focus on the latest update that introduces support for Graph Neural Networks (G...
Bhoomi Gadhia
5 min read
Has Summary
--
The article discusses the training workflow and best practices for implementing sparsity in INT8 models using NVIDIA TensorRT.
Gwena Cunha Sergio
11 min read
Includes Code
Has Summary
--
The article discusses the NVIDIA GTC 2023 conference, highlighting its extensive training opportunities in AI, HPC, and the metaverse.
Ann Sheridan
5 min read
Has Summary
--
The article discusses the benchmarking of deep neural networks, specifically Long Short-Term Memory (LSTM) models, for low-latency trading and rapid backtesting using NVIDIA GPUs.
Martin Marciniszyn Mehringer
7 min read
Has Summary
--
The NVIDIA Grace Hopper Superchip Architecture represents a significant advancement in heterogeneous computing, combining NVIDIA Grace CPUs and Hopper GPUs to optimize performance for AI and high-p...
Deep LearningEmbeddingFortranGPTGraph Neural NetworksNatural Language ProcessingNeural NetworksPythonRenderTransformer
Jonathon Evans
15 min read
Has Summary
--
The article discusses how Graph Neural Networks (GNNs) and NVIDIA GPUs can optimize fraud detection in financial services.
Ashish Sardana
21 min read
Includes Code
Has Summary
--
NVIDIA has announced significant updates to its AI software suite, including JAX, NVIDIA CV-CUDA, and NVIDIA RAPIDS, aimed at accelerating AI research, computer vision, and data science.
ApacheApache SparkComputer VisionDaskDeep LearningDGLGoogle CloudGPTJAXKubernetesNeural NetworksNumPyPyTorchPyTorch GeometricSQL
Siddharth Sharma
7 min read
Has Summary
--
The article discusses hands-on training opportunities provided by the NVIDIA Deep Learning Institute (DLI) at the upcoming GPU Technical Conference (GTC).
Ann Sheridan
5 min read
Has Summary
--
This article provides a comprehensive guide on deploying large transformer models like GPT-J and T5 using NVIDIA's Triton Inference Server and FasterTransformer library.
Denis Timonin
15 min read
Includes Code
Has Summary
--
NVIDIA has announced significant updates to the NeMo framework, enhancing the training speed of large language models (LLMs) by up to 30%.
Markel Ausin
6 min read
Has Summary
--
The article discusses how to accelerate GPU applications using NVIDIA Math Libraries, highlighting three main approaches: compiler directives, programming languages, and preprogrammed libraries.
Aastha Jhunjhunwala
12 min read
Includes Code
Has Summary
--
The article introduces Transformers4Rec, a library from NVIDIA Merlin designed for building session-based recommendation systems using state-of-the-art Transformer architectures.
Ronay AK
7 min read
Includes Code
Has Summary
--
NVIDIA experts will present advancements in robotics, Graph Neural Networks (GNNs), and Natural Language Processing (NLP) at the WeAreDevelopers World Congress in Berlin.
Marjut Dieringer
4 min read
Has Summary
--
This article serves as a guide for Data Scientists to understand the fundamental concepts of gradient descent and backpropagation algorithms, which are essential for training Artificial Neural Netw...
Richmond Alake
9 min read
Has Summary
--
The article provides a comprehensive overview of NVIDIA's Nsight Developer Tools, which are designed to optimize computational applications across various architectures.
Chaitrali Joshi
6 min read
Includes Code
Has Summary
--
NVIDIA has launched PhysicsNeMo, a framework for training neural networks that integrates governing physics equations with observed or simulated data, aimed at enhancing the development of digital ...
Jay Gould
2 min read
Has Summary
--
NVIDIA has introduced GPU-accelerated Deep Graph Library (DGL) containers to assist developers, researchers, and data scientists in working with Graph Neural Networks (GNN) on large heterogeneous g...
Gordana Neskovic
3 min read
Has Summary
--
NVIDIA has released an updated Edge AI and Robotics Teaching Kit aimed at university educators, developed in collaboration with experts from the University of Oxford and the University of Maryland,...
Jason Black
3 min read
Has Summary
--
The article discusses NVIDIA PhysicsNeMo, an AI toolkit that leverages physics-informed neural networks (PINNs) to enhance product development by solving complex nonlinear physics problems.
Michael Eidell
9 min read
Has Summary
--
The article discusses the upcoming NVIDIA DRIVE Developer Day at NVIDIA GTC, where developers can learn about the latest features in autonomous vehicle technology from NVIDIA experts.
Katie Washabaugh
1 min read
Has Summary
--
The article discusses the use of NVIDIA PhysicsNeMo, a physics-informed neural network toolkit, for creating digital twins in prognosis and health management.
Felipe Viana
10 min read
Has Summary
--
This article discusses how the NVIDIA Ampere Architecture and TensorRT 8. 0 leverage sparsity to accelerate neural network inference.
Jeff Pool
8 min read
Includes Code
Has Summary
--
This article discusses the application of deep learning techniques in recommender systems, highlighting the advantages of using neural networks over traditional methods.
Benedikt Schifferer
8 min read
Has Summary
--
A new study leverages deep learning to identify disease-carrying tiger mosquitoes with high accuracy, utilizing images submitted by citizen scientists through the Mosquito Alert app.
Michelle Horton
3 min read
Has Summary
--
The article discusses the advancements and challenges in applying Natural Language Processing (NLP) across various languages, emphasizing the need for large-scale models and the engineering efforts...
Adam Grzywaczewski
14 min read
Has Summary
--
NVIDIA PhysicsNeMo v21. 06 has been released for general availability, enhancing physics simulations through a Physics-Informed Neural Networks (PINNs) toolkit.
Rekha Mukund
6 min read
Has Summary
--
The article discusses the NVIDIA NeMo toolkit, a conversational AI framework designed to enhance research in automatic speech recognition (ASR), natural language processing (NLP), and text-to-speec...
Oleksii Kuchaiev
8 min read
Includes Code
Has Summary
--
The article discusses the upcoming SIGGRAPH Frontiers webinars starting May 24, 2021, focusing on ray tracing, machine learning, and neural networks.
Ike Nnoli
2 min read
Has Summary
--
This article is the second part of a series on building deep learning-powered recommender systems, focusing on the application of deep learning techniques to enhance recommendation quality.
This article introduces cuSignal, a library within the RAPIDS ecosystem designed for signal processing using NVIDIA GPUs, which significantly accelerates computations compared to traditional method...
Tom Drabas
7 min read
Includes Code
Has Summary
--
The article features Lokman Abbas Turki, a researcher at Sorbonne University, who applies high performance computing (HPC) to complex mathematical finance problems and cryptography.
Brad Nemire
7 min read
Has Summary
--
The article discusses the introduction of TensorFloat32 (TF32) in NVIDIA's Ampere GPU architecture, which accelerates AI training by providing significant performance improvements for single-precis...
Dusan Stosic
9 min read
Includes Code
Has Summary
--