#
Neural Networks Programming Tutorials & Engineering Articles
169 Neural Networks tutorials, guides, and engineering insights from NVIDIA, Uber, OpenAI, and more
Companies Using This
Neural Networks Articles & Tutorials
Filter:
The article discusses the limitations of current large language models (LLMs) in handling long contexts and introduces Test-Time Training with an end-to-end formulation (TTT-E2E) as a solution.
Yu Sun
6 min read
Has Summary
--
The article discusses the integration of AI Physics into Technology Computer-Aided Design (TCAD) simulations, highlighting its significance in semiconductor manufacturing.
Ram Cherukuri
7 min read
Has Summary
--
The article discusses the application of Graph Neural Networks (GNNs) in enhancing fraud detection within financial services.
Naim
10 min read
Includes Code
Has Summary
--
The article discusses the establishment of a large-scale learned retrieval system at Pinterest, focusing on the transition from heuristic-based methods to an embedding-based retrieval system.
Pinterest Engineering
7 min read
Has Summary
--
The article discusses the implementation of data-efficient knowledge distillation using NVIDIA NeMo-Aligner during supervised fine-tuning (SFT).
Anna Shors
5 min read
Has Summary
--
The article introduces tile-based programming in Warp 1. 5. 0, highlighting new Python primitives that enhance GPU programming efficiency.
Miles Macklin
13 min read
Includes Code
Has Summary
--
The NVIDIA Deep Learning Institute has launched the Accelerated Data Science Teaching Kit, aimed at educators to enhance data science education.
Joe Bungo
3 min read
Has Summary
--
The article discusses the nvmath-python library, which allows Python programmers to perform high-performance mathematical operations using NVIDIA's CUDA-X math libraries.
Szymon Karpiński
6 min read
Includes Code
Has Summary
--
The article explores the RecurrentGemma architecture, a hybrid model that combines gated linear recurrences with local sliding window attention, enhancing performance for long context prompts.
Ju-yeong Ji, Ravin Kumar
6 min read
Includes Code
Has Summary
--
This article discusses the Candidate Generation (CG) stage of LinkedIn's People You May Know (PYMK) recommendation system, detailing the various techniques used to generate relevant candidate pools...
Parag Agrawal
13 min read
Has Summary
--
The article discusses the application of Graph Neural Networks (GNNs) in optimizing the design and simulation of lattice structures in additive manufacturing.
Ayush Jain
6 min read
Has Summary
--
This article explores the optimization of memory and retrieval processes for large-scale Graph Neural Networks (GNNs) using WholeGraph, a feature of the RAPIDS cuGraph library.
Dongxu Yang
5 min read
Has Summary
--
The article discusses WholeGraph, a feature in the RAPIDS cuGraph library designed to optimize memory and retrieval for Graph Neural Networks (GNNs).
Dongxu Yang
9 min read
Includes Code
Has Summary
--
The article discusses the rapid adoption of federated learning (FL) and the new features introduced in NVIDIA FLARE 2. 4.
AWSAzureFederated LearningGPTGraph Neural NetworksgRPCHugging FaceMachine LearningNeural NetworksPyTorchXGBoost
Chester Chen
15 min read
Includes Code
Has Summary
--
The article discusses the evolution of ads conversion optimization models at Pinterest, highlighting the transition from Gradient Boosted Decision Trees (GBDT) to advanced Deep Neural Networks (DNN...
Pinterest Engineering
12 min read
Has Summary
--
The article discusses the intricacies of training Large Language Models (LLMs) using transformer networks, focusing on model architectures, attention mechanisms, and embedding techniques.
Attention MechanismBERTEmbeddingGPTLarge Language ModelsNeural NetworksRecurrent Neural NetworksSelf-AttentionTransformerTransformersV
Anjali Shah
14 min read
Has Summary
--
A Stanford University team is revolutionizing cardiovascular care through AI-driven simulations that provide patient-specific blood flow visualizations.
Harpreet Sethi
8 min read
Has Summary
--
The article discusses NVIDIA AI Enterprise 4. 0, a comprehensive solution designed to support enterprises in developing and deploying generative AI applications.
AWSAzureGenerative AIGoogle CloudGraph Neural NetworksKubernetesMachine LearningNeural NetworksPythonRetrieval Augmented Generation
Phoebe Lee
4 min read
Has Summary
--
This article introduces Graph Neural Networks (GNNs) and how to utilize cuGraph-DGL, a GPU-accelerated library for graph computations.
Vibhu Jawa
7 min read
Includes Code
Has Summary
--
The article discusses the design of deep neural networks (DNNs) that can process the weights of other DNNs, focusing on architectures that leverage the symmetries of weight spaces.
Haggai Maron
14 min read
Has Summary
--
The article discusses the structured sparsity feature in the NVIDIA Ampere architecture, particularly focusing on its implementation in deep learning and applications in search engines.
Hongxiao Bai
12 min read
Includes Code
Has Summary
--
This article discusses the use of time-series models, specifically autoregressive recursive neural networks and XGBoost, for predicting credit defaults.
Jiwei Liu
11 min read
Includes Code
Has Summary
--
The article discusses NVIDIA PhysicsNeMo, a framework for developing physics-informed machine learning models, with a focus on the latest update that introduces support for Graph Neural Networks (G...
Bhoomi Gadhia
5 min read
Has Summary
--
The article discusses the training workflow and best practices for implementing sparsity in INT8 models using NVIDIA TensorRT.
Gwena Cunha Sergio
11 min read
Includes Code
Has Summary
--
This article discusses a Machine Learning (ML) based approach to proactively prevent advertiser churn at Pinterest.
Pinterest Engineering
8 min read
Has Summary
--
The article discusses the NVIDIA GTC 2023 conference, highlighting its extensive training opportunities in AI, HPC, and the metaverse.
Ann Sheridan
5 min read
Has Summary
--
The article discusses five methods for database obfuscation, emphasizing the importance of using realistic data for performance testing in analytical databases like ClickHouse.
Alexey Milovidov
27 min read
Includes Code
Has Summary
--
The article discusses the benchmarking of deep neural networks, specifically Long Short-Term Memory (LSTM) models, for low-latency trading and rapid backtesting using NVIDIA GPUs.
Martin Marciniszyn Mehringer
7 min read
Has Summary
--
The article discusses the application of causal machine learning at Netflix to derive creative insights for promotional artwork.
Netflix Technology Blog
15 min read
Has Summary
--
The article discusses the development of LinkedIn's Skills Graph, which aims to create a skills-first job market by mapping the relationships between skills, people, and organizations.
Sofus Macskássy
8 min read
Has Summary
--
The NVIDIA Grace Hopper Superchip Architecture represents a significant advancement in heterogeneous computing, combining NVIDIA Grace CPUs and Hopper GPUs to optimize performance for AI and high-p...
Deep LearningEmbeddingFortranGPTGraph Neural NetworksNatural Language ProcessingNeural NetworksPythonRenderTransformer
Jonathon Evans
15 min read
Has Summary
--
The article discusses how Graph Neural Networks (GNNs) and NVIDIA GPUs can optimize fraud detection in financial services.
Ashish Sardana
21 min read
Includes Code
Has Summary
--
NVIDIA has announced significant updates to its AI software suite, including JAX, NVIDIA CV-CUDA, and NVIDIA RAPIDS, aimed at accelerating AI research, computer vision, and data science.
ApacheApache SparkComputer VisionDaskDeep LearningDGLGoogle CloudGPTJAXKubernetesNeural NetworksNumPyPyTorchPyTorch GeometricSQL
Siddharth Sharma
7 min read
Has Summary
--
The article discusses hands-on training opportunities provided by the NVIDIA Deep Learning Institute (DLI) at the upcoming GPU Technical Conference (GTC).
Ann Sheridan
5 min read
Has Summary
--
This article provides a comprehensive guide on deploying large transformer models like GPT-J and T5 using NVIDIA's Triton Inference Server and FasterTransformer library.
Denis Timonin
15 min read
Includes Code
Has Summary
--
NVIDIA has announced significant updates to the NeMo framework, enhancing the training speed of large language models (LLMs) by up to 30%.
Markel Ausin
6 min read
Has Summary
--
The article discusses how to accelerate GPU applications using NVIDIA Math Libraries, highlighting three main approaches: compiler directives, programming languages, and preprogrammed libraries.
Aastha Jhunjhunwala
12 min read
Includes Code
Has Summary
--
The article introduces Transformers4Rec, a library from NVIDIA Merlin designed for building session-based recommendation systems using state-of-the-art Transformer architectures.
Ronay AK
7 min read
Includes Code
Has Summary
--
The article discusses the application of graph machine learning at Airbnb, highlighting how graph structures can enhance machine learning models by providing contextual information about users.
Devin Soni
12 min read
Has Summary
--
The article discusses various techniques for training large neural networks, focusing on the challenges and strategies involved in parallelizing model training across multiple GPUs.
Lilian Weng
9 min read
Has Summary
--
NVIDIA experts will present advancements in robotics, Graph Neural Networks (GNNs), and Natural Language Processing (NLP) at the WeAreDevelopers World Congress in Berlin.
Marjut Dieringer
4 min read
Has Summary
--
The article discusses the Performance-Adaptive Sampling Strategy (PASS) for Graph Neural Networks (GNNs) and announces its open-source release.
Jaewon Yang
4 min read
Has Summary
--
This article serves as a guide for Data Scientists to understand the fundamental concepts of gradient descent and backpropagation algorithms, which are essential for training Artificial Neural Netw...
Richmond Alake
9 min read
Has Summary
--
The article discusses the completion of member knowledge graphs using Graph Neural Networks (GNNs), specifically introducing a novel model called Entity-BERT.
BERTGraph Neural NetworksMachine LearningNatural Language ProcessingNeural NetworksSolidTransformerTransformers
Jaewon Yang
7 min read
Has Summary
--
The article provides a comprehensive overview of NVIDIA's Nsight Developer Tools, which are designed to optimize computational applications across various architectures.
Chaitrali Joshi
6 min read
Includes Code
Has Summary
--
NVIDIA has launched PhysicsNeMo, a framework for training neural networks that integrates governing physics equations with observed or simulated data, aimed at enhancing the development of digital ...
Jay Gould
2 min read
Has Summary
--
NVIDIA has introduced GPU-accelerated Deep Graph Library (DGL) containers to assist developers, researchers, and data scientists in working with Graph Neural Networks (GNN) on large heterogeneous g...
Gordana Neskovic
3 min read
Has Summary
--
NVIDIA has released an updated Edge AI and Robotics Teaching Kit aimed at university educators, developed in collaboration with experts from the University of Oxford and the University of Maryland,...
Jason Black
3 min read
Has Summary
--
The article discusses NVIDIA PhysicsNeMo, an AI toolkit that leverages physics-informed neural networks (PINNs) to enhance product development by solving complex nonlinear physics problems.
Michael Eidell
9 min read
Has Summary
--
The article discusses the upcoming NVIDIA DRIVE Developer Day at NVIDIA GTC, where developers can learn about the latest features in autonomous vehicle technology from NVIDIA experts.
Katie Washabaugh
1 min read
Has Summary
--