Neural Networks Programming Tutorials & Engineering Articles

169 Neural Networks tutorials, guides, and engineering insights from NVIDIA, Uber, OpenAI, and more

Companies Using This

NVIDIA (96)

Neural Networks Articles & Tutorials

NVIDIA

Advanced

Reimagining LLM Memory: Using Context as Training Data Unlocks Models That Learn at Test-Time

The article discusses the limitations of current large language models (LLMs) in handling long contexts and introduces Test-Time Training with an end-to-end formulation (TTT-E2E) as a solution.

Neural Networks, Recurrent Neural Networks, Transformer, Transformers

Yu Sun

6 min read

Has Summary

NVIDIA

Intermediate

Using AI Physics for Technology Computer-Aided Design Simulations

The article discusses the integration of AI Physics into Technology Computer-Aided Design (TCAD) simulations, highlighting its significance in semiconductor manufacturing.

Graph Neural Networks, Hugging Face, Neural Networks, Python, PyTorch

Ram Cherukuri

7 min read

Has Summary

NVIDIA

Intermediate

Supercharging Fraud Detection in Financial Services with Graph Neural Networks (Updated)

The article discusses the application of Graph Neural Networks (GNNs) in enhancing fraud detection within financial services.

Apache, Apache Spark, Docker, Graph Neural Networks, JSON, Kubernetes, Neural Networks, XGBoost

Naim

10 min read

Includes Code

Has Summary

Advanced

Establishing a Large Scale Learned Retrieval System at Pinterest

The article discusses the establishment of a large-scale learned retrieval system at Pinterest, focusing on the transition from heuristic-based methods to an embedding-based retrieval system.

Machine Learning, Neural Networks, Transformer

Pinterest Engineering

7 min read

Has Summary

NVIDIA

Intermediate

Data-Efficient Knowledge Distillation for Supervised Fine-Tuning with NVIDIA NeMo-Aligner

The article discusses the implementation of data-efficient knowledge distillation using NVIDIA NeMo-Aligner during supervised fine-tuning (SFT).

Caching, Large Language Models, Neural Networks

Anna Shors

5 min read

Has Summary

NVIDIA

Advanced

Introducing Tile-Based Programming in Warp 1.5.0

The article introduces tile-based programming in Warp 1.5.0, highlighting new Python primitives that enhance GPU programming efficiency.

Neural Networks, NumPy, Python, PyTorch, Warp

Miles Macklin

13 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Deep Learning Institute Releases New Data Science Teaching Kit for Educators

The NVIDIA Deep Learning Institute has launched the Accelerated Data Science Teaching Kit, aimed at educators to enhance data science education.

Dask, Deep Learning, Machine Learning, NetworkX, Neural Networks, Polars, Python

Joe Bungo

3 min read

Has Summary

NVIDIA

Intermediate

Fusing Epilog Operations with Matrix Multiplication Using nvmath-python

The article discusses the nvmath-python library, which allows Python programmers to perform high-performance mathematical operations using NVIDIA's CUDA-X math libraries.

Neural Networks, Python, PyTorch

Szymon Karpiński

6 min read

Includes Code

Has Summary

Google

Advanced

Gemma explained: RecurrentGemma architecture

The article explores the RecurrentGemma architecture, a hybrid model that combines gated linear recurrences with local sliding window attention, enhancing performance for long context prompts.

Embedding, Neural Networks, Transformer, Transformers

Ju-yeong Ji, Ravin Kumar

6 min read

Includes Code

Has Summary

Advanced

Candidate Generation in a Large Scale Graph Recommendation System: People You May Know

This article discusses the Candidate Generation (CG) stage of LinkedIn's People You May Know (PYMK) recommendation system, detailing the various techniques used to generate relevant candidate pools...

Embedding, Graph Neural Networks, Neural Networks

Parag Agrawal

13 min read

Has Summary

NVIDIA

Advanced

Using Graph Neural Networks for Additive Manufacturing

The article discusses the application of Graph Neural Networks (GNNs) in optimizing the design and simulation of lattice structures in additive manufacturing.

Apache, Graph Neural Networks, Neural Networks

Ayush Jain

6 min read

Has Summary

NVIDIA

Advanced

Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 2

This article explores the optimization of memory and retrieval processes for large-scale Graph Neural Networks (GNNs) using WholeGraph, a feature of the RAPIDS cuGraph library.

Embedding, Graph Neural Networks, Neural Networks

Dongxu Yang

5 min read

Has Summary

NVIDIA

Intermediate

Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 1

The article discusses WholeGraph, a feature in the RAPIDS cuGraph library designed to optimize memory and retrieval for Graph Neural Networks (GNNs).

DGL, Embedding, Graph Neural Networks, Neural Networks, NumPy, Python, PyTorch

Dongxu Yang

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

Turning Machine Learning to Federated Learning in Minutes with NVIDIA FLARE 2.4

The article discusses the rapid adoption of federated learning (FL) and the new features introduced in NVIDIA FLARE 2.4.

AWS, Azure, Federated Learning, GPT, Graph Neural Networks, gRPC, Hugging Face, Machine Learning, Neural Networks, PyTorch, XGBoost

Chester Chen

15 min read

Includes Code

Has Summary

Advanced

Evolution of Ads Conversion Optimization Models at Pinterest

The article discusses the evolution of ads conversion optimization models at Pinterest, highlighting the transition from Gradient Boosted Decision Trees (GBDT) to advanced Deep Neural Networks (DNN...

AutoML, Machine Learning, Neural Networks, PyTorch, Transformer, V

Pinterest Engineering

12 min read

Has Summary

NVIDIA

Intermediate

Mastering LLM Techniques: Training

The article discusses the intricacies of training Large Language Models (LLMs) using transformer networks, focusing on model architectures, attention mechanisms, and embedding techniques.

Attention Mechanism, BERT, Embedding, GPT, Large Language Models, Neural Networks, Recurrent Neural Networks, Self-Attention, Transformer, Transformers, V

Anjali Shah

14 min read

Has Summary

NVIDIA

Advanced

Enabling Greater Patient-Specific Cardiovascular Care with AI Surrogates

A Stanford University team is revolutionizing cardiovascular care through AI-driven simulations that provide patient-specific blood flow visualizations.

Apache, Graph Neural Networks, Neural Networks

Harpreet Sethi

8 min read

Has Summary

NVIDIA

Intermediate

Power Your Business with NVIDIA AI Enterprise 4.0 for Production-Ready Generative AI

The article discusses NVIDIA AI Enterprise 4.0, a comprehensive solution designed to support enterprises in developing and deploying generative AI applications.

AWS, Azure, Generative AI, Google Cloud, Graph Neural Networks, Kubernetes, Machine Learning, Neural Networks, Python, Retrieval Augmented Generation

Phoebe Lee

4 min read

Has Summary

NVIDIA

Intermediate

Introduction to Graph Neural Networks with NVIDIA cuGraph-DGL

This article introduces Graph Neural Networks (GNNs) and how to utilize cuGraph-DGL, a GPU-accelerated library for graph computations.

DGL, Graph Neural Networks, Neural Networks, Python, PyTorch, TensorFlow

Vibhu Jawa

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Designing Deep Networks to Process Other Deep Networks

The article discusses the design of deep neural networks (DNNs) that can process the weights of other DNNs, focusing on architectures that leverage the symmetries of weight spaces.

Deep Learning, Graph Neural Networks, Neural Networks, Transformer, Transformers, V

Haggai Maron

14 min read

Has Summary

NVIDIA

Intermediate

Structured Sparsity in the NVIDIA Ampere Architecture and Applications in Search Engines

The article discusses the structured sparsity feature in the NVIDIA Ampere architecture, particularly focusing on its implementation in deep learning and applications in search engines.

BERT, Machine Learning, Neural Networks, Python, Self-Attention, Transformers

Hongxiao Bai

12 min read

Includes Code

Has Summary

NVIDIA

Beginner

Predicting Credit Defaults Using Time-Series Models with Recursive Neural Networks and XGBoost

This article discusses the use of time-series models, specifically autoregressive recursive neural networks and XGBoost, for predicting credit defaults.

LightGBM, Neural Networks, PyTorch, scikit-learn, TensorFlow, XGBoost

Jiwei Liu

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Develop Physics-Informed Machine Learning Models with Graph Neural Networks

The article discusses NVIDIA PhysicsNeMo, a framework for developing physics-informed machine learning models, with a focus on the latest update that introduces support for Graph Neural Networks (G...

Apache, Deep Learning, Graph Neural Networks, Machine Learning, Neural Networks, PyTorch

Bhoomi Gadhia

5 min read

Has Summary

NVIDIA

Intermediate

Sparsity in INT8: Training Workflow and Best Practices for NVIDIA TensorRT Acceleration

The article discusses the training workflow and best practices for implementing sparsity in INT8 models using NVIDIA TensorRT.

Deep Learning, Neural Networks, Python, PyTorch, ResNet, TensorFlow, torchvision

Gwena Cunha Sergio

11 min read

Includes Code

Has Summary

Advanced

An ML based approach to proactive advertiser churn prevention

This article discusses a Machine Learning (ML) based approach to proactively prevent advertiser churn at Pinterest.

LSTM, Machine Learning, Neural Networks, SHAP, Transformers

Pinterest Engineering

8 min read

Has Summary

NVIDIA

Intermediate

Power-Up Your Skills and Credentials at NVIDIA GTC 2023

The article discusses the NVIDIA GTC 2023 conference, highlighting its extensive training opportunities in AI, HPC, and the metaverse.

Computer Vision, Deep Learning, Natural Language Processing, Neural Networks, Python, Transformer

Ann Sheridan

5 min read

Has Summary

ClickHouse

Beginner

Five Methods For Database Obfuscation

The article discusses five methods for database obfuscation, emphasizing the importance of using realistic data for performance testing in analytical databases like ClickHouse.

LSTM, MySQL, Neural Networks, PostgreSQL, Python, Recurrent Neural Networks, Redis, SQL, V

Alexey Milovidov

27 min read

Includes Code

Has Summary

NVIDIA

Advanced

Benchmarking Deep Neural Networks for Low-Latency Trading and Rapid Backtesting on NVIDIA GPUs

The article discusses the benchmarking of deep neural networks, specifically Long Short-Term Memory (LSTM) models, for low-latency trading and rapid backtesting using NVIDIA GPUs.

LSTM, Neural Networks, PyTorch, TensorFlow

Martin Marciniszyn Mehringer

7 min read

Has Summary

Netflix

Advanced

Causal Machine Learning for Creative Insights

The article discusses the application of causal machine learning at Netflix to derive creative insights for promotional artwork.

Machine Learning, Neural Networks

Netflix Technology Blog

15 min read

Has Summary

Intermediate

Building LinkedIn's Skills Graph to Power a Skills-First World

The article discusses the development of LinkedIn's Skills Graph, which aims to create a skills-first job market by mapping the relationships between skills, people, and organizations.

Deep Learning, Machine Learning, Neural Networks

Sofus Macskássy

8 min read

Has Summary

NVIDIA

Advanced

NVIDIA Grace Hopper Superchip Architecture In-Depth

The NVIDIA Grace Hopper Superchip Architecture represents a significant advancement in heterogeneous computing, combining NVIDIA Grace CPUs and Hopper GPUs to optimize performance for AI and high-p...

Deep Learning, Embedding, Fortran, GPT, Graph Neural Networks, Natural Language Processing, Neural Networks, Python, Render, Transformer

Jonathon Evans

15 min read

Has Summary

NVIDIA

Intermediate

Optimizing Fraud Detection in Financial Services with Graph Neural Networks and NVIDIA GPUs

The article discusses how Graph Neural Networks (GNNs) and NVIDIA GPUs can optimize fraud detection in financial services.

AWS, Dask, DGL, Graph Neural Networks, Neural Networks, Python, PyTorch, PyTorch Geometric, XGBoost

Ashish Sardana

21 min read

Includes Code

Has Summary

NVIDIA

Advanced

New SDKs Accelerating AI Research, Computer Vision, Data Science, and More

NVIDIA has announced significant updates to its AI software suite, including JAX, NVIDIA CV-CUDA, and NVIDIA RAPIDS, aimed at accelerating AI research, computer vision, and data science.

Apache, Apache Spark, Computer Vision, Dask, Deep Learning, DGL, Google Cloud, GPT, JAX, Kubernetes, Neural Networks, NumPy, PyTorch, PyTorch Geometric, SQL

Siddharth Sharma

7 min read

Has Summary

NVIDIA

Intermediate

Get Hands-on Training from NVIDIA Experts at GTC

The article discusses hands-on training opportunities provided by the NVIDIA Deep Learning Institute (DLI) at the upcoming GPU Technology Conference (GTC).

Deep Learning, Neural Networks

Ann Sheridan

5 min read

Has Summary

NVIDIA

Advanced

Deploying GPT-J and T5 with NVIDIA Triton Inference Server

This article provides a comprehensive guide on deploying large transformer models like GPT-J and T5 using NVIDIA's Triton Inference Server and FasterTransformer library.

BERT, Docker, GPT, Hugging Face, Neural Networks, Python, PyTorch, T5, TensorFlow, Transformer

Denis Timonin

15 min read

Includes Code

Has Summary

NVIDIA

Advanced

NVIDIA AI Platform Delivers Big Gains for Large Language Models

NVIDIA has announced significant updates to the NeMo framework, enhancing the training speed of large language models (LLMs) by up to 30%.

Azure, Deep Learning, GPT, Hugging Face, Large Language Models, Neural Networks, Transformer, V

Markel Ausin

6 min read

Has Summary

NVIDIA

Advanced

Accelerating GPU Applications with NVIDIA Math Libraries

The article discusses how to accelerate GPU applications using NVIDIA Math Libraries, highlighting three main approaches: compiler directives, programming languages, and preprogrammed libraries.

Deep Learning, Fortran, Neural Networks, Python

Aastha Jhunjhunwala

12 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Transformers4Rec: Building Session-Based Recommendations with an NVIDIA Merlin Library

The article introduces Transformers4Rec, a library from NVIDIA Merlin designed for building session-based recommendation systems using state-of-the-art Transformer architectures.

Hugging Face, Keras, Neural Networks, PyTorch, Recurrent Neural Networks, TensorFlow, Transformer, Transformers

Ronay AK

7 min read

Includes Code

Has Summary

Airbnb

Advanced

Graph Machine Learning at Airbnb

The article discusses the application of graph machine learning at Airbnb, highlighting how graph structures can enhance machine learning models by providing contextual information about users.

Graph Neural Networks, Machine Learning, Neural Networks, SHAP

Devin Soni

12 min read

Has Summary

OpenAI

Advanced

Techniques for training large neural networks

The article discusses various techniques for training large neural networks, focusing on the challenges and strategies involved in parallelizing model training across multiple GPUs.

Kubernetes, Neural Networks, Transformer

Lilian Weng

9 min read

Has Summary

NVIDIA

Beginner

NVIDIA Experts Explore Robotics, GNNs, and NLP Advancements at the WeAreDevelopers World Congress

NVIDIA experts will present advancements in robotics, Graph Neural Networks (GNNs), and Natural Language Processing (NLP) at the WeAreDevelopers World Congress in Berlin.

Deep Learning, GPT, Graph Neural Networks, JavaScript, Natural Language Processing, Neural Networks, PHP, Python

Marjut Dieringer

4 min read

Has Summary

Intermediate

Performance-Adaptive Sampling Strategy (PASS) for GNNs: Open sourcing PASS

The article discusses the Performance-Adaptive Sampling Strategy (PASS) for Graph Neural Networks (GNNs) and announces its open-source release.

Graph Neural Networks, Machine Learning, Neural Networks

Jaewon Yang

4 min read

Has Summary

NVIDIA

Advanced

A Data Scientist’s Guide to Gradient Descent and Backpropagation Algorithms

This article serves as a guide for Data Scientists to understand the fundamental concepts of gradient descent and backpropagation algorithms, which are essential for training Artificial Neural Netw...

Deep Learning, Neural Networks, PyTorch, scikit-learn, TensorFlow

Richmond Alake

9 min read

Has Summary

Intermediate

Completing a member knowledge graph with Graph Neural Networks

The article discusses the completion of member knowledge graphs using Graph Neural Networks (GNNs), specifically introducing a novel model called Entity-BERT.

BERT, Graph Neural Networks, Machine Learning, Natural Language Processing, Neural Networks, Solid, Transformer, Transformers

Jaewon Yang

7 min read

Has Summary

NVIDIA

Intermediate

NVIDIA GTC: A Complete Overview of Nsight Developer Tools

The article provides a comprehensive overview of NVIDIA's Nsight Developer Tools, which are designed to optimize computational applications across various architectures.

Deep Learning, HTML, Machine Learning, Neural Networks, PyTorch

Chaitrali Joshi

6 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Announces PhysicsNeMo: A Framework for Developing Physics ML Models for Digital Twins

NVIDIA has launched PhysicsNeMo, a framework for training neural networks that integrates governing physics equations with observed or simulated data, aimed at enhancing the development of digital ...

AWS, Neural Networks, Python, PyTorch, TensorFlow

Jay Gould

2 min read

Has Summary

NVIDIA

Intermediate

CUDA-X Accelerated DGL Containers for Large Graph Neural Networks

NVIDIA has introduced GPU-accelerated Deep Graph Library (DGL) containers to assist developers, researchers, and data scientists in working with Graph Neural Networks (GNN) on large heterogeneous g...

DGL, Graph Neural Networks, Machine Learning, Neural Networks, PyTorch, Transformer

Gordana Neskovic

3 min read

Has Summary

NVIDIA

Intermediate

NVIDIA Launches Updated Teaching Kit for Edge AI and Robotics Educators

NVIDIA has released an updated Edge AI and Robotics Teaching Kit aimed at university educators, developed in collaboration with experts from the University of Oxford and the University of Maryland,...

Deep Learning, Neural Networks, PyTorch, Reinforcement Learning, Spring

Jason Black

3 min read

Has Summary

NVIDIA

Intermediate

Accelerating Product Development with Physics-Informed Neural Networks and NVIDIA PhysicsNeMo

The article discusses NVIDIA PhysicsNeMo, an AI toolkit that leverages physics-informed neural networks (PINNs) to enhance product development by solving complex nonlinear physics problems.

Neural Networks, Python, TensorBoard

Michael Eidell

9 min read

Has Summary

NVIDIA

Intermediate

Learn from NVIDIA Experts at GTC DRIVE Developer Day

The article discusses the upcoming NVIDIA DRIVE Developer Day at NVIDIA GTC, where developers can learn about the latest features in autonomous vehicle technology from NVIDIA experts.

Neural Networks

Katie Washabaugh

1 min read

Has Summary