#

Self-Attention Programming Tutorials & Engineering Articles

7 Self-Attention tutorials, guides, and engineering insights from NVIDIA

Companies Using This

Self-Attention Articles & Tutorials

Filter:
NVIDIA logo
NVIDIA
Advanced
This article discusses the emulation of the attention mechanism in transformer models using a fully convolutional network, specifically targeting improvements in computer vision tasks.
NVIDIA logo
NVIDIA
Advanced
This article discusses inference optimization techniques for large language models (LLMs), highlighting the challenges and solutions associated with memory and compute efficiency.
Shashank Verma
24 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article discusses the intricacies of training Large Language Models (LLMs) using transformer networks, focusing on model architectures, attention mechanisms, and embedding techniques.
NVIDIA logo
NVIDIA
Intermediate
The article discusses the structured sparsity feature in the NVIDIA Ampere architecture, particularly focusing on its implementation in deep learning and applications in search engines.
Hongxiao Bai
12 min read
Includes Code
Has Summary
--
Uber logo
Uber
Advanced
The article discusses DeepETA, Uber's advanced model for predicting arrival times using deep learning techniques.
NVIDIA logo
NVIDIA
Advanced
The article discusses the optimizations NVIDIA has made to the BERT model using TensorRT, enabling real-time natural language understanding with significantly reduced latency.
Purnendu Mukherjee
19 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the development of Sparse Transformers, a novel deep neural network architecture that enhances the prediction of sequences in various domains, including text, images, and soun...
Rewon Child
7 min read
Has Summary
--

You've reached the end! All 7 articles loaded.