Self-Attention Programming Tutorials & Engineering Articles

7 Self-Attention tutorials, guides, and engineering insights from NVIDIA

Companies Using This

NVIDIA(5)

Self-Attention Articles & Tutorials

Filter:

NVIDIA

Advanced

Emulating the Attention Mechanism in Transformer Models with a Fully Convolutional Network

This article discusses the emulation of the attention mechanism in transformer models using a fully convolutional network, specifically targeting improvements in computer vision tasks.

Attention MechanismResNetSelf-AttentionTransformerTransformersV

John Yang

12 min read

Has Summary

NVIDIA

Advanced

Mastering LLM Techniques: Inference Optimization

This article discusses inference optimization techniques for large language models (LLMs), highlighting the challenges and solutions associated with memory and compute efficiency.

Autoregressive ModelsBERTGPTSelf-AttentionTransformerV

Shashank Verma

24 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Mastering LLM Techniques: Training

The article discusses the intricacies of training Large Language Models (LLMs) using transformer networks, focusing on model architectures, attention mechanisms, and embedding techniques.

Attention MechanismBERTEmbeddingGPTLarge Language ModelsNeural NetworksRecurrent Neural NetworksSelf-AttentionTransformerTransformersV

Anjali Shah

14 min read

Has Summary

NVIDIA

Intermediate

Structured Sparsity in the NVIDIA Ampere Architecture and Applications in Search Engines

The article discusses the structured sparsity feature in the NVIDIA Ampere architecture, particularly focusing on its implementation in deep learning and applications in search engines.

BERTMachine LearningNeural NetworksPythonSelf-AttentionTransformers

Hongxiao Bai

12 min read

Includes Code

Has Summary

Uber

Advanced

DeepETA: How Uber Predicts Arrival Times Using Deep Learning

The article discusses DeepETA, Uber's advanced model for predicting arrival times using deep learning techniques.

ApacheApache SparkComputer VisionDeep LearningMachine LearningSelf-AttentionTensorFlowTransformerTransformersXGBoost

Xinyu Hu, Olcay Cirit, Tanmay Binaykiya, Ramit Hora

15 min read

Has Summary

NVIDIA

Advanced

Real-Time Natural Language Understanding with BERT Using TensorRT

The article discusses the optimizations NVIDIA has made to the BERT model using TensorRT, enabling real-time natural language understanding with significantly reduced latency.

BERTDockerGoogle CloudGPTPythonRoBERTaSelf-AttentionTransformerTransformersV

Purnendu Mukherjee

19 min read

Includes Code

Has Summary

OpenAI

Advanced

Generative modeling with sparse transformers

The article discusses the development of Sparse Transformers, a novel deep neural network architecture that enhances the prediction of sequences in various domains, including text, images, and soun...

Self-AttentionTransformerTransformersWhisper

Rewon Child

7 min read

Has Summary

You've reached the end! All 7 articles loaded.