How NVIDIA Uses Recurrent Neural Networks

16 engineering articles about Recurrent Neural Networks from NVIDIA's engineering team

Other NVIDIA Technologies

Python(740)PyTorch(566)Deep Learning(505)TensorFlow(444)Docker(292)Kubernetes(251)

Articles

Filter:

NVIDIA

Advanced

Reimagining LLM Memory: Using Context as Training Data Unlocks Models That Learn at Test-Time

The article discusses the limitations of current large language models (LLMs) in handling long contexts and introduces Test-Time Training with an end-to-end formulation (TTT-E2E) as a solution.

Neural NetworksRecurrent Neural NetworksTransformerTransformers

Yu Sun

6 min read

Has Summary

NVIDIA

Intermediate

Mastering LLM Techniques: Training

The article discusses the intricacies of training Large Language Models (LLMs) using transformer networks, focusing on model architectures, attention mechanisms, and embedding techniques.

Attention MechanismBERTEmbeddingGPTLarge Language ModelsNeural NetworksRecurrent Neural NetworksSelf-AttentionTransformerTransformersV

Anjali Shah

14 min read

Has Summary

NVIDIA

Intermediate

Transformers4Rec: Building Session-Based Recommendations with an NVIDIA Merlin Library

The article introduces Transformers4Rec, a library from NVIDIA Merlin designed for building session-based recommendation systems using state-of-the-art Transformer architectures.

Hugging FaceKerasNeural NetworksPyTorchRecurrent Neural NetworksTensorFlowTransformerTransformers

Ronay AK

7 min read

Includes Code

Has Summary

NVIDIA

Advanced

NVIDIA Deep Learning Institute Instructor-Led Training Now Available Remotely

NVIDIA's Deep Learning Institute is now offering instructor-led workshops remotely, providing hands-on training in AI, accelerated computing, and data science.

Deep LearningKerasNeural NetworksRecurrent Neural NetworksTensorFlow

Nefi Alarcon

1 min read

Has Summary

NVIDIA

Intermediate

Video: Introduction to Recurrent Neural Networks in TensorRT

The article introduces NVIDIA TensorRT™, a high-performance deep learning inference optimizer and runtime, focusing on configuring a simple Recurrent Neural Network (RNN) using TensorRT.

EmbeddingNeural NetworksPythonRecurrent Neural Networks

Shiva Pentyala

2 min read

Has Summary

NVIDIA

Beginner

Video Tutorial: Introduction to Recurrent Neural Networks in TensorRT

This article introduces NVIDIA TensorRT, a high-performance deep learning inference optimizer, and demonstrates how to configure a simple Recurrent Neural Network (RNN) using TensorRT.

EmbeddingNeural NetworksPythonRecurrent Neural Networks

Nefi Alarcon

2 min read

Has Summary

NVIDIA

Beginner

NVIDIA Releases TensorRT 4

NVIDIA has released TensorRT 4, which enhances the acceleration of inference applications like neural machine translation, recommender systems, and speech.

Neural NetworksPyTorchRecurrent Neural Networks

Nefi Alarcon

1 min read

Has Summary

NVIDIA

Intermediate

NVIDIA Announces New Software and Updates to CUDA, Deep Learning SDK and More

NVIDIA announced significant updates to its software suite, including the CUDA Toolkit, NV Deep Learning SDK, and TensorRT, aimed at enhancing performance for deep learning and AI applications.

Deep LearningKongKubernetesMATLABNeural NetworksPyTorchRecurrent Neural NetworksTensorFlow

Brad Nemire

4 min read

Has Summary

NVIDIA

Intermediate

JetPack 3.1 Doubles Jetson’s Low-Latency Inference Performance

NVIDIA's JetPack 3. 1 significantly enhances the low-latency inference performance of the Jetson TX1 and TX2 platforms, doubling the deep learning inference capabilities for real-time applications.

GRULSTMNeural NetworksRecurrent Neural NetworksResNetYOLO

Dustin Franklin

6 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Recursive Neural Networks with PyTorch

The article discusses Recursive Neural Networks (RNNs) implemented using PyTorch, emphasizing their hierarchical structure for natural language processing.

KerasLSTMNeural NetworksPythonPyTorchRecurrent Neural NetworksTensorFlow

James Bradbury

22 min read

Includes Code

Has Summary

NVIDIA

Advanced

NVIDIA Jetson TX2 Delivers Twice the Intelligence to the Edge

The article discusses the launch of the NVIDIA Jetson TX2, a powerful low-power embedded platform designed for AI compute performance at the edge.

Neural NetworksOpenCVRecurrent Neural NetworksTensorFlow

Dustin Franklin

17 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Optimizing Recurrent Neural Networks in cuDNN 5

The article discusses the optimizations made in cuDNN 5 for Recurrent Neural Networks (RNNs), focusing on performance improvements and new features that enhance the efficiency of sequence learning ...

Deep LearningGRULSTMNeural NetworksRecurrent Neural NetworksTransformer

Jeremy Appleyard

9 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Deep Learning in a Nutshell: Sequence Learning

This article provides an introduction to sequence learning in deep learning, focusing on recurrent neural networks (RNNs) and Long Short-Term Memory (LSTM) units.

Deep LearningEmbeddingLSTMNeural NetworksRecurrent Neural Networks

Tim Dettmers

13 min read

Has Summary

NVIDIA

Intermediate

Mocha.jl: Deep Learning for Julia

Mocha. jl is a deep learning library for Julia, designed for scientific and numerical computing.

Computer VisionDeep LearningJuliaNeural NetworksRecurrent Neural Networks

Chiyuan Zhang

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

Introduction to Neural Machine Translation with GPUs (part 3)

This article concludes a three-part series on Neural Machine Translation (NMT) with GPUs, focusing on the limitations of simple encoder-decoder architectures and the introduction of the soft attent...

Attention MechanismNeural NetworksPythonRecurrent Neural NetworksSciPyV

Kyunghyun Cho

18 min read

Includes Code

Has Summary

NVIDIA

Advanced

Introduction to Neural Machine Translation with GPUs (part 1)

This article introduces Neural Machine Translation (NMT) using GPUs, focusing on the encoder-decoder model and the role of recurrent neural networks (RNNs) in processing variable-length sequences.

Deep LearningLSTMNeural NetworksRecurrent Neural Networks

Kyunghyun Cho

12 min read

Has Summary

You've reached the end! All 16 articles loaded.