#
Recurrent Neural Networks Programming Tutorials & Engineering Articles
19 Recurrent Neural Networks tutorials, guides, and engineering insights from NVIDIA
Companies Using This
Recurrent Neural Networks Articles & Tutorials
Filter:
The article discusses the limitations of current large language models (LLMs) in handling long contexts and introduces Test-Time Training with an end-to-end formulation (TTT-E2E) as a solution.
Yu Sun
6 min read
Has Summary
--
The article discusses the intricacies of training Large Language Models (LLMs) using transformer networks, focusing on model architectures, attention mechanisms, and embedding techniques.
Attention MechanismBERTEmbeddingGPTLarge Language ModelsNeural NetworksRecurrent Neural NetworksSelf-AttentionTransformerTransformersV
Anjali Shah
14 min read
Has Summary
--
The article discusses five methods for database obfuscation, emphasizing the importance of using realistic data for performance testing in analytical databases like ClickHouse.
Alexey Milovidov
27 min read
Includes Code
Has Summary
--
The article introduces Transformers4Rec, a library from NVIDIA Merlin designed for building session-based recommendation systems using state-of-the-art Transformer architectures.
Ronay AK
7 min read
Includes Code
Has Summary
--
NVIDIA's Deep Learning Institute is now offering instructor-led workshops remotely, providing hands-on training in AI, accelerated computing, and data science.
Nefi Alarcon
1 min read
Has Summary
--
The article discusses Hierarchical Temporal Convolutional Networks (HierTCN), a deep learning architecture designed for dynamic recommendations based on user interactions.
Pinterest Engineering
4 min read
Has Summary
--
The article introduces NVIDIA TensorRT™, a high-performance deep learning inference optimizer and runtime, focusing on configuring a simple Recurrent Neural Network (RNN) using TensorRT.
Shiva Pentyala
2 min read
Has Summary
--
This article introduces NVIDIA TensorRT, a high-performance deep learning inference optimizer, and demonstrates how to configure a simple Recurrent Neural Network (RNN) using TensorRT.
Nefi Alarcon
2 min read
Has Summary
--
NVIDIA has released TensorRT 4, which enhances the acceleration of inference applications like neural machine translation, recommender systems, and speech.
Nefi Alarcon
1 min read
Has Summary
--
NVIDIA announced significant updates to its software suite, including the CUDA Toolkit, NV Deep Learning SDK, and TensorRT, aimed at enhancing performance for deep learning and AI applications.
Brad Nemire
4 min read
Has Summary
--
The article summarizes key highlights from the Uber Engineering Blog in 2017, showcasing advancements in technology that have enhanced user experiences across Uber's services.
Molly Vorwerck
2 min read
Has Summary
--
NVIDIA's JetPack 3. 1 significantly enhances the low-latency inference performance of the Jetson TX1 and TX2 platforms, doubling the deep learning inference capabilities for real-time applications.
Dustin Franklin
6 min read
Includes Code
Has Summary
--
The article discusses Recursive Neural Networks (RNNs) implemented using PyTorch, emphasizing their hierarchical structure for natural language processing.
James Bradbury
22 min read
Includes Code
Has Summary
--
The article discusses the launch of the NVIDIA Jetson TX2, a powerful low-power embedded platform designed for AI compute performance at the edge.
Dustin Franklin
17 min read
Includes Code
Has Summary
--
The article discusses the optimizations made in cuDNN 5 for Recurrent Neural Networks (RNNs), focusing on performance improvements and new features that enhance the efficiency of sequence learning ...
Jeremy Appleyard
9 min read
Includes Code
Has Summary
--
This article provides an introduction to sequence learning in deep learning, focusing on recurrent neural networks (RNNs) and Long Short-Term Memory (LSTM) units.
Tim Dettmers
13 min read
Has Summary
--
Mocha. jl is a deep learning library for Julia, designed for scientific and numerical computing.
Chiyuan Zhang
10 min read
Includes Code
Has Summary
--
This article concludes a three-part series on Neural Machine Translation (NMT) with GPUs, focusing on the limitations of simple encoder-decoder architectures and the introduction of the soft attent...
Kyunghyun Cho
18 min read
Includes Code
Has Summary
--
This article introduces Neural Machine Translation (NMT) using GPUs, focusing on the encoder-decoder model and the role of recurrent neural networks (RNNs) in processing variable-length sequences.
Kyunghyun Cho
12 min read
Has Summary
--
You've reached the end! All 19 articles loaded.