How NVIDIA Uses RoBERTa

7 engineering articles about RoBERTa from NVIDIA's engineering team

Other NVIDIA Technologies

Python(740)PyTorch(566)Deep Learning(505)TensorFlow(444)Docker(292)Kubernetes(251)

Other Companies Using RoBERTa

Pinterest(2)

Articles

Filter:

NVIDIA

Advanced

Efficiently Scale LLM Training Across a Large GPU Cluster with Alpa and Ray

The article discusses how to efficiently scale large language model (LLM) training across a large GPU cluster using the open-source frameworks Alpa and Ray.

AWSBERTChatGPTDALL-EGenerative AIGPTJAXPythonRoBERTaStable DiffusionT5TensorFlow

Jiao Dong

14 min read

Includes Code

Has Summary

NVIDIA

Intermediate

End-to-End AI for NVIDIA-Based PCs: NVIDIA TensorRT Deployment

This article discusses the deployment of NVIDIA TensorRT for AI inference on NVIDIA hardware, focusing on optimizing performance and compatibility.

PythonRoBERTa

Maximilian Müller

10 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Run State of the Art NLP Workloads at Scale with RAPIDS, HuggingFace, and Dask

This article discusses how to leverage RAPIDS, HuggingFace, and Dask to run state-of-the-art NLP workloads at scale on GPUs.

ApacheApache SparkBERTDaskGPTNLTKRapidsRoBERTaspaCyTransformers

Vibhu Jawa

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

State-of-the-Art Language Modeling Using Megatron on the NVIDIA A100 GPU

This article discusses the advancements in language modeling using Megatron on the NVIDIA A100 GPU, highlighting the significant improvements in natural language processing tasks achieved through m...

BERTPyTorchRoBERTaTransformers

Mohammad Shoeybi

9 min read

Has Summary

NVIDIA

Advanced

Allen Institute for AI Announces BERT-Breakthrough: Passing a 12th-Grade Science Exam

The Allen Institute for Artificial Intelligence has achieved a significant milestone with its BERT-based model, Aristo, which successfully passed a 12th-grade science exam with an accuracy of 83%.

AllenNLPArtificial IntelligenceBERTGoogle CloudPyTorchRoBERTa

Nefi Alarcon

3 min read

Has Summary

NVIDIA

Advanced

NVIDIA Clocks World’s Fastest BERT Training Time and Largest Transformer Based Model, Paving Path For Advanced

NVIDIA has achieved a groundbreaking milestone by training BERT-Large in just 47 minutes using the DGX SuperPOD, and has also developed the largest Transformer-based model, GPT-2 8B, with 8.

BERTGPTLessPyTorchRoBERTaTransformer

Shar Narasimhan

8 min read

Has Summary

NVIDIA

Advanced

Real-Time Natural Language Understanding with BERT Using TensorRT

The article discusses the optimizations NVIDIA has made to the BERT model using TensorRT, enabling real-time natural language understanding with significantly reduced latency.

BERTDockerGoogle CloudGPTPythonRoBERTaSelf-AttentionTransformerTransformersV

Purnendu Mukherjee

19 min read

Includes Code

Has Summary

You've reached the end! All 7 articles loaded.