NVIDIA logo

How NVIDIA Uses LLaMA

9 engineering articles about LLaMA from NVIDIA's engineering team

Articles

Filter:
NVIDIA logo
NVIDIA
Advanced
The article discusses the operational challenges of deploying large language models (LLMs) and introduces LLMOps as a framework for managing their lifecycle.
Liad Levi-Raz
12 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
Stanford University researchers developed an LLM system called STARA to streamline legal research by identifying redundant and outdated laws.
Elias Wolfberg
4 min read
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses the transformative role of domain-adapted large language models (LLMs) with reasoning capabilities in accelerating battery research.
Rucha Apte
11 min read
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article discusses the Llama 3. 1 collection of large language models (LLMs) and their applications in enterprise settings.
Chintan Patel
10 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses the deployment of LoRA (Low-Rank Adaptation) fine-tuned models using NVIDIA NIM, highlighting the advantages of customizing large language models (LLMs) for specific tasks.
Shashank Verma
11 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses the integration of Metaflow and NVIDIA Triton Inference Server for developing and deploying machine learning models.
Eddie Mattia
12 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
NVIDIA has released TensorRT-LLM, an open-source library designed to optimize inference performance for large language models (LLMs) on NVIDIA GPUs.
Neal Vaidya
10 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
NVIDIA SteerLM is a novel technique designed to simplify the customization of large language models (LLMs) during inference.
Yi Dong
10 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article introduces the NVIDIA NeMo Data Curator, a scalable tool designed for curating trillion-token multilingual datasets for training large language models (LLMs).
Joseph Jennings
8 min read
Includes Code
Has Summary
--

You've reached the end! All 9 articles loaded.