How NVIDIA Uses LLaMA
9 engineering articles about LLaMA from NVIDIA's engineering team
Other NVIDIA Technologies
Other Companies Using LLaMA
Articles
Filter:
The article discusses the operational challenges of deploying large language models (LLMs) and introduces LLMOps as a framework for managing their lifecycle.
Liad Levi-Raz
12 min read
Includes Code
Has Summary
--
Stanford University researchers developed an LLM system called STARA to streamline legal research by identifying redundant and outdated laws.
The article discusses the transformative role of domain-adapted large language models (LLMs) with reasoning capabilities in accelerating battery research.
Rucha Apte
11 min read
Has Summary
--
The article discusses the Llama 3. 1 collection of large language models (LLMs) and their applications in enterprise settings.
Chintan Patel
10 min read
Includes Code
Has Summary
--
The article discusses the deployment of LoRA (Low-Rank Adaptation) fine-tuned models using NVIDIA NIM, highlighting the advantages of customizing large language models (LLMs) for specific tasks.
Shashank Verma
11 min read
Includes Code
Has Summary
--
The article discusses the integration of Metaflow and NVIDIA Triton Inference Server for developing and deploying machine learning models.
Eddie Mattia
12 min read
Includes Code
Has Summary
--
NVIDIA has released TensorRT-LLM, an open-source library designed to optimize inference performance for large language models (LLMs) on NVIDIA GPUs.
Neal Vaidya
10 min read
Includes Code
Has Summary
--
NVIDIA SteerLM is a novel technique designed to simplify the customization of large language models (LLMs) during inference.
The article introduces the NVIDIA NeMo Data Curator, a scalable tool designed for curating trillion-token multilingual datasets for training large language models (LLMs).
You've reached the end! All 9 articles loaded.