NVIDIA logo

How NVIDIA Uses RLHF

33 articles about RLHF from NVIDIA's engineering team

Articles

NVIDIA
Advanced
This article explores how to train an AI agent to operate a new Command Line Interface (CLI) using synthetic data generation and reinforcement learning.
Chris Alexiuk
11 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the NVIDIA Rubin platform, which introduces six new chips designed to create a powerful AI supercomputer.
--
NVIDIA
Intermediate
The article discusses the development of scientific AI agents using reinforcement learning (RL) techniques, specifically through the NVIDIA NeMo framework.
Christian Munley
12 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the innovative use of AI co-scientists in scientific research, specifically focusing on fusion research and cancer treatment.
Geetika Gupta
8 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses the use of synthetic data in post-training procedures for large language models (LLMs) and highlights NVIDIA's open-sourcing of the Llama-Nemotron post-training dataset, which...
Vinh Nguyen
5 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the development and capabilities of NVIDIA's Llama Nemotron reasoning models, which enhance AI agents' reasoning abilities for complex problem-solving in various industries.
Chris Alexiuk
11 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the Llama Nemotron models, which enhance agentic AI workflows by integrating large language models with advanced reasoning and planning capabilities.
Chintan Patel
7 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses the development of a new reward model, Llama 3.
Zhilin Wang
3 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the deployment of the Llama 3.
Anjali Shah
6 min read
Has Summary
--
NVIDIA
Advanced
The article discusses how NVIDIA optimizes data center performance using AI agents and the OODA loop strategy.
Aaron Erickson
11 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the Mistral NeMo 12B model, a next-generation language model developed by NVIDIA and Mistral, designed for high performance on a single GPU.
Anjali Shah
6 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the launch of Meta's Llama 3.
Anjali Shah
8 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses the Llama 3.1 collection of large language models (LLMs) and their applications in enterprise settings.
Chintan Patel
10 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
NVIDIA has achieved new generative AI performance records in MLPerf Training v4.0, showcasing significant advancements in training large language models (LLMs) and graph neural networks (GNNs).
--
NVIDIA
Advanced
The article introduces DRaFT+, an enhanced algorithm for fine-tuning text-to-image diffusion models, which aims to improve the alignment between input prompts and generated images.
Ali Taghibakhshi
9 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses NVIDIA NeMo Customizer, a microservice designed to simplify the fine-tuning and alignment of large language models (LLMs) for enterprise AI applications.
Nirmal Kumar Juluru
5 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the NVIDIA NeMo microservices, which simplify the development of custom generative AI models for enterprises.
Nirmal Kumar Juluru
5 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses StarCoder2, an advanced large language model (LLM) designed to enhance coding efficiency for developers.
Chia-Chih Chen
7 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
NVIDIA collaborates with Google to enhance inference performance for the Gemma models using TensorRT-LLM, facilitating easier development with large language models (LLMs) on NVIDIA RTX GPUs.
Anjali Shah
4 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses the collaboration between H2O.ai and NVIDIA to enhance AI applications in financial services through generative AI and predictive analytics.
Prabhu Ramamoorthy
13 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the latest features of the NVIDIA NeMo framework and the performance enhancements brought by the NVIDIA H200 GPUs, which significantly improve the training of large language m...
Ashraf Eassa
9 min read
Has Summary
--
NVIDIA
Advanced
Dell Technologies and NVIDIA have collaborated to set new records in financial risk calculations using the NVIDIA H100 system for high-performance computing (HPC) and AI.
Prabhu Ramamoorthy
7 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses the evolution of machine learning operations (MLOps) into specialized areas such as GenAIOps and LLMOps, focusing on the development and management of generative AI and large ...
Nik Spirin
13 min read
Has Summary
--
NVIDIA
Advanced
The article discusses NVIDIA's AI Foundation Models, specifically the Nemotron-3 8B family, which enables the creation of custom enterprise chatbots and co-pilots with production-ready capabilities.
Vivienne Zhang
12 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses how to build custom enterprise-grade generative AI applications using NVIDIA's AI Foundation Models.
--
NVIDIA
Intermediate
The article discusses the application of large language models (LLMs) in enterprise solutions, highlighting their capabilities in enhancing productivity across various industries.
--
NVIDIA
Intermediate
NVIDIA has introduced the Jetson Generative AI Lab, enabling developers to leverage generative AI capabilities on Jetson edge devices.
--
NVIDIA
Advanced
NVIDIA SteerLM is a novel technique designed to simplify the customization of large language models (LLMs) during inference.
Yi Dong
10 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses various techniques for customizing large language models (LLMs) to better fit enterprise needs, emphasizing the importance of tailoring language processing capabilities for sp...
--
NVIDIA
Advanced
The article discusses NVIDIA NeMo, an end-to-end platform designed to facilitate the development and deployment of enterprise-ready large language models (LLMs).
Amanda Saunders
9 min read
Has Summary
--
NVIDIA
Intermediate
Generative AI technologies are transforming the creation and interaction of non-playable characters (NPCs) in games, enabling developers to create more intelligent and dynamic gaming experiences.
Ike Nnoli
5 min read
Has Summary
--
NVIDIA
Beginner
NVIDIA introduces NeMo Guardrails, an open-source toolkit designed to create safe and trustworthy large language model (LLM) conversational systems.
Annamalai Chockalingam
7 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
NVIDIA has introduced generative AI services aimed at enhancing language, visual content, and biology applications.
