How OpenAI Uses RLHF

8 engineering articles about RLHF from OpenAI's engineering team

Other OpenAI Technologies

GPT(177)Transformers(23)Whisper(23)Neural Networks(23)Reinforcement Learning(20)Artificial Intelligence(19)

Other Companies Using RLHF

NVIDIA(33)

Google(2)

Articles

Filter:

OpenAI

Advanced

Introducing GPT-4.5

The article introduces GPT-4. 5, OpenAI's latest and most advanced model for chat, highlighting its improvements in unsupervised learning, emotional intelligence, and practical applications.

AzureGPTGPT-4RLHF

OpenAI

12 min read

Has Summary

OpenAI

Intermediate

OpenAI GPT-4.5 System Card

The OpenAI GPT-4. 5 System Card provides insights into the latest advancements in OpenAI's language model, highlighting its capabilities, safety evaluations, and preparedness framework.

GPTGPT-4RLHF

OpenAI

2 min read

Has Summary

OpenAI

Intermediate

Deliberative alignment: reasoning enables safer language models

The article discusses a new alignment strategy called deliberative alignment, which teaches reasoning to language models to enhance their safety.

ClaudeConstitutional AIGeminiGPTReinforcement LearningRLHF

Melody Guan

8 min read

Has Summary

OpenAI

Advanced

Improving Model Safety Behavior with Rule-Based Rewards

The article discusses the development and application of Rule-Based Rewards (RBRs) to enhance the safety behavior of AI models, reducing reliance on extensive human data collection.

GPTRLHF

Tong Mu

9 min read

Has Summary

OpenAI

Intermediate

GPT-4o mini: advancing cost-efficient intelligence

The article introduces GPT-4o mini, OpenAI's most cost-efficient small model, designed to make AI intelligence more accessible and affordable.

ClaudeGeminiGPTRLHF

OpenAI

6 min read

Has Summary

OpenAI

Advanced

Finding GPT-4’s mistakes with GPT-4

The article discusses CriticGPT, a model based on GPT-4, designed to identify errors in ChatGPT responses.

GPTGPT-4Reinforcement LearningRLHF

Nat McAleese

5 min read

Includes Code

Has Summary

OpenAI

Advanced

GPT-4

GPT-4 is the latest milestone in OpenAI's deep learning efforts, showcasing a large multimodal model that accepts both image and text inputs.

AzureGPTGPT-4PaLMRLHFTransformers

OpenAI

15 min read

Has Summary

OpenAI

Intermediate

Aligning language models to follow instructions

The article discusses advancements in training language models to better follow user instructions, specifically focusing on the InstructGPT models developed by OpenAI.

GPTLarge Language ModelsOpenAI APIRLHF

Ryan Lowe

12 min read

Has Summary

You've reached the end! All 8 articles loaded.