How OpenAI Uses Fine-tuning

6 engineering articles about Fine-tuning from OpenAI's engineering team

Other OpenAI Technologies

GPT(177)Transformers(23)Whisper(23)Neural Networks(23)Reinforcement Learning(20)Artificial Intelligence(19)

Other Companies Using Fine-tuning

Articles

Filter:

OpenAI

Advanced

Toward understanding and preventing misalignment generalization

The article discusses emergent misalignment in large language models, particularly focusing on how misaligned persona features can lead to generalized misalignment.

ChiFine-tuningGPTPIL

OpenAI Team

16 min read

Has Summary

OpenAI

Intermediate

Learning to play Minecraft with Video PreTraining

The article discusses the development of a neural network capable of playing Minecraft through a method called Video PreTraining (VPT), leveraging a large dataset of unlabeled gameplay videos.

Artificial IntelligenceFine-tuningGPT

Bowen Baker

8 min read

Has Summary

OpenAI

Advanced

Lessons learned on language model safety and misuse

The article discusses the lessons learned from deploying language models, focusing on safety and misuse.

Fine-tuningGPTOpenAI APIPercy

Miles Brundage

14 min read

Has Summary

OpenAI

Advanced

Learning to summarize with human feedback

The article discusses the application of reinforcement learning from human feedback to enhance the summarization capabilities of language models.

Fine-tuningGPTT5Transformers

Nisan Stiennon

16 min read

Has Summary

OpenAI

Advanced

Fine-tuning GPT-2 from human preferences

The article discusses the fine-tuning of the 774M parameter GPT-2 language model using human feedback to improve performance on various natural language tasks, including summarization and stylistic...

Fine-tuningGPTTransformers

Daniel Ziegler

9 min read

Has Summary

OpenAI

Advanced

Better language models and their implications

The article discusses the advancements in language models, particularly focusing on GPT-2, which generates coherent text and performs various language tasks without task-specific training.

Fine-tuningGPTTransformers

Alec Radford

12 min read

Has Summary

You've reached the end! All 6 articles loaded.