How OpenAI Uses Reinforcement Learning

20 engineering articles about Reinforcement Learning from OpenAI's engineering team

Other OpenAI Technologies

GPT(177)Transformers(23)Whisper(23)Neural Networks(23)Artificial Intelligence(19)GPT-4(17)

Other Companies Using Reinforcement Learning

Articles

Filter:

OpenAI

Intermediate

Deliberative alignment: reasoning enables safer language models

The article discusses a new alignment strategy called deliberative alignment, which teaches reasoning to language models to enhance their safety.

ClaudeConstitutional AIGeminiGPTReinforcement LearningRLHF

Melody Guan

8 min read

Has Summary

OpenAI

Beginner

Advancing red teaming with people and AI

The article discusses advancements in red teaming methodologies at OpenAI, focusing on the integration of human and AI efforts to identify potential risks in AI systems.

GPTReinforcement Learning

OpenAI

8 min read

Has Summary

OpenAI

Advanced

Finding GPT-4’s mistakes with GPT-4

The article discusses CriticGPT, a model based on GPT-4, designed to identify errors in ChatGPT responses.

GPTGPT-4Reinforcement LearningRLHF

Nat McAleese

5 min read

Includes Code

Has Summary

OpenAI

Intermediate

Benchmarking safe exploration in deep reinforcement learning

The article discusses the importance of safe exploration in deep reinforcement learning (RL), particularly in environments where safety is critical.

Reinforcement Learning

Alex Ray

2 min read

Has Summary

OpenAI

Intermediate

Quantifying generalization in reinforcement learning

The article discusses the challenges of generalization in reinforcement learning (RL) and introduces CoinRun, a training environment designed to quantify an agent's ability to transfer experience t...

LSTMReinforcement Learning

Karl Cobbe

7 min read

Has Summary

OpenAI

Intermediate

Some considerations on learning to explore via meta-reinforcement learning

The article discusses exploration in meta-reinforcement learning, introducing two new algorithms: E-MAML and E-RL².

Reinforcement Learning

Bradly Stadie

1 min read

Has Summary

OpenAI

Intermediate

Multi-Goal Reinforcement Learning: Challenging robotics environments and request for research

The article discusses Multi-Goal Reinforcement Learning, presenting a suite of challenging continuous control tasks integrated with OpenAI Gym, and outlines research ideas to enhance reinforcement ...

Reinforcement Learning

Matthias Plappert

1 min read

Has Summary

OpenAI

Intermediate

Requests for Research 2.0

The article 'Requests for Research 2. 0' presents a new set of seven unsolved problems identified during OpenAI's research.

Artificial IntelligenceLSTMReinforcement LearningTransfer LearningTransformerTransformers

Ilya Sutskever

7 min read

Includes Code

Has Summary

OpenAI

Advanced

Asymmetric actor critic for image-based robot learning

The article discusses the Asymmetric Actor-Critic method for image-based robot learning, highlighting its advantages in training control policies using physics simulators.

Reinforcement Learning

Lerrel Pinto

2 min read

Has Summary

OpenAI

Beginner

Sim-to-real transfer of robotic control with dynamics randomization

This article discusses the concept of sim-to-real transfer in robotic control, specifically focusing on dynamics randomization as a method to bridge the gap between simulation and real-world applic...

Reinforcement Learning

Xue Bin Peng

2 min read

Has Summary

OpenAI

Advanced

Hindsight Experience Replay

Hindsight Experience Replay is a novel technique in Reinforcement Learning (RL) that addresses the challenge of sparse rewards by enabling sample-efficient learning.

Reinforcement Learning

Marcin Andrychowicz

2 min read

Has Summary

OpenAI

Intermediate

Stochastic Neural Networks for hierarchical reinforcement learning

The article discusses a novel framework for hierarchical reinforcement learning using Stochastic Neural Networks, aimed at addressing challenges in tasks with sparse rewards or long horizons.

Neural NetworksReinforcement Learning

Carlos Florensa

2 min read

Has Summary

OpenAI

Intermediate

One-shot imitation learning

The article discusses one-shot imitation learning, a meta-learning framework that enables robots to learn from minimal demonstrations and generalize to new tasks without extensive feature engineeri...

Reinforcement Learning

Yan Duan

2 min read

Has Summary

OpenAI

Intermediate

Third-person imitation learning

The article discusses third-person imitation learning as a method to train agents in reinforcement learning (RL) without requiring first-person demonstrations.

Reinforcement Learning

Bradly Stadie

2 min read

Has Summary

OpenAI

Advanced

Attacking machine learning with adversarial examples

The article discusses adversarial examples in machine learning, which are inputs deliberately designed to mislead models.

Deep LearningMachine LearningReinforcement Learning

Ian Goodfellow

10 min read

Has Summary

OpenAI

Intermediate

#Exploration: A study of count-based exploration for deep reinforcement learning

The article explores count-based exploration algorithms in deep reinforcement learning, highlighting their effectiveness in high-dimensional state spaces.

Reinforcement Learning

Haoran Tang

2 min read

Has Summary

OpenAI

Intermediate

A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models

This article explores the connections between Generative Adversarial Networks (GANs), Inverse Reinforcement Learning (IRL), and Energy-Based Models (EBMs).

Generative Adversarial NetworksGPTReinforcement Learning

Chelsea Finn

2 min read

Has Summary

OpenAI

Advanced

RL²: Fast reinforcement learning via slow reinforcement learning

The article discusses RL², a novel approach to reinforcement learning that leverages recurrent neural networks to enhance learning efficiency.

Reinforcement Learning

Yan Duan

2 min read

Has Summary

OpenAI

Intermediate

Transfer from simulation to real world through learning deep inverse dynamics model

This article discusses the challenges and methodologies involved in transferring control policies from simulation environments to real-world robotic applications.

Reinforcement Learning

Paul Christiano

2 min read

Has Summary

OpenAI

Intermediate

Generative models

The article discusses generative models, a branch of unsupervised learning techniques in machine learning, detailing their significance, applications, and recent advancements.

Generative Adversarial NetworksNeural NetworksReinforcement LearningVariational Autoencoders

Andrej Karpathy

16 min read

Includes Code

Has Summary

You've reached the end! All 20 articles loaded.