OpenAI logo

How OpenAI Uses Reinforcement Learning

20 engineering articles about Reinforcement Learning from OpenAI's engineering team

Articles

Filter:
OpenAI logo
OpenAI
Intermediate
The article discusses a new alignment strategy called deliberative alignment, which teaches reasoning to language models to enhance their safety.
OpenAI logo
OpenAI
Beginner
The article discusses advancements in red teaming methodologies at OpenAI, focusing on the integration of human and AI efforts to identify potential risks in AI systems.
OpenAI
8 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses CriticGPT, a model based on GPT-4, designed to identify errors in ChatGPT responses.
Nat McAleese
5 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses the importance of safe exploration in deep reinforcement learning (RL), particularly in environments where safety is critical.
Alex Ray
2 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses the challenges of generalization in reinforcement learning (RL) and introduces CoinRun, a training environment designed to quantify an agent's ability to transfer experience t...
Karl Cobbe
7 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses exploration in meta-reinforcement learning, introducing two new algorithms: E-MAML and E-RL².
Bradly Stadie
1 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses Multi-Goal Reinforcement Learning, presenting a suite of challenging continuous control tasks integrated with OpenAI Gym, and outlines research ideas to enhance reinforcement ...
Matthias Plappert
1 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article 'Requests for Research 2. 0' presents a new set of seven unsolved problems identified during OpenAI's research.
OpenAI logo
OpenAI
Advanced
The article discusses the Asymmetric Actor-Critic method for image-based robot learning, highlighting its advantages in training control policies using physics simulators.
Lerrel Pinto
2 min read
Has Summary
--
OpenAI logo
OpenAI
Beginner
This article discusses the concept of sim-to-real transfer in robotic control, specifically focusing on dynamics randomization as a method to bridge the gap between simulation and real-world applic...
Xue Bin Peng
2 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
Hindsight Experience Replay is a novel technique in Reinforcement Learning (RL) that addresses the challenge of sparse rewards by enabling sample-efficient learning.
Marcin Andrychowicz
2 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses a novel framework for hierarchical reinforcement learning using Stochastic Neural Networks, aimed at addressing challenges in tasks with sparse rewards or long horizons.
Carlos Florensa
2 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses one-shot imitation learning, a meta-learning framework that enables robots to learn from minimal demonstrations and generalize to new tasks without extensive feature engineeri...
Yan Duan
2 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses third-person imitation learning as a method to train agents in reinforcement learning (RL) without requiring first-person demonstrations.
Bradly Stadie
2 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses adversarial examples in machine learning, which are inputs deliberately designed to mislead models.
Ian Goodfellow
10 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article explores count-based exploration algorithms in deep reinforcement learning, highlighting their effectiveness in high-dimensional state spaces.
Haoran Tang
2 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
This article explores the connections between Generative Adversarial Networks (GANs), Inverse Reinforcement Learning (IRL), and Energy-Based Models (EBMs).
OpenAI logo
OpenAI
Advanced
The article discusses RL², a novel approach to reinforcement learning that leverages recurrent neural networks to enhance learning efficiency.
Yan Duan
2 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
This article discusses the challenges and methodologies involved in transferring control policies from simulation environments to real-world robotic applications.
Paul Christiano
2 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses generative models, a branch of unsupervised learning techniques in machine learning, detailing their significance, applications, and recent advancements.

You've reached the end! All 20 articles loaded.