How OpenAI Uses Reinforcement Learning
20 engineering articles about Reinforcement Learning from OpenAI's engineering team
Other OpenAI Technologies
Other Companies Using Reinforcement Learning
Articles
Filter:
The article discusses a new alignment strategy called deliberative alignment, which teaches reasoning to language models to enhance their safety.
Melody Guan
8 min read
Has Summary
--
The article discusses advancements in red teaming methodologies at OpenAI, focusing on the integration of human and AI efforts to identify potential risks in AI systems.
OpenAI
8 min read
Has Summary
--
The article discusses CriticGPT, a model based on GPT-4, designed to identify errors in ChatGPT responses.
Nat McAleese
5 min read
Includes Code
Has Summary
--
The article discusses the importance of safe exploration in deep reinforcement learning (RL), particularly in environments where safety is critical.
Alex Ray
2 min read
Has Summary
--
The article discusses the challenges of generalization in reinforcement learning (RL) and introduces CoinRun, a training environment designed to quantify an agent's ability to transfer experience t...
Karl Cobbe
7 min read
Has Summary
--
The article discusses exploration in meta-reinforcement learning, introducing two new algorithms: E-MAML and E-RL².
Bradly Stadie
1 min read
Has Summary
--
The article discusses Multi-Goal Reinforcement Learning, presenting a suite of challenging continuous control tasks integrated with OpenAI Gym, and outlines research ideas to enhance reinforcement ...
Matthias Plappert
1 min read
Has Summary
--
The article 'Requests for Research 2. 0' presents a new set of seven unsolved problems identified during OpenAI's research.
Ilya Sutskever
7 min read
Includes Code
Has Summary
--
The article discusses the Asymmetric Actor-Critic method for image-based robot learning, highlighting its advantages in training control policies using physics simulators.
Lerrel Pinto
2 min read
Has Summary
--
This article discusses the concept of sim-to-real transfer in robotic control, specifically focusing on dynamics randomization as a method to bridge the gap between simulation and real-world applic...
Xue Bin Peng
2 min read
Has Summary
--
Hindsight Experience Replay is a novel technique in Reinforcement Learning (RL) that addresses the challenge of sparse rewards by enabling sample-efficient learning.
Marcin Andrychowicz
2 min read
Has Summary
--
The article discusses a novel framework for hierarchical reinforcement learning using Stochastic Neural Networks, aimed at addressing challenges in tasks with sparse rewards or long horizons.
Carlos Florensa
2 min read
Has Summary
--
The article discusses one-shot imitation learning, a meta-learning framework that enables robots to learn from minimal demonstrations and generalize to new tasks without extensive feature engineeri...
Yan Duan
2 min read
Has Summary
--
The article discusses third-person imitation learning as a method to train agents in reinforcement learning (RL) without requiring first-person demonstrations.
Bradly Stadie
2 min read
Has Summary
--
The article discusses adversarial examples in machine learning, which are inputs deliberately designed to mislead models.
Ian Goodfellow
10 min read
Has Summary
--
The article explores count-based exploration algorithms in deep reinforcement learning, highlighting their effectiveness in high-dimensional state spaces.
Haoran Tang
2 min read
Has Summary
--
This article explores the connections between Generative Adversarial Networks (GANs), Inverse Reinforcement Learning (IRL), and Energy-Based Models (EBMs).
Chelsea Finn
2 min read
Has Summary
--
The article discusses RL², a novel approach to reinforcement learning that leverages recurrent neural networks to enhance learning efficiency.
Yan Duan
2 min read
Has Summary
--
This article discusses the challenges and methodologies involved in transferring control policies from simulation environments to real-world robotic applications.
Paul Christiano
2 min read
Has Summary
--
The article discusses generative models, a branch of unsupervised learning techniques in machine learning, detailing their significance, applications, and recent advancements.
Andrej Karpathy
16 min read
Includes Code
Has Summary
--
You've reached the end! All 20 articles loaded.