How OpenAI Uses Neural Networks
23 engineering articles about Neural Networks from OpenAI's engineering team
Other OpenAI Technologies
Other Companies Using Neural Networks
Articles
Filter:
The article discusses various techniques for training large neural networks, focusing on the challenges and strategies involved in parallelizing model training across multiple GPUs.
Lilian Weng
9 min read
Has Summary
--
The article discusses the discovery of multimodal neurons in CLIP, an artificial intelligence model developed by OpenAI.
Gabriel Goh
11 min read
Has Summary
--
The article discusses the scaling of Kubernetes clusters to 7,500 nodes, highlighting the infrastructure's ability to support large machine learning models like GPT-3, CLIP, and DALL·E.
Eric Sigler
17 min read
Includes Code
Has Summary
--
The article discusses the advancements in AI efficiency, highlighting a significant decrease in the compute required to train neural networks since 2012.
Danny Hernandez
14 min read
Has Summary
--
The article discusses OpenAI's achievement with OpenAI Five, an AI system that defeated world champions in Dota 2, highlighting the challenges of long time horizons, imperfect information, and comp...
Christopher Berner
2 min read
Has Summary
--
OpenAI Five has made history by defeating the world champion Dota 2 team OG in a competitive match, marking the first time an AI has triumphed over professional esports players in a live setting.
OpenAI
8 min read
Has Summary
--
Neural MMO is a massively multiagent game environment designed for reinforcement learning agents, supporting a large number of agents in a persistent and open-ended task.
Joseph Suarez
6 min read
Has Summary
--
The article discusses how AI training scales by examining the gradient noise scale, a statistical metric that predicts the parallelizability of neural network training.
Sam McCandlish
9 min read
Has Summary
--
The article discusses the results of the OpenAI Five benchmark, where the AI system played against top Dota 2 players.
OpenAI
4 min read
Has Summary
--
OpenAI Five is a team of five neural networks that has begun defeating amateur human teams in the complex video game Dota 2.
Greg Brockman
17 min read
Includes Code
Has Summary
--
The article discusses the exponential growth in compute power used for AI training since 2012, highlighting a 300,000x increase in compute with a doubling time of 3. 4 months.
Dario Amodei
8 min read
Has Summary
--
The article discusses the challenges and solutions encountered while scaling Kubernetes to over 2,500 nodes, detailing specific issues with components like etcd, Kube masters, and Docker image pull...
Christopher Berner
9 min read
Includes Code
Has Summary
--
The article discusses the release of optimized GPU kernels for block-sparse neural network architectures, which can significantly outperform traditional libraries like cuBLAS and cuSPARSE.
Scott Gray
6 min read
Includes Code
Has Summary
--
The article discusses a novel method for L₀ regularization in neural networks, focusing on pruning weights to zero during training.
Christos Louizos
2 min read
Has Summary
--
The article discusses how deep linear networks, despite being theoretically linear, can perform nonlinear computations due to the limitations of floating-point arithmetic.
Jakob Foerster
5 min read
Includes Code
Has Summary
--
The article discusses the advancements in AI through the self-play training of a Dota 2 bot, showcasing how it evolved from below human performance to defeating top professional players.
OpenAI
8 min read
Has Summary
--
The article discusses the release of a high-performance Python library for robotic simulation using the MuJoCo engine, highlighting its capabilities and performance improvements.
Jonas Schneider
3 min read
Includes Code
Has Summary
--
Roboschool is an open-source software for robot simulation integrated with OpenAI Gym, aimed at providing realistic environments for training robots.
OpenAI Team
5 min read
Includes Code
Has Summary
--
The article discusses a novel framework for hierarchical reinforcement learning using Stochastic Neural Networks, aimed at addressing challenges in tasks with sparse rewards or long horizons.
Carlos Florensa
2 min read
Has Summary
--
The article discusses the infrastructure necessary for deep learning, emphasizing the importance of a robust setup to facilitate research and experimentation.
AWSChefDeep LearningDockerKerasKubernetesNeural NetworksOpenCVPackerTensorBoardTensorFlowTerraformWhisper
Vicki Cheung
9 min read
Has Summary
--
The article discusses generative models, a branch of unsupervised learning techniques in machine learning, detailing their significance, applications, and recent advancements.
Andrej Karpathy
16 min read
Includes Code
Has Summary
--
The article discusses weight normalization, a reparameterization technique for neural networks that separates the length of weight vectors from their direction.
Tim Salimans
2 min read
Has Summary
--
You've reached the end! All 23 articles loaded.