How OpenAI Uses Transformers
23 engineering articles about Transformers from OpenAI's engineering team
Other OpenAI Technologies
Other Companies Using Transformers
Articles
Filter:
The article discusses the challenges of understanding neural networks and presents a novel approach to improve interpretability through sparse circuits.
OpenAI Team
7 min read
Includes Code
Has Summary
--
BrowseComp is a newly introduced benchmark designed to evaluate the capabilities of AI agents in locating hard-to-find information on the internet.
Jason Wei
11 min read
Includes Code
Has Summary
--
The article introduces GPT-4o, a new model from OpenAI that enhances human-computer interaction by accepting and generating text, audio, images, and video.
OpenAI
9 min read
Has Summary
--
The article discusses the training and capabilities of Sora, a video generation model that utilizes text-conditional diffusion techniques to create high-fidelity videos.
Tim Brooks
11 min read
Has Summary
--
The article explores the potential impact of Generative Pre-trained Transformers (GPTs) on the U. S.
OpenAI Team
1 min read
Has Summary
--
The article discusses the development of a neural theorem prover for Lean, which is capable of solving complex high-school math olympiad problems.
Stanislas Polu
6 min read
Includes Code
Has Summary
--
The article discusses advancements in AI systems for solving grade school math word problems, highlighting a model that achieves nearly double the accuracy of a fine-tuned GPT-3 model.
Karl Cobbe
5 min read
Has Summary
--
The article discusses the evaluation of large language models trained on code, specifically focusing on Codex, a model fine-tuned on publicly available code from GitHub.
Mark Chen
2 min read
Has Summary
--
The article discusses the discovery of multimodal neurons in CLIP, an artificial intelligence model developed by OpenAI.
Gabriel Goh
11 min read
Has Summary
--
DALL·E is a 12-billion parameter version of GPT-3 designed to generate images from text descriptions.
Aditya Ramesh
11 min read
Has Summary
--
The article introduces CLIP (Contrastive Language–Image Pre-training), a neural network that learns visual concepts from natural language supervision.
Alec Radford
18 min read
Has Summary
--
The article discusses the application of reinforcement learning from human feedback to enhance the summarization capabilities of language models.
Nisan Stiennon
16 min read
Has Summary
--
The article discusses the advancements in natural language processing (NLP) through the development of GPT-3, a language model with 175 billion parameters that excels in few-shot learning.
Tom Brown
2 min read
Has Summary
--
Prafulla Dhariwal
15 min read
Has Summary
--
The article discusses the release of the largest version of GPT-2, which contains 1. 5 billion parameters.
OpenAI Team
5 min read
Has Summary
--
The article discusses the fine-tuning of the 774M parameter GPT-2 language model using human feedback to improve performance on various natural language tasks, including summarization and stylistic...
Daniel Ziegler
9 min read
Has Summary
--
OpenAI Team
7 min read
Includes Code
Has Summary
--
The article discusses the development of Sparse Transformers, a novel deep neural network architecture that enhances the prediction of sequences in various domains, including text, images, and soun...
Rewon Child
7 min read
Has Summary
--
The article discusses the advancements in language models, particularly focusing on GPT-2, which generates coherent text and performs various language tasks without task-specific training.
Alec Radford
12 min read
Has Summary
--
The article 'Requests for Research 2. 0' presents a new set of seven unsolved problems identified during OpenAI's research.
Ilya Sutskever
7 min read
Includes Code
Has Summary
--
The article discusses the concept of competitive self-play in AI training, highlighting its effectiveness in enabling simulated AIs to learn complex physical skills without explicit environment des...
Trapit Bansal
4 min read
Has Summary
--
You've reached the end! All 23 articles loaded.