OpenAI logo

How OpenAI Uses Transformers

23 engineering articles about Transformers from OpenAI's engineering team

Articles

Filter:
OpenAI logo
OpenAI
Intermediate
The article discusses the challenges of understanding neural networks and presents a novel approach to improve interpretability through sparse circuits.
OpenAI Team
7 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Advanced
BrowseComp is a newly introduced benchmark designed to evaluate the capabilities of AI agents in locating hard-to-find information on the internet.
Jason Wei
11 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article introduces GPT-4o, a new model from OpenAI that enhances human-computer interaction by accepting and generating text, audio, images, and video.
OpenAI
9 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses the training and capabilities of Sora, a video generation model that utilizes text-conditional diffusion techniques to create high-fidelity videos.
OpenAI logo
OpenAI
Intermediate
The article explores the potential impact of Generative Pre-trained Transformers (GPTs) on the U. S.
OpenAI Team
1 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
GPT-4 is the latest milestone in OpenAI's deep learning efforts, showcasing a large multimodal model that accepts both image and text inputs.
OpenAI
15 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the development of a neural theorem prover for Lean, which is capable of solving complex high-school math olympiad problems.
Stanislas Polu
6 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses advancements in AI systems for solving grade school math word problems, highlighting a model that achieves nearly double the accuracy of a fine-tuned GPT-3 model.
Karl Cobbe
5 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses the evaluation of large language models trained on code, specifically focusing on Codex, a model fine-tuned on publicly available code from GitHub.
Mark Chen
2 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses the discovery of multimodal neurons in CLIP, an artificial intelligence model developed by OpenAI.
OpenAI logo
OpenAI
Intermediate
DALL·E is a 12-billion parameter version of GPT-3 designed to generate images from text descriptions.
Aditya Ramesh
11 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article introduces CLIP (Contrastive Language–Image Pre-training), a neural network that learns visual concepts from natural language supervision.
Alec Radford
18 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the application of reinforcement learning from human feedback to enhance the summarization capabilities of language models.
Nisan Stiennon
16 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses Image GPT, a generative model that applies the transformer architecture used in language models to image generation.
OpenAI logo
OpenAI
Intermediate
The article discusses the advancements in natural language processing (NLP) through the development of GPT-3, a language model with 175 billion parameters that excels in few-shot learning.
Tom Brown
2 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
Jukebox is a neural network developed by OpenAI that generates music, including rudimentary singing, as raw audio across various genres and artist styles.
Prafulla Dhariwal
15 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the release of the largest version of GPT-2, which contains 1. 5 billion parameters.
OpenAI Team
5 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the fine-tuning of the 774M parameter GPT-2 language model using human feedback to improve performance on various natural language tasks, including summarization and stylistic...
Daniel Ziegler
9 min read
Has Summary
--
OpenAI logo
OpenAI
Beginner
MuseNet is a deep neural network developed by OpenAI that generates 4-minute musical compositions using 10 different instruments and blends various musical styles.
OpenAI Team
7 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the development of Sparse Transformers, a novel deep neural network architecture that enhances the prediction of sequences in various domains, including text, images, and soun...
Rewon Child
7 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the advancements in language models, particularly focusing on GPT-2, which generates coherent text and performs various language tasks without task-specific training.
Alec Radford
12 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article 'Requests for Research 2. 0' presents a new set of seven unsolved problems identified during OpenAI's research.
OpenAI logo
OpenAI
Intermediate
The article discusses the concept of competitive self-play in AI training, highlighting its effectiveness in enabling simulated AIs to learn complex physical skills without explicit environment des...
Trapit Bansal
4 min read
Has Summary
--

You've reached the end! All 23 articles loaded.