How OpenAI Uses Transformers

23 engineering articles about Transformers from OpenAI's engineering team

Other OpenAI Technologies

GPT(177)Whisper(23)Neural Networks(23)Reinforcement Learning(20)Artificial Intelligence(19)GPT-4(17)

Other Companies Using Transformers

Articles

Filter:

OpenAI

Intermediate

Understanding neural networks through sparse circuits

The article discusses the challenges of understanding neural networks and presents a novel approach to improve interpretability through sparse circuits.

GPTTransformers

OpenAI Team

7 min read

Includes Code

Has Summary

OpenAI

Advanced

BrowseComp: a benchmark for browsing agents

BrowseComp is a newly introduced benchmark designed to evaluate the capabilities of AI agents in locating hard-to-find information on the internet.

ClaudeGeminiGPTTransformers

Jason Wei

11 min read

Includes Code

Has Summary

OpenAI

Advanced

Hello GPT-4o

The article introduces GPT-4o, a new model from OpenAI that enhances human-computer interaction by accepting and generating text, audio, images, and video.

GeminiGPTGPT-4Transformers

OpenAI

9 min read

Has Summary

OpenAI

Intermediate

Video generation models as world simulators

The article discusses the training and capabilities of Sora, a video generation model that utilizes text-conditional diffusion techniques to create high-fidelity videos.

Computer VisionDiffusion ModelsEmbeddingGPTMachine LearningTransformerTransformers

Tim Brooks

11 min read

Has Summary

OpenAI

Intermediate

GPTs are GPTs: An early look at the labor market impact potential of large language models

The article explores the potential impact of Generative Pre-trained Transformers (GPTs) on the U. S.

GPTLarge Language ModelsTransformerTransformers

OpenAI Team

1 min read

Has Summary

OpenAI

Advanced

GPT-4

GPT-4 is the latest milestone in OpenAI's deep learning efforts, showcasing a large multimodal model that accepts both image and text inputs.

AzureGPTGPT-4PaLMRLHFTransformers

OpenAI

15 min read

Has Summary

OpenAI

Advanced

Solving (some) formal math olympiad problems

The article discusses the development of a neural theorem prover for Lean, which is capable of solving complex high-school math olympiad problems.

GolangTransformers

Stanislas Polu

6 min read

Includes Code

Has Summary

OpenAI

Intermediate

Solving math word problems

The article discusses advancements in AI systems for solving grade school math word problems, highlighting a model that achieves nearly double the accuracy of a fine-tuned GPT-3 model.

GPTTransformers

Karl Cobbe

5 min read

Has Summary

OpenAI

Intermediate

Evaluating large language models trained on code

The article discusses the evaluation of large language models trained on code, specifically focusing on Codex, a model fine-tuned on publicly available code from GitHub.

CopilotGPTLarge Language ModelsTransformers

Mark Chen

2 min read

Has Summary

OpenAI

Intermediate

Multimodal neurons in artificial neural networks

The article discusses the discovery of multimodal neurons in CLIP, an artificial intelligence model developed by OpenAI.

Artificial IntelligenceComputer VisionNeural NetworksResNetTransformers

Gabriel Goh

11 min read

Has Summary

OpenAI

Intermediate

DALL·E: Creating images from text

DALL·E is a 12-billion parameter version of GPT-3 designed to generate images from text descriptions.

GPTTransformersWhisper

Aditya Ramesh

11 min read

Has Summary

OpenAI

Intermediate

CLIP: Connecting text and images

The article introduces CLIP (Contrastive Language–Image Pre-training), a neural network that learns visual concepts from natural language supervision.

Computer VisionGPTResNetTransformerTransformers

Alec Radford

18 min read

Has Summary

OpenAI

Advanced

Learning to summarize with human feedback

The article discusses the application of reinforcement learning from human feedback to enhance the summarization capabilities of language models.

Fine-tuningGPTT5Transformers

Nisan Stiennon

16 min read

Has Summary

OpenAI

Advanced

Image GPT

The article discusses Image GPT, a generative model that applies the transformer architecture used in language models to image generation.

BERTConvolutional Neural NetworksGPTNeural NetworksResNetRoBERTaSupervised LearningT5Transfer LearningTransformersUnsupervised Learning

Mark Chen

20 min read

Has Summary

OpenAI

Intermediate

Language models are few-shot learners

The article discusses the advancements in natural language processing (NLP) through the development of GPT-3, a language model with 175 billion parameters that excels in few-shot learning.

GPTTransformers

Tom Brown

2 min read

Has Summary

OpenAI

Advanced

Jukebox

Jukebox is a neural network developed by OpenAI that generates music, including rudimentary singing, as raw audio across various genres and artist styles.

Artificial IntelligenceGPTMachine LearningTransformersWhisper

Prafulla Dhariwal

15 min read

Has Summary

OpenAI

Advanced

GPT-2: 1.5B release

The article discusses the release of the largest version of GPT-2, which contains 1. 5 billion parameters.

GPTMachine LearningTransformers

OpenAI Team

5 min read

Has Summary

OpenAI

Advanced

Fine-tuning GPT-2 from human preferences

The article discusses the fine-tuning of the 774M parameter GPT-2 language model using human feedback to improve performance on various natural language tasks, including summarization and stylistic...

Fine-tuningGPTTransformers

Daniel Ziegler

9 min read

Has Summary

OpenAI

Beginner

MuseNet

MuseNet is a deep neural network developed by OpenAI that generates 4-minute musical compositions using 10 different instruments and blends various musical styles.

GPTTransformerTransformersWhisper

OpenAI Team

7 min read

Includes Code

Has Summary

OpenAI

Advanced

Generative modeling with sparse transformers

The article discusses the development of Sparse Transformers, a novel deep neural network architecture that enhances the prediction of sequences in various domains, including text, images, and soun...

Self-AttentionTransformerTransformersWhisper

Rewon Child

7 min read

Has Summary

OpenAI

Advanced

Better language models and their implications

The article discusses the advancements in language models, particularly focusing on GPT-2, which generates coherent text and performs various language tasks without task-specific training.

Fine-tuningGPTTransformers

Alec Radford

12 min read

Has Summary

OpenAI

Intermediate

Requests for Research 2.0

The article 'Requests for Research 2. 0' presents a new set of seven unsolved problems identified during OpenAI's research.

Artificial IntelligenceLSTMReinforcement LearningTransfer LearningTransformerTransformers

Ilya Sutskever

7 min read

Includes Code

Has Summary

OpenAI

Intermediate

Competitive self-play

The article discusses the concept of competitive self-play in AI training, highlighting its effectiveness in enabling simulated AIs to learn complex physical skills without explicit environment des...

Transfer LearningTransformers

Trapit Bansal

4 min read

Has Summary

You've reached the end! All 23 articles loaded.