How OpenAI Uses Transformer
13 engineering articles about Transformer from OpenAI's engineering team
Other OpenAI Technologies
Other Companies Using Transformer
Articles
Filter:
The article discusses Mirakl's vision for agentic commerce, emphasizing the integration of AI across the company to enhance workflows and product offerings.
OpenAI Team
4 min read
Has Summary
--
The article introduces gpt-oss, two state-of-the-art open-weight language models, gpt-oss-120b and gpt-oss-20b, which excel in reasoning tasks and are optimized for deployment on consumer hardware.
The article discusses the training and capabilities of Sora, a video generation model that utilizes text-conditional diffusion techniques to create high-fidelity videos.
Tim Brooks
11 min read
Has Summary
--
The article explores the potential impact of Generative Pre-trained Transformers (GPTs) on the U. S.
OpenAI Team
1 min read
Has Summary
--
The article introduces Whisper, an automatic speech recognition (ASR) system developed by OpenAI, trained on 680,000 hours of multilingual and multitask supervised data.
OpenAI Team
3 min read
Has Summary
--
The article discusses various techniques for training large neural networks, focusing on the challenges and strategies involved in parallelizing model training across multiple GPUs.
Lilian Weng
9 min read
Has Summary
--
The article introduces CLIP (Contrastive Language–Image Pre-training), a neural network that learns visual concepts from natural language supervision.
Alec Radford
18 min read
Has Summary
--
The article discusses the advancements in AI efficiency, highlighting a significant decrease in the compute required to train neural networks since 2012.
Danny Hernandez
14 min read
Has Summary
--
The article discusses the six-month follow-up on the GPT-2 language model, detailing its release, partnerships for research, and insights gained regarding the model's societal implications and pote...
OpenAI Team
7 min read
Has Summary
--
OpenAI Team
7 min read
Includes Code
Has Summary
--
The article discusses the development of Sparse Transformers, a novel deep neural network architecture that enhances the prediction of sequences in various domains, including text, images, and soun...
Rewon Child
7 min read
Has Summary
--
The article discusses advancements in language understanding through unsupervised learning, highlighting the effectiveness of combining transformers and unsupervised pre-training.
Alec Radford
8 min read
Includes Code
Has Summary
--
The article 'Requests for Research 2. 0' presents a new set of seven unsolved problems identified during OpenAI's research.
Ilya Sutskever
7 min read
Includes Code
Has Summary
--
You've reached the end! All 13 articles loaded.