OpenAI logo

How OpenAI Uses Whisper

23 engineering articles about Whisper from OpenAI's engineering team

Articles

Filter:
OpenAI logo
OpenAI
Advanced
The article introduces gpt-oss, two state-of-the-art open-weight language models, gpt-oss-120b and gpt-oss-20b, which excel in reasoning tasks and are optimized for deployment on consumer hardware.
OpenAI logo
OpenAI
Intermediate
OpenAI has launched next-generation audio models that enhance voice agent capabilities through improved speech-to-text and text-to-speech functionalities.
OpenAI
6 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Advanced
Internet (Expert N=39, Novice N=28)
OpenAI logo
OpenAI
Intermediate
The article introduces Whisper, an automatic speech recognition (ASR) system developed by OpenAI, trained on 680,000 hours of multilingual and multitask supervised data.
OpenAI Team
3 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses a novel two-stage model for hierarchical text-conditional image generation using CLIP latents.
Aditya Ramesh
1 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article introduces Triton, an open-source programming language designed for efficient GPU programming in neural networks.
Philippe Tillet
10 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Intermediate
DALL·E is a 12-billion parameter version of GPT-3 designed to generate images from text descriptions.
Aditya Ramesh
11 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
Jukebox is a neural network developed by OpenAI that generates music, including rudimentary singing, as raw audio across various genres and artist styles.
Prafulla Dhariwal
15 min read
Has Summary
--
OpenAI logo
OpenAI
Beginner
MuseNet is a deep neural network developed by OpenAI that generates 4-minute musical compositions using 10 different instruments and blends various musical styles.
OpenAI Team
7 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the development of Sparse Transformers, a novel deep neural network architecture that enhances the prediction of sequences in various domains, including text, images, and soun...
Rewon Child
7 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
Neural MMO is a massively multiagent game environment designed for reinforcement learning agents, supporting a large number of agents in a persistent and open-ended task.
Joseph Suarez
6 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
Gym Retro is a platform for reinforcement learning research that expands the available game count to over 1,000 across various emulators.
Vicki Pfau
4 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses the release of eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay (HER) developed for robotics research.
Matthias Plappert
9 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses the release of optimized GPU kernels for block-sparse neural network architectures, which can significantly outperform traditional libraries like cuBLAS and cuSPARSE.
Scott Gray
6 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the release of two new OpenAI Baselines implementations: ACKTR and A2C.
Yuhuai Wu
5 min read
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses the RL-Teacher, an open-source implementation designed to train AI systems using human feedback instead of traditional reward functions.
Tom Brown
2 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Intermediate
Proximal Policy Optimization (PPO) is a new class of reinforcement learning algorithms that offers comparable or superior performance to state-of-the-art methods while being simpler to implement an...
John Schulman
4 min read
Has Summary
--
OpenAI logo
OpenAI
Advanced
The article discusses the release of a high-performance Python library for robotic simulation using the MuJoCo engine, highlighting its capabilities and performance improvements.
Jonas Schneider
3 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Intermediate
OpenAI Baselines is an initiative to open-source reinforcement learning algorithms, starting with DQN and its variants.
Szymon Sidor
5 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Intermediate
Roboschool is an open-source software for robot simulation integrated with OpenAI Gym, aimed at providing realistic environments for training robots.
OpenAI Team
5 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Intermediate
The article discusses Universe, a software platform developed by OpenAI for measuring and training AI's general intelligence across various applications, including games and websites.
OpenAI logo
OpenAI
Intermediate
The article discusses the infrastructure necessary for deep learning, emphasizing the importance of a robust setup to facilitate research and experimentation.
OpenAI logo
OpenAI
Intermediate
OpenAI Gym Beta is a toolkit designed for developing and comparing reinforcement learning (RL) algorithms.
Greg Brockman
6 min read
Has Summary
--

You've reached the end! All 23 articles loaded.