How OpenAI Uses Whisper

23 engineering articles about Whisper from OpenAI's engineering team

Other OpenAI Technologies

GPT(177)Transformers(23)Neural Networks(23)Reinforcement Learning(20)Artificial Intelligence(19)GPT-4(17)

Other Companies Using Whisper

NVIDIA(12)

Fly.io(4)

Cloudflare(2)

Articles

Filter:

OpenAI

Advanced

Introducing gpt-oss

The article introduces gpt-oss, two state-of-the-art open-weight language models, gpt-oss-120b and gpt-oss-20b, which excel in reasoning tasks and are optimized for deployment on consumer hardware.

ApacheAWSAzureEmbeddingGPTHugging FaceOllamaPyTorchRustTransformerVercelWhisper

OpenAI

15 min read

Has Summary

OpenAI

Intermediate

Introducing next-generation audio models in the API

OpenAI has launched next-generation audio models that enhance voice agent capabilities through improved speech-to-text and text-to-speech functionalities.

Few-shot LearningGPTWhisper

OpenAI

6 min read

Includes Code

Has Summary

OpenAI

Advanced

GPT-4o System Card

Internet (Expert N=39, Novice N=28)

AzureCrystalGeminiGPT-4Hugging FaceOpenAI APIPaLMSolidWhisper

OpenAI

52 min read

Includes Code

OpenAI

Intermediate

Introducing Whisper

The article introduces Whisper, an automatic speech recognition (ASR) system developed by OpenAI, trained on 680,000 hours of multilingual and multitask supervised data.

TransformerWhisper

OpenAI Team

3 min read

Has Summary

OpenAI

Intermediate

Hierarchical text-conditional image generation with CLIP latents

The article discusses a novel two-stage model for hierarchical text-conditional image generation using CLIP latents.

Whisper

Aditya Ramesh

1 min read

Has Summary

OpenAI

Advanced

Introducing Triton: Open-source GPU programming for neural networks

The article introduces Triton, an open-source programming language designed for efficient GPU programming in neural networks.

ApacheKubernetesMachine LearningNumbaNumPyPyTorchWarpWhisper

Philippe Tillet

10 min read

Includes Code

Has Summary

OpenAI

Intermediate

DALL·E: Creating images from text

DALL·E is a 12-billion parameter version of GPT-3 designed to generate images from text descriptions.

GPTTransformersWhisper

Aditya Ramesh

11 min read

Has Summary

OpenAI

Advanced

Jukebox

Jukebox is a neural network developed by OpenAI that generates music, including rudimentary singing, as raw audio across various genres and artist styles.

Artificial IntelligenceGPTMachine LearningTransformersWhisper

Prafulla Dhariwal

15 min read

Has Summary

OpenAI

Beginner

MuseNet

MuseNet is a deep neural network developed by OpenAI that generates 4-minute musical compositions using 10 different instruments and blends various musical styles.

GPTTransformerTransformersWhisper

OpenAI Team

7 min read

Includes Code

Has Summary

OpenAI

Advanced

Generative modeling with sparse transformers

The article discusses the development of Sparse Transformers, a novel deep neural network architecture that enhances the prediction of sequences in various domains, including text, images, and soun...

Self-AttentionTransformerTransformersWhisper

Rewon Child

7 min read

Has Summary

OpenAI

Intermediate

Neural MMO: A massively multiagent game environment

Neural MMO is a massively multiagent game environment designed for reinforcement learning agents, supporting a large number of agents in a persistent and open-ended task.

Neural NetworksPyTorchWhisper

Joseph Suarez

6 min read

Has Summary

OpenAI

Advanced

Gym Retro

Gym Retro is a platform for reinforcement learning research that expands the available game count to over 1,000 across various emulators.

Whisper

Vicki Pfau

4 min read

Includes Code

Has Summary

OpenAI

Intermediate

Ingredients for robotics research

The article discusses the release of eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay (HER) developed for robotics research.

Whisper

Matthias Plappert

9 min read

Includes Code

Has Summary

OpenAI

Intermediate

Block-sparse GPU kernels

The article discusses the release of optimized GPU kernels for block-sparse neural network architectures, which can significantly outperform traditional libraries like cuBLAS and cuSPARSE.

GPTNeural NetworksWhisper

Scott Gray

6 min read

Includes Code

Has Summary

OpenAI

Advanced

OpenAI Baselines: ACKTR & A2C

The article discusses the release of two new OpenAI Baselines implementations: ACKTR and A2C.

Whisper

Yuhuai Wu

5 min read

Has Summary

OpenAI

Intermediate

Gathering human feedback

The article discusses the RL-Teacher, an open-source implementation designed to train AI systems using human feedback instead of traditional reward functions.

Whisper

Tom Brown

2 min read

Includes Code

Has Summary

OpenAI

Intermediate

Proximal Policy Optimization

Proximal Policy Optimization (PPO) is a new class of reinforcement learning algorithms that offers comparable or superior performance to state-of-the-art methods while being simpler to implement an...

TensorFlowWhisper

John Schulman

4 min read

Has Summary

OpenAI

Advanced

Faster physics in Python

The article discusses the release of a high-performance Python library for robotic simulation using the MuJoCo engine, highlighting its capabilities and performance improvements.

CythonNeural NetworksNumPyWhisper

Jonas Schneider

3 min read

Includes Code

Has Summary

OpenAI

Intermediate

OpenAI Baselines: DQN

OpenAI Baselines is an initiative to open-source reinforcement learning algorithms, starting with DQN and its variants.

TensorFlowWhisper

Szymon Sidor

5 min read

Includes Code

Has Summary

OpenAI

Intermediate

Roboschool

Roboschool is an open-source software for robot simulation integrated with OpenAI Gym, aimed at providing realistic environments for training robots.

Neural NetworksWhisper

OpenAI Team

5 min read

Includes Code

Has Summary

OpenAI

Intermediate

Universe

The article discusses Universe, a software platform developed by OpenAI for measuring and training AI's general intelligence across various applications, including games and websites.

Computer VisionDockerJavaScriptKubernetesNumPyPercyTensorFlowWebSocketWhisper

OpenAI

18 min read

Includes Code

Has Summary

OpenAI

Intermediate

Infrastructure for deep learning

The article discusses the infrastructure necessary for deep learning, emphasizing the importance of a robust setup to facilitate research and experimentation.

AWSChefDeep LearningDockerKerasKubernetesNeural NetworksOpenCVPackerTensorBoardTensorFlowTerraformWhisper

Vicki Cheung

9 min read

Has Summary

OpenAI

Intermediate

OpenAI Gym Beta

OpenAI Gym Beta is a toolkit designed for developing and comparing reinforcement learning (RL) algorithms.

AWSWhisper

Greg Brockman

6 min read

Has Summary

You've reached the end! All 23 articles loaded.