#
Whisper Programming Tutorials & Engineering Articles
43 Whisper tutorials, guides, and engineering insights from OpenAI, NVIDIA, Fly.io, and more
Companies Using This
Whisper Articles & Tutorials
Filter:
The article discusses NVIDIA's Blackwell Ultra architecture, which sets new inference records in the MLPerf Inference v5. 1 benchmark.
Zhihan Jiang
10 min read
Has Summary
--
The article introduces gpt-oss, two state-of-the-art open-weight language models, gpt-oss-120b and gpt-oss-20b, which excel in reasoning tasks and are optimized for deployment on consumer hardware.
The article discusses the integration of advanced agentic architectures in MONAI, an open-source framework for medical imaging, to create a multimodal medical AI ecosystem.
Michael Zephyr
7 min read
Has Summary
--
OpenAI has launched next-generation audio models that enhance voice agent capabilities through improved speech-to-text and text-to-speech functionalities.
OpenAI
6 min read
Includes Code
Has Summary
--
The article discusses the deployment of NVIDIA Riva's multilingual Automatic Speech Recognition (ASR) capabilities using Whisper and Canary architectures. It highlights the new features in Riva 2.
The article discusses the integration of NVIDIA ACE AI characters into games using the new In-Game Inferencing SDK (NVIGI).
The article discusses how the partnership between NVIDIA and Dataloop is transforming the preparation of multimodal datasets for large language models (LLMs).
Amit Bleiweiss
9 min read
Has Summary
--
The article discusses the UlangiziAI chatbot, which provides on-demand, AI-powered agricultural advice to farmers in Malawi through WhatsApp.
Cloudflare has announced significant upgrades to its AI platform, including Workers AI, AI Gateway, and Vectorize, aimed at enhancing performance, flexibility, and cost-effectiveness for developers.
Michelle Chen
14 min read
Has Summary
--
The article discusses how NVIDIA NeMo has accelerated automatic speech recognition (ASR) models, achieving up to 10x speed improvements through various optimizations.
Daniel Galvez
12 min read
Includes Code
Has Summary
--
NVIDIA has introduced its first on-device small language model (SLM) aimed at enhancing game character interactions, showcased in the game Mecha BREAK.
Fly. io has announced a significant price reduction for their NVIDIA L40S GPUs, now available at $1. 25 per hour.
Kurt Mackey
6 min read
Has Summary
--
OpenAI
52 min read
Includes Code
--
Developing Robust Georgian Automatic Speech Recognition with FastConformer Hybrid Transducer CTC BPE
The article discusses the development of a robust Automatic Speech Recognition (ASR) model for the Georgian language using the FastConformer Hybrid Transducer CTC BPE architecture.
Sofia Kostandian
9 min read
Includes Code
Has Summary
--
The article introduces Cloudflare's new feature for generating AI-powered captions for videos, simplifying the process for users by eliminating the need for third-party transcription services.
Mickie Betz
7 min read
Includes Code
Has Summary
--
The article discusses the integration of AI chatbots, particularly Gipi, with NVIDIA TensorRT-LLM and AI foundation models to enhance personalized learning experiences.
Nisanur Genc
5 min read
Has Summary
--
The article discusses the release of the NVIDIA NeMo Canary model, a state-of-the-art multilingual model for speech recognition and translation.
The article explores the nature and capabilities of Graphics Processing Units (GPUs), particularly in the context of AI/ML workloads.
Artificial IntelligenceCrystalGPTLarge Language ModelsMachine LearningMistralOllamaStable DiffusionWhisper
Xe Iaso
13 min read
Has Summary
--
The article discusses how to scale large language models to zero using Ollama on Fly. io, emphasizing the benefits of self-hosting AI tools and the efficient use of GPU resources.
Xe Iaso
11 min read
Includes Code
Has Summary
--
The article discusses how to utilize Fly. io's GPU Machines for running AI workloads, specifically focusing on the Whisper Webservice for audio transcription.
The article discusses the launch of Supermicro's liquid-cooled AI development platform, designed to facilitate the rapid deployment of AI workloads.
The article introduces Whisper, an automatic speech recognition (ASR) system developed by OpenAI, trained on 680,000 hours of multilingual and multitask supervised data.
OpenAI Team
3 min read
Has Summary
--
The article discusses the evolution of data structures in Yandex. Metrica, detailing the transition from MyISAM tables to LSM-trees and ultimately to the column-oriented database ClickHouse.
The article discusses a novel two-stage model for hierarchical text-conditional image generation using CLIP latents.
Aditya Ramesh
1 min read
Has Summary
--
The article introduces Triton, an open-source programming language designed for efficient GPU programming in neural networks.
Philippe Tillet
10 min read
Includes Code
Has Summary
--
DALL·E is a 12-billion parameter version of GPT-3 designed to generate images from text descriptions.
Aditya Ramesh
11 min read
Has Summary
--
Prafulla Dhariwal
15 min read
Has Summary
--
OpenAI Team
7 min read
Includes Code
Has Summary
--
The article discusses the development of Sparse Transformers, a novel deep neural network architecture that enhances the prediction of sequences in various domains, including text, images, and soun...
Rewon Child
7 min read
Has Summary
--
Neural MMO is a massively multiagent game environment designed for reinforcement learning agents, supporting a large number of agents in a persistent and open-ended task.
Joseph Suarez
6 min read
Has Summary
--
Vicki Pfau
4 min read
Includes Code
Has Summary
--
The article discusses the release of eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay (HER) developed for robotics research.
Matthias Plappert
9 min read
Includes Code
Has Summary
--
The article discusses the release of optimized GPU kernels for block-sparse neural network architectures, which can significantly outperform traditional libraries like cuBLAS and cuSPARSE.
Scott Gray
6 min read
Includes Code
Has Summary
--
The article discusses the release of two new OpenAI Baselines implementations: ACKTR and A2C.
Yuhuai Wu
5 min read
Has Summary
--
The article discusses the RL-Teacher, an open-source implementation designed to train AI systems using human feedback instead of traditional reward functions.
Tom Brown
2 min read
Includes Code
Has Summary
--
Proximal Policy Optimization (PPO) is a new class of reinforcement learning algorithms that offers comparable or superior performance to state-of-the-art methods while being simpler to implement an...
John Schulman
4 min read
Has Summary
--
The article discusses the release of a high-performance Python library for robotic simulation using the MuJoCo engine, highlighting its capabilities and performance improvements.
Jonas Schneider
3 min read
Includes Code
Has Summary
--
OpenAI Baselines is an initiative to open-source reinforcement learning algorithms, starting with DQN and its variants.
Szymon Sidor
5 min read
Includes Code
Has Summary
--
Roboschool is an open-source software for robot simulation integrated with OpenAI Gym, aimed at providing realistic environments for training robots.
OpenAI Team
5 min read
Includes Code
Has Summary
--
OpenAI
18 min read
Includes Code
Has Summary
--
The article discusses the infrastructure necessary for deep learning, emphasizing the importance of a robust setup to facilitate research and experimentation.
AWSChefDeep LearningDockerKerasKubernetesNeural NetworksOpenCVPackerTensorBoardTensorFlowTerraformWhisper
Vicki Cheung
9 min read
Has Summary
--
OpenAI Gym Beta is a toolkit designed for developing and comparing reinforcement learning (RL) algorithms.
The article discusses how LinkedIn operates Apache Samza at scale, focusing on its integration with Apache Kafka for processing high volumes of data.
Jon Bringhurst
11 min read
Has Summary
--
You've reached the end! All 43 articles loaded.