# Hugging Face Programming Tutorials & Engineering Articles

257 Hugging Face tutorials, guides, and engineering insights from NVIDIA, Google, ClickHouse, and more

Hugging Face Articles & Tutorials

NVIDIA
Advanced
The article discusses the use of NVFP4 low-precision model training to achieve higher throughput without sacrificing accuracy in AI model training.
Aditya Vavre
7 min read
Includes Code
Has Summary
--
Notion
Intermediate
The article discusses Notion's journey in scaling its vector search infrastructure, achieving a 10x increase in scale while reducing costs by 90% over two years.
Preeti Gondi, Mickey Liu, Nathan Louie, Calder Lund, Jacob Sager
10 min read
Has Summary
--
NVIDIA
Advanced
The article discusses how NVIDIA's hardware-software co-design significantly enhanced the inference performance of Sarvam AI's Sovereign 30B model, achieving a 4x speedup on NVIDIA Blackwell archit...
Utkarsh Uppal
14 min read
Has Summary
--
NVIDIA
Advanced
The article discusses NVIDIA TensorRT LLM AutoDeploy, a beta feature that automates the inference optimization process for large language models (LLMs).
Lucas Liebenwein
8 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
Kimi K2.5 is an advanced multimodal vision language model (VLM) developed by Kimi, optimized for various AI tasks.
Anu Srivastava
4 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article provides a comprehensive guide on building a document processing pipeline using NVIDIA Nemotron RAG, focusing on the extraction of structured data from complex documents like PDFs.
Chia-Chih Chen
9 min read
Includes Code
Has Summary
--
Google
Advanced
This tutorial demonstrates how to fine-tune FunctionGemma, a small language model for translating natural language into API calls, using Google's Tunix library on TPUs.
Wei Wei
4 min read
Includes Code
Has Summary
--
Microsoft
Intermediate
This article from Microsoft Engineering identifies a specific failure mode in production-scale reinforcement learning for tool-using agents: variance amplification in tool-conditioned contexts that...
Aditya Challapally
7 min read
Includes Code
Has Summary
--
Google
Advanced
LiteRT has evolved from its TensorFlow Lite foundation into a universal on-device AI inference framework, now offering production-ready GPU acceleration across six platforms and streamlined NPU int...
Lu Wang, Chintan Parikh, Jingjiang Li, Terry Heo
9 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses how to utilize NVIDIA Earth-2 to downscale coarse climate projections into high-resolution, bias-corrected fields, enabling better assessment of local climate extremes.
Georg Ertl
11 min read
Includes Code
Has Summary
--
Google
Intermediate
This article demonstrates how to fine-tune FunctionGemma, a specialized 270M parameter Gemma 3 model designed for function calling in agentic AI systems.
Juyeong Ji
5 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
This article explores how to train an AI agent to operate a new Command Line Interface (CLI) using synthetic data generation and reinforcement learning.
Chris Alexiuk
11 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the development of generalist humanoid capabilities using NVIDIA Isaac GR00T N1.6 through a sim-to-real workflow.
Edith Llontop
7 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses the introduction of NVIDIA TensorRT Edge-LLM, an open-source C++ framework designed for high-performance inference of Large Language Models (LLMs) and Vision Language Models (...
Lin Chai
5 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the latest software and model optimizations for NVIDIA DGX Spark, highlighting significant performance improvements in AI workflows.
Allen Bourgoyne
5 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the NVIDIA Rubin platform, which introduces six new chips designed to create a powerful AI supercomputer.
--
NVIDIA
Advanced
The article introduces NVIDIA Isaac Lab-Arena, an open-source framework designed for efficient and scalable evaluation of generalist robot policies in simulation.
Sangeeta Subramanian
9 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
This article provides a comprehensive tutorial on building a voice agent using NVIDIA's Nemotron models, focusing on retrieval-augmented generation (RAG) and safety guardrails.
Chris Alexiuk
8 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses NVIDIA's Alpamayo, a comprehensive ecosystem designed for developing reasoning-based autonomous vehicle (AV) systems.
Marco Pavone
11 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the integration of AI Physics into Technology Computer-Aided Design (TCAD) simulations, highlighting its significance in semiconductor manufacturing.
--
NVIDIA
Intermediate
The article discusses the NVIDIA Nemotron 3, a family of open models designed for agentic AI systems, emphasizing its efficiency and accuracy through innovative architectures and techniques.
--
NVIDIA
Intermediate
The article discusses the creation of privacy-preserving evaluation benchmarks using synthetic data, particularly in regulated domains like healthcare.
Isabel Hulseman
11 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the implementation of Edge AI on the NVIDIA Jetson platform, focusing on the use of Large Language Models (LLMs), Vision Language Models (VLMs), and Foundation Models in robot...
Chitoku Yato
9 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses enhancing the quality of 3D Gaussian reconstruction for simulation, focusing on the use of NVIDIA's Fixer model to eliminate rendering artifacts.
Wonsik Han
7 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The NVIDIA-accelerated Mistral 3 open model family offers developers and enterprises industry-leading accuracy, efficiency, and customization capabilities.
Anu Srivastava
6 min read
Has Summary
--
Uber
Advanced
The article discusses the evolution and scaling of Uber's Delivery Search Platform, emphasizing the transition from traditional lexical search to a semantic search model that enhances user experien...
Divya Nagar, Zheng Liu, Jiasen Xu, Bo Ling, Haoyang Chen
11 min read
Has Summary
--
NVIDIA
Intermediate
The article introduces Broadened Reinforcement Learning (BroRL), a new paradigm that enhances the training of large language models (LLMs) by focusing on rollout scaling rather than just increasing...
Jian Hu
6 min read
Has Summary
--
NVIDIA
Advanced
This article discusses how to achieve 4x faster inference for math problem solving using large language models by optimizing the serving stack, quantization strategy, and decoding methods.
Igor Gitman
7 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses NVIDIA Grove, a Kubernetes API designed to streamline complex AI inference workloads by managing multicomponent systems.
Sanjay Chatterjee
9 min read
Includes Code
Has Summary
--
NVIDIA
Beginner
The article discusses the benchmarking of AI coding assistants in writing efficient CUDA code using the ComputeEval framework.
Daniel Rodriguez
2 min read
Has Summary
--
NVIDIA
Advanced
The article discusses how NVIDIA's NeMo Automodel simplifies the training of large-scale mixture-of-experts (MoE) models in PyTorch, making it accessible to a broader audience.
Hemil Desai
7 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses how to scale biology transformer models using PyTorch and NVIDIA BioNeMo Recipes, focusing on advanced parallel computing techniques and the integration of the NVIDIA Transfor...
Kyle Tretina
6 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the advancements in Explainable AI for radiology through NVIDIA Clara Reason, focusing on the NV-Reason-CXR-3B model that enhances diagnostic transparency and mimics radiologi...
Andriy Myronenko
11 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses how NVIDIA Run:ai enhances AI infrastructure management on Microsoft Azure by optimizing GPU utilization and simplifying workload orchestration.
Julie Adrounie
8 min read
Has Summary
--
OpenAI
Advanced
The article introduces gpt-oss-safeguard, OpenAI's new open-weight reasoning models designed for safety classification tasks.
OpenAI
9 min read
Has Summary
--
NVIDIA
Advanced
The article introduces CodonFM, a new state-of-the-art RNA foundation model developed by NVIDIA as part of the Clara open model family.
Kyle Gion
10 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the launch of NVIDIA's new Nemotron models designed for developing specialized AI agents that integrate language and vision capabilities.
Chris Alexiuk
8 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses how the NVIDIA DGX Spark supercomputer enhances performance for intensive AI tasks, providing a local alternative to cloud computing.
Allen Bourgoyne
5 min read
Has Summary
--
NVIDIA
Intermediate
This article outlines a streamlined process for reconstructing 3D environments for robotics simulation using only a smartphone, specifically an iPhone.
Wonsik Han
10 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses how to fine-tune and scale large language models (LLMs) using the open-source Unsloth framework on NVIDIA Blackwell GPUs.
Paul Abruzzo
5 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
This article guides readers through the process of creating a Bash computer use agent using the NVIDIA Nemotron Nano v2 model.
Mehran Maghoumi
14 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses building an AI agent using NVIDIA Nemotron to analyze IT tickets, focusing on extracting insights from unstructured data through advanced AI reasoning and graph databases.
Bhaskar Bhowmik
10 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the development of an AI-powered log analysis solution using NVIDIA's Generative AI reference workflows.
Prashant Bhende
5 min read
Includes Code
Has Summary
--
Google
Intermediate
The article discusses how to fine-tune the Gemma 3 270M model for on-device applications, enabling developers to create custom AI models without the need for expensive hardware.
Ian Ballantyne, Jason Mayes
5 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the optimization of large language models (LLMs) through pruning and knowledge distillation using NVIDIA TensorRT Model Optimizer.
Max Xu
10 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the integration of the NVIDIA KAI Scheduler with Ray, enabling advanced scheduling features like gang scheduling, workload prioritization, and autoscaling in Ray clusters.
Ekin Karabulut
9 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the integration of NVIDIA Run:ai v2.23 with NVIDIA Dynamo to address the challenges of large language model (LLM) inference across distributed environments.
Ekin Karabulut
9 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses how OpenUSD can enhance robotics development through improved data ingestion, aggregation, and the use of SimReady assets.
Matias Codesal
6 min read
Includes Code
Has Summary
--
Google
Intermediate
The article provides an in-depth exploration of the EmbeddingGemma architecture, detailing its origins, embedding generation process, and the comprehensive training methodology.
Henrique Schechter Vera, Juyeong Ji, Sahil Dua
7 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the integration of computer vision pipelines with Generative AI and reasoning, highlighting the advancements in video analytics through NVIDIA's Blueprint for Video Search and...
Samuel Ochoa
11 min read
Has Summary
--