Fine-tuning Programming Tutorials &amp; Engineering Articles

How Painkiller RTX Uses Generative AI to Modernize Game Assets at Scale

Advanced

The article discusses how Painkiller RTX utilizes generative AI to enhance game assets by transforming legacy textures into high-quality Physically Based Rendering (PBR) materials.

Deep LearningFine-tuningGenerative AIRemix

Phillip Singh

14 min read

Has Summary

Build with Kimi K2.5 Multimodal VLM Using NVIDIA GPU-Accelerated Endpoints

Advanced

Kimi K2. 5 is an advanced multimodal vision language model (VLM) developed by Kimi, optimized for various AI tasks.

EmbeddingFine-tuningHugging FacePyTorch

Anu Srivastava

4 min read

Includes Code

Has Summary

Beyond the Chatbot: A Blueprint for Trustable AI

Advanced

This article presents a blueprint for building trustable AI systems, demonstrated through a real-world field test at Thunderhill Raceway where Google Developer Experts built a real-time AI racing c...

Fine-tuningFirebaseGeminiVertex AI

Matt Thompson, Ajeet Mirwani

5 min read

Has Summary

A Guide to Fine-Tuning FunctionGemma

Intermediate

This article demonstrates how to fine-tune FunctionGemma, a specialized 270M parameter Gemma 3 model designed for function calling in agentic AI systems.

Fine-tuningHugging FaceJAXJSONShell

Juyeong Ji

5 min read

Includes Code

Has Summary

Advanced

PinLanding: Turn Billions of Products into Instant Shopping Collections with Multimodal AI

PinLanding is a multimodal AI pipeline developed by Pinterest to generate shopping collections from billions of products.

ApacheApache SparkCLIPFine-tuningGPTMachine LearningModalV

Pinterest Engineering

8 min read

Has Summary

Optimizing Semiconductor Defect Classification with Generative AI and Vision Foundation Models

Intermediate

The article discusses the optimization of semiconductor defect classification using generative AI and vision foundation models (VFMs).

Fine-tuningGenerative AIYAML

Tim Lin

11 min read

Includes Code

Has Summary

Build Efficient Financial Data Workflows with AI Model Distillation

Advanced

The article discusses the use of AI Model Distillation to create efficient financial data workflows, focusing on the optimization of large language models (LLMs) for applications in quantitative fi...

Deep LearningDockerElasticsearchFine-tuningJSONKubernetesMicroservicesYAML

Dhruv Desai

10 min read

Includes Code

Has Summary

Spotify

Beginner

Shuffle: Making Random Feel More Human

The article discusses Spotify's approach to enhancing the Shuffle feature by balancing statistical randomness with user perception.

Ludvig Borgne (Staff Engineer) and Chidem Sahiner (Senior Product Manager)

3 min read

Has Summary

Gen AI Super-Resolution Accelerates Weather Prediction with Scalable, Low-Compute Models

Advanced

The article discusses how NVIDIA's CorrDiff model leverages generative AI for downscaling weather predictions, significantly improving efficiency and reducing computational costs.

Fine-tuningPythonPyTorchYAML

Alicia Sui

11 min read

Includes Code

Has Summary

Introducing the CodonFM Open Model for RNA Design and Analysis

Advanced

The article introduces CodonFM, a new state-of-the-art RNA foundation model developed by NVIDIA as part of the Clara open model family.

BERTFine-tuningHugging FaceTransformer

Kyle Gion

10 min read

Includes Code

Has Summary

Netflix

Advanced

Post-Training Generative Recommenders with Advantage-Weighted Supervised Finetuning

Netflix introduces Advantage-Weighted Supervised Fine-Tuning (A-SFT), a novel post-training algorithm for generative recommender systems that addresses the unique challenges of applying reinforceme...

Fine-tuningReinforcement LearningRLHF

Netflix Technology Blog

12 min read

Has Summary

How NVIDIA DGX Spark’s Performance Enables Intensive AI Tasks

Intermediate

The article discusses how the NVIDIA DGX Spark supercomputer enhances performance for intensive AI tasks, providing a local alternative to cloud computing.

Fine-tuningGPTHugging FacePyTorchscikit-learn

Allen Bourgoyne

5 min read

Has Summary

Train an LLM on NVIDIA Blackwell with Unsloth—and Scale for Production

Intermediate

The article discusses how to fine-tune and scale large language models (LLMs) using the open-source Unsloth framework on NVIDIA Blackwell GPUs.

DockerFine-tuningHugging FacePython

Paul Abruzzo

5 min read

Includes Code

Has Summary

Build a Log Analysis Multi-Agent Self-Corrective RAG System with NVIDIA Nemotron

Advanced

The article discusses the development of an AI-powered log analysis solution using NVIDIA's Generative AI reference workflows.

EmbeddingFine-tuningGenerative AIHugging Face

Prashant Bhende

5 min read

Includes Code

Has Summary

Own your AI: Learn how to fine-tune Gemma 3 270M and run it on-device

Intermediate

The article discusses how to fine-tune the Gemma 3 270M model for on-device applications, enabling developers to create custom AI models without the need for expensive hardware.

Fine-tuningGeminiHugging FaceJavaScriptTransformers

Ian Ballantyne, Jason Mayes

5 min read

Includes Code

Has Summary

Smarter Anomaly Detection in Semiconductor Manufacturing with NVIDIA NV-Tesseract and NVIDIA NIM

Advanced

The article discusses the implementation of NVIDIA NV-Tesseract and NVIDIA NIM for smarter anomaly detection in semiconductor manufacturing.

DockerFine-tuningJSONKubernetes

Aditi Gautam

7 min read

Includes Code

Has Summary

Advancing Robotics Development with Neural Dynamics in Newton

Advanced

The article discusses Neural Robot Dynamics (NeRD), a neural simulation framework designed to enhance robotics development by accurately predicting the dynamics of articulated robots.

Fine-tuningGPT

Jie Xu

8 min read

Has Summary

Gemma explained: EmbeddingGemma Architecture and Recipe

Intermediate

The article provides an in-depth exploration of the EmbeddingGemma architecture, detailing its origins, embedding generation process, and the comprehensive training methodology.

EmbeddingFine-tuningGeminiHugging FaceTransformerTransformersVertex AI

Henrique Schechter Vera, Juyeong Ji, Sahil Dua

7 min read

Includes Code

Has Summary

R²D²: Three Neural Breakthroughs Transforming Robot Learning from NVIDIA Research

Intermediate

The article discusses three neural innovations from NVIDIA Research that are enhancing robot learning capabilities, specifically focusing on bridging the gap between controlled simulations and real...

AssemblyFine-tuningGPTTransformerWarp

Rishabh Chadha

8 min read

Has Summary

How to Run AI-Powered CAE Simulations

Intermediate

The article discusses the integration of AI-powered simulations in computer-aided engineering (CAE) to accelerate design processes.

DGLFine-tuningNumPyStencilVTKYAML

Abouzar Ghasemi

12 min read

Includes Code

Has Summary

How Small Language Models Are Key to Scalable Agentic AI

Advanced

The article discusses the significance of small language models (SLMs) in the development of scalable agentic AI, emphasizing their efficiency and cost-effectiveness compared to large language mode...

Fine-tuningHugging FaceJSON

Peter Belcak

8 min read

Has Summary

Maximize Robotics Performance by Post-Training NVIDIA Cosmos Reason

Intermediate

NVIDIA Cosmos Reason is an open and customizable vision language model designed for robotics and physical AI, enabling robots to reason using prior knowledge and common sense.

DockerFine-tuningHugging Face

Tsung-Yi Lin

5 min read

Includes Code

Has Summary

Netflix

Advanced

Accelerating Video Quality Control at Netflix with Pixel Error Detection

The article discusses how Netflix has developed an automated quality control method for video content that detects pixel-level artifacts, significantly reducing the need for manual reviews.

Netflix Technology Blog

6 min read

Has Summary

Train a Reasoning-Capable LLM in One Weekend with NVIDIA NeMo

Advanced

This article provides a comprehensive guide on how to train a reasoning-capable language model using NVIDIA NeMo in just 48 hours on a single GPU.

Fine-tuningHugging FaceJSON

Mehran Maghoumi

17 min read

Includes Code

Has Summary

Shopify

Intermediate

Leveraging Multimodal LLMs for Shopify’s Global Catalogue: Recap of Expo Talk at ICLR 2025

The article discusses Shopify's Global Catalogue, which utilizes multimodal Large Language Models (LLMs) to standardize and enrich product data across its platform.

Active LearningFine-tuningGeminiLarge Language ModelsLLaMA

Audrey-Anne Guindon

13 min read

Has Summary

Run Google DeepMind’s Gemma 3n on NVIDIA Jetson and RTX

Intermediate

The article discusses the general availability of Google DeepMind's Gemma 3n on NVIDIA RTX and Jetson platforms, highlighting its capabilities in multi-modal on-device deployment, including audio, ...

Fine-tuningHugging FaceOllama

Anu Srivastava

4 min read

Includes Code

Has Summary

Multilingual innovation in LLMs: How open models help unlock global communication

Intermediate

The article discusses the contributions of the community to the Unlock Global Communication with Gemma competition on Kaggle, focusing on adapting large language models (LLMs) for diverse linguisti...

Glenn Cameron

6 min read

Has Summary

Fine-Tuning LLMOps for Rapid Model Evaluation and Ongoing Optimization

Advanced

The article discusses the operational challenges of deploying large language models (LLMs) and introduces LLMOps as a framework for managing their lifecycle.

AzureFine-tuningGitJSONKubernetesLLaMAMicroservicesMLflow

Liad Levi-Raz

12 min read

Includes Code

Has Summary

OpenAI

Advanced

Toward understanding and preventing misalignment generalization

The article discusses emergent misalignment in large language models, particularly focusing on how misaligned persona features can lead to generalized misalignment.

ChiFine-tuningGPTPIL

OpenAI Team

16 min read

Has Summary

Build Efficient AI Agents Through Model Distillation With the NVIDIA Data Flywheel Blueprint

Intermediate

The article discusses the NVIDIA AI Blueprint for building efficient AI agents through model distillation, focusing on the challenges of scaling intelligent applications and managing inference cost...

ElasticsearchFine-tuningYAML

Daniel Glogowski

10 min read

Includes Code

Has Summary

A Fine-tuning–Free Approach for Rapidly Recovering LLM Compression Errors with EoRA

Advanced

The article discusses Eigenspace Low-Rank Approximation (EoRA), a fine-tuning-free method developed by NVIDIA for compensating compression errors in large language models (LLMs).

Fine-tuningHugging FacePython

Min-Hung Chen

8 min read

Includes Code

Has Summary

Curating Synthetic Datasets to Train Physical AI Models with NVIDIA Cosmos Reason

Beginner

The article discusses NVIDIA Cosmos Reason, a world foundation model designed to enhance physical AI by curating synthetic datasets for training robots and autonomous vehicles.

DockerFine-tuningHugging Face

Tsung-Yi Lin

6 min read

Has Summary

Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework

Intermediate

The article discusses the introduction of the AutoModel feature in the NVIDIA NeMo Framework, which allows users to run Hugging Face models with Day-0 support.

Fine-tuningHugging FaceMistralPyTorchTransformer

Shashank Verma

5 min read

Includes Code

Has Summary

Cloudflare

Intermediate

Introducing AutoRAG: fully managed Retrieval-Augmented Generation on Cloudflare

The article introduces AutoRAG, a fully managed Retrieval-Augmented Generation (RAG) pipeline available in open beta on Cloudflare.

EmbeddingFine-tuningHTMLJSONREST APITypeScript

Anni Wang

11 min read

Includes Code

Has Summary

NVIDIA Accelerates Inference on Meta Llama 4 Scout and Maverick

Intermediate

NVIDIA has introduced the Llama 4 Scout and Llama 4 Maverick models, which leverage NVIDIA's open-source software to achieve impressive performance metrics on Blackwell B200 GPUs.

Fine-tuningTransformer

Anu Srivastava

4 min read

Has Summary

Deploying the NVIDIA AI Blueprint for Cost-Efficient LLM Routing

Advanced

The article discusses the NVIDIA AI Blueprint for an LLM router, which provides a cost-efficient framework for dynamically routing prompts to the most suitable large language models (LLMs).

ChatGPTDockerFine-tuningGrafanaOpenAI APIPythonRust

Arun Raman

7 min read

Has Summary

Boosting Q&A Accuracy with GraphRAG Using PyG and Graph Databases

Advanced

The article discusses the implementation of GraphRAG, a graph-powered retrieval-augmented generation technique that enhances the accuracy of large language models (LLMs) in answering domain-specifi...

Fine-tuningNeo4jOpenAI APIPyTorchPyTorch Geometric

Brian Shi

8 min read

Includes Code

Has Summary

Introducing TxGemma: Open models to improve therapeutics development

Advanced

TxGemma is a collection of open models designed to enhance the efficiency of therapeutic development by utilizing large language models.

Fine-tuningGeminiHugging FaceVertex AI

Shekoofeh Azizi

4 min read

Has Summary

Seamlessly Scale AI Across Cloud Environments with NVIDIA DGX Cloud Serverless Inference

Advanced

The article discusses NVIDIA DGX Cloud Serverless Inference, an auto-scaling AI inference solution that simplifies the deployment and scaling of AI applications across multi-cloud and on-premises e...

Fine-tuninggRPCHelmServerless

Vishal Ganeriwala

9 min read

Has Summary

Vision Language Model Prompt Engineering Guide for Image and Video Understanding

Advanced

This article provides a comprehensive guide on Vision Language Models (VLMs) and their evolution from single-image understanding to advanced video comprehension.

Fine-tuningJSONLarge Language ModelsPrompt Engineering

Shubham Agrawal

11 min read

Includes Code

Has Summary

Understanding the Language of Life’s Biomolecules Across Evolution at a New Scale with Evo 2

Advanced

The article discusses the advancements in AI-driven biological research with the introduction of Evo 2, a foundation model that integrates genomic, RNA, and protein data across multiple life domain...

AWSFine-tuningJSONTransformerTransformersYAML

Kyle Tretina

9 min read

Includes Code

Has Summary

Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM

Intermediate

The article discusses enhancing translation quality through domain-specific fine-tuning using LoRA adapters and NVIDIA NIM.

Fine-tuningMistralRemix

Cheng-Han (Hank) Du

7 min read

Includes Code

Has Summary

Advancing Physical AI with NVIDIA Cosmos World Foundation Model Platform

Advanced

The article discusses the NVIDIA Cosmos World Foundation Model Platform, which accelerates the development of physical AI by enabling autonomous machines to perceive and interact with their environ...

Fine-tuningHugging FaceRapids

Pranjali Joshi

13 min read

Has Summary

Fine-Tuning Small Language Models to Optimize Code Review Accuracy

Advanced

The article discusses the fine-tuning of small language models (SLMs) to enhance code review accuracy, addressing challenges faced by enterprises in adopting large foundational models.

DockerFine-tuningGenerative AIGPTGPT-4JSONPython

Japinder Singh

14 min read

Includes Code

Has Summary

Insights, Techniques, and Evaluation for LLM-Driven Knowledge Graphs

Advanced

The article discusses how integrating large language models (LLMs) with knowledge graphs enhances the extraction of structured insights from unstructured data, addressing challenges faced by tradit...

ChatGPTFine-tuningJSONNetworkXPythonShell

Rohan Rao

15 min read

Includes Code

Has Summary

An Easy Introduction to Multimodal Retrieval-Augmented Generation for Video and Audio

Intermediate

This article provides an introduction to building a multimodal retrieval-augmented generation (RAG) system for video and audio content.

CLIPEmbeddingFine-tuning

Tanay Varshney

11 min read

Has Summary

Advancing Neuroscience Research with Visual Question Answering and Multimodal Retrieval

Intermediate

The article discusses how the IIT Madras Brain Centre is leveraging generative AI, specifically visual question answering (VQA) and multimodal retrieval, to enhance neuroscience research.

EmbeddingFine-tuningHelmVector Database

Pralaypati Ta

7 min read

Has Summary

Exploring the Case of Super Protocol with Self-Sovereign AI and NVIDIA Confidential Computing

Advanced

The article discusses the innovative approach of Confidential and self-sovereign AI, emphasizing how Super Protocol leverages decentralized systems and NVIDIA Confidential Computing to enhance data...

Airbnb’s AI-powered photo tour using Vision Transformer

Rob Nertney

14 min read

Has Summary

Airbnb

Advanced

The article discusses Airbnb's implementation of an AI-powered photo tour feature using Vision Transformers to enhance the guest experience by accurately classifying and organizing listing images.

Fine-tuningMachine LearningTransformerTransformers

Pei Xiong

9 min read

Has Summary

DataStax Announces New AI Development Platform, Built with NVIDIA AI

Advanced

DataStax has launched a new AI development platform in collaboration with NVIDIA, designed to streamline the development, security, and optimization of AI applications.