#
Fine-tuning Programming Tutorials & Engineering Articles
122 Fine-tuning tutorials, guides, and engineering insights from NVIDIA, Google, OpenAI, and more
Companies Using This
Fine-tuning Articles & Tutorials
Filter:
The article discusses how Painkiller RTX utilizes generative AI to enhance game assets by transforming legacy textures into high-quality Physically Based Rendering (PBR) materials.
Phillip Singh
14 min read
Has Summary
--
Kimi K2. 5 is an advanced multimodal vision language model (VLM) developed by Kimi, optimized for various AI tasks.
Anu Srivastava
4 min read
Includes Code
Has Summary
--
This article presents a blueprint for building trustable AI systems, demonstrated through a real-world field test at Thunderhill Raceway where Google Developer Experts built a real-time AI racing c...
Matt Thompson, Ajeet Mirwani
5 min read
Has Summary
--
This article demonstrates how to fine-tune FunctionGemma, a specialized 270M parameter Gemma 3 model designed for function calling in agentic AI systems.
Juyeong Ji
5 min read
Includes Code
Has Summary
--
PinLanding is a multimodal AI pipeline developed by Pinterest to generate shopping collections from billions of products.
Pinterest Engineering
8 min read
Has Summary
--
The article discusses the optimization of semiconductor defect classification using generative AI and vision foundation models (VFMs).
Tim Lin
11 min read
Includes Code
Has Summary
--
The article discusses the use of AI Model Distillation to create efficient financial data workflows, focusing on the optimization of large language models (LLMs) for applications in quantitative fi...
Dhruv Desai
10 min read
Includes Code
Has Summary
--
The article discusses Spotify's approach to enhancing the Shuffle feature by balancing statistical randomness with user perception.
Ludvig Borgne (Staff Engineer) and Chidem Sahiner (Senior Product Manager)
3 min read
Has Summary
--
The article discusses how NVIDIA's CorrDiff model leverages generative AI for downscaling weather predictions, significantly improving efficiency and reducing computational costs.
Alicia Sui
11 min read
Includes Code
Has Summary
--
The article introduces CodonFM, a new state-of-the-art RNA foundation model developed by NVIDIA as part of the Clara open model family.
Kyle Gion
10 min read
Includes Code
Has Summary
--
Netflix introduces Advantage-Weighted Supervised Fine-Tuning (A-SFT), a novel post-training algorithm for generative recommender systems that addresses the unique challenges of applying reinforceme...
Netflix Technology Blog
12 min read
Has Summary
--
The article discusses how the NVIDIA DGX Spark supercomputer enhances performance for intensive AI tasks, providing a local alternative to cloud computing.
Allen Bourgoyne
5 min read
Has Summary
--
The article discusses how to fine-tune and scale large language models (LLMs) using the open-source Unsloth framework on NVIDIA Blackwell GPUs.
Paul Abruzzo
5 min read
Includes Code
Has Summary
--
The article discusses the development of an AI-powered log analysis solution using NVIDIA's Generative AI reference workflows.
Prashant Bhende
5 min read
Includes Code
Has Summary
--
The article discusses how to fine-tune the Gemma 3 270M model for on-device applications, enabling developers to create custom AI models without the need for expensive hardware.
Ian Ballantyne, Jason Mayes
5 min read
Includes Code
Has Summary
--
The article discusses the implementation of NVIDIA NV-Tesseract and NVIDIA NIM for smarter anomaly detection in semiconductor manufacturing.
Aditi Gautam
7 min read
Includes Code
Has Summary
--
The article discusses Neural Robot Dynamics (NeRD), a neural simulation framework designed to enhance robotics development by accurately predicting the dynamics of articulated robots.
Jie Xu
8 min read
Has Summary
--
The article provides an in-depth exploration of the EmbeddingGemma architecture, detailing its origins, embedding generation process, and the comprehensive training methodology.
Henrique Schechter Vera, Juyeong Ji, Sahil Dua
7 min read
Includes Code
Has Summary
--
The article discusses three neural innovations from NVIDIA Research that are enhancing robot learning capabilities, specifically focusing on bridging the gap between controlled simulations and real...
Rishabh Chadha
8 min read
Has Summary
--
The article discusses the integration of AI-powered simulations in computer-aided engineering (CAE) to accelerate design processes.
The article discusses the significance of small language models (SLMs) in the development of scalable agentic AI, emphasizing their efficiency and cost-effectiveness compared to large language mode...
Peter Belcak
8 min read
Has Summary
--
NVIDIA Cosmos Reason is an open and customizable vision language model designed for robotics and physical AI, enabling robots to reason using prior knowledge and common sense.
Tsung-Yi Lin
5 min read
Includes Code
Has Summary
--
The article discusses how Netflix has developed an automated quality control method for video content that detects pixel-level artifacts, significantly reducing the need for manual reviews.
Netflix Technology Blog
6 min read
Has Summary
--
This article provides a comprehensive guide on how to train a reasoning-capable language model using NVIDIA NeMo in just 48 hours on a single GPU.
Mehran Maghoumi
17 min read
Includes Code
Has Summary
--
The article discusses Shopify's Global Catalogue, which utilizes multimodal Large Language Models (LLMs) to standardize and enrich product data across its platform.
Audrey-Anne Guindon
13 min read
Has Summary
--
The article discusses the general availability of Google DeepMind's Gemma 3n on NVIDIA RTX and Jetson platforms, highlighting its capabilities in multi-modal on-device deployment, including audio, ...
Anu Srivastava
4 min read
Includes Code
Has Summary
--
The article discusses the contributions of the community to the Unlock Global Communication with Gemma competition on Kaggle, focusing on adapting large language models (LLMs) for diverse linguisti...
Glenn Cameron
6 min read
Has Summary
--
The article discusses the operational challenges of deploying large language models (LLMs) and introduces LLMOps as a framework for managing their lifecycle.
Liad Levi-Raz
12 min read
Includes Code
Has Summary
--
The article discusses emergent misalignment in large language models, particularly focusing on how misaligned persona features can lead to generalized misalignment.
OpenAI Team
16 min read
Has Summary
--
The article discusses the NVIDIA AI Blueprint for building efficient AI agents through model distillation, focusing on the challenges of scaling intelligent applications and managing inference cost...
Daniel Glogowski
10 min read
Includes Code
Has Summary
--
The article discusses Eigenspace Low-Rank Approximation (EoRA), a fine-tuning-free method developed by NVIDIA for compensating compression errors in large language models (LLMs).
Min-Hung Chen
8 min read
Includes Code
Has Summary
--
The article discusses NVIDIA Cosmos Reason, a world foundation model designed to enhance physical AI by curating synthetic datasets for training robots and autonomous vehicles.
Tsung-Yi Lin
6 min read
Has Summary
--
The article discusses the introduction of the AutoModel feature in the NVIDIA NeMo Framework, which allows users to run Hugging Face models with Day-0 support.
Shashank Verma
5 min read
Includes Code
Has Summary
--
The article introduces AutoRAG, a fully managed Retrieval-Augmented Generation (RAG) pipeline available in open beta on Cloudflare.
Anni Wang
11 min read
Includes Code
Has Summary
--
NVIDIA has introduced the Llama 4 Scout and Llama 4 Maverick models, which leverage NVIDIA's open-source software to achieve impressive performance metrics on Blackwell B200 GPUs.
Anu Srivastava
4 min read
Has Summary
--
The article discusses the NVIDIA AI Blueprint for an LLM router, which provides a cost-efficient framework for dynamically routing prompts to the most suitable large language models (LLMs).
Arun Raman
7 min read
Has Summary
--
The article discusses the implementation of GraphRAG, a graph-powered retrieval-augmented generation technique that enhances the accuracy of large language models (LLMs) in answering domain-specifi...
Brian Shi
8 min read
Includes Code
Has Summary
--
TxGemma is a collection of open models designed to enhance the efficiency of therapeutic development by utilizing large language models.
Shekoofeh Azizi
4 min read
Has Summary
--
The article discusses NVIDIA DGX Cloud Serverless Inference, an auto-scaling AI inference solution that simplifies the deployment and scaling of AI applications across multi-cloud and on-premises e...
Vishal Ganeriwala
9 min read
Has Summary
--
This article provides a comprehensive guide on Vision Language Models (VLMs) and their evolution from single-image understanding to advanced video comprehension.
Shubham Agrawal
11 min read
Includes Code
Has Summary
--
The article discusses the advancements in AI-driven biological research with the introduction of Evo 2, a foundation model that integrates genomic, RNA, and protein data across multiple life domain...
Kyle Tretina
9 min read
Includes Code
Has Summary
--
The article discusses enhancing translation quality through domain-specific fine-tuning using LoRA adapters and NVIDIA NIM.
Cheng-Han (Hank) Du
7 min read
Includes Code
Has Summary
--
The article discusses the NVIDIA Cosmos World Foundation Model Platform, which accelerates the development of physical AI by enabling autonomous machines to perceive and interact with their environ...
Pranjali Joshi
13 min read
Has Summary
--
The article discusses the fine-tuning of small language models (SLMs) to enhance code review accuracy, addressing challenges faced by enterprises in adopting large foundational models.
Japinder Singh
14 min read
Includes Code
Has Summary
--
The article discusses how integrating large language models (LLMs) with knowledge graphs enhances the extraction of structured insights from unstructured data, addressing challenges faced by tradit...
This article provides an introduction to building a multimodal retrieval-augmented generation (RAG) system for video and audio content.
Tanay Varshney
11 min read
Has Summary
--
The article discusses how the IIT Madras Brain Centre is leveraging generative AI, specifically visual question answering (VQA) and multimodal retrieval, to enhance neuroscience research.
Pralaypati Ta
7 min read
Has Summary
--
The article discusses the innovative approach of Confidential and self-sovereign AI, emphasizing how Super Protocol leverages decentralized systems and NVIDIA Confidential Computing to enhance data...
Rob Nertney
14 min read
Has Summary
--
The article discusses Airbnb's implementation of an AI-powered photo tour feature using Vision Transformers to enhance the guest experience by accurately classifying and organizing listing images.
Pei Xiong
9 min read
Has Summary
--
DataStax has launched a new AI development platform in collaboration with NVIDIA, designed to streamline the development, security, and optimization of AI applications.
Nicola Sessions
6 min read
Has Summary
--