How NVIDIA Uses Fine-tuning

87 articles about Fine-tuning from NVIDIA's engineering team

Articles

NVIDIA

Advanced

How Painkiller RTX Uses Generative AI to Modernize Game Assets at Scale

The article discusses how Painkiller RTX utilizes generative AI to enhance game assets by transforming legacy textures into high-quality Physically Based Rendering (PBR) materials.

Deep Learning, Fine-tuning, Generative AI, Remix

Phillip Singh

14 min read

Has Summary

NVIDIA

Advanced

Build with Kimi K2.5 Multimodal VLM Using NVIDIA GPU-Accelerated Endpoints

Kimi K2.5 is an advanced multimodal vision language model (VLM) developed by Moonshot AI, optimized for various AI tasks.

Embedding, Fine-tuning, Hugging Face, PyTorch

Anu Srivastava

4 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Optimizing Semiconductor Defect Classification with Generative AI and Vision Foundation Models

The article discusses the optimization of semiconductor defect classification using generative AI and vision foundation models (VFMs).

Fine-tuning, Generative AI, YAML

Tim Lin

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Build Efficient Financial Data Workflows with AI Model Distillation

The article discusses the use of AI Model Distillation to create efficient financial data workflows, focusing on the optimization of large language models (LLMs) for applications in quantitative fi...

Deep Learning, Docker, Elasticsearch, Fine-tuning, JSON, Kubernetes, Microservices, YAML

Dhruv Desai

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

Gen AI Super-Resolution Accelerates Weather Prediction with Scalable, Low-Compute Models

The article discusses how NVIDIA's CorrDiff model leverages generative AI for downscaling weather predictions, significantly improving efficiency and reducing computational costs.

Fine-tuning, Python, PyTorch, YAML

Alicia Sui

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Introducing the CodonFM Open Model for RNA Design and Analysis

The article introduces CodonFM, a new state-of-the-art RNA foundation model developed by NVIDIA as part of the Clara open model family.

BERT, Fine-tuning, Hugging Face, Transformer

Kyle Gion

10 min read

Includes Code

Has Summary

NVIDIA

Intermediate

How NVIDIA DGX Spark’s Performance Enables Intensive AI Tasks

The article discusses how the NVIDIA DGX Spark supercomputer enhances performance for intensive AI tasks, providing a local alternative to cloud computing.

Fine-tuning, GPT, Hugging Face, PyTorch, scikit-learn

Allen Bourgoyne

5 min read

Has Summary

NVIDIA

Intermediate

Train an LLM on NVIDIA Blackwell with Unsloth—and Scale for Production

The article discusses how to fine-tune and scale large language models (LLMs) using the open-source Unsloth framework on NVIDIA Blackwell GPUs.

Docker, Fine-tuning, Hugging Face, Python

Paul Abruzzo

5 min read

Includes Code

Has Summary

NVIDIA

Advanced

Build a Log Analysis Multi-Agent Self-Corrective RAG System with NVIDIA Nemotron

The article discusses the development of an AI-powered log analysis solution using NVIDIA's Generative AI reference workflows.

Embedding, Fine-tuning, Generative AI, Hugging Face

Prashant Bhende

5 min read

Includes Code

Has Summary

NVIDIA

Advanced

Smarter Anomaly Detection in Semiconductor Manufacturing with NVIDIA NV-Tesseract and NVIDIA NIM

The article discusses the implementation of NVIDIA NV-Tesseract and NVIDIA NIM for smarter anomaly detection in semiconductor manufacturing.

Docker, Fine-tuning, JSON, Kubernetes

Aditi Gautam

7 min read

Includes Code

Has Summary

NVIDIA

Advanced

Advancing Robotics Development with Neural Dynamics in Newton

The article discusses Neural Robot Dynamics (NeRD), a neural simulation framework designed to enhance robotics development by accurately predicting the dynamics of articulated robots.

Fine-tuning, GPT

Jie Xu

8 min read

Has Summary

NVIDIA

Intermediate

R²D²: Three Neural Breakthroughs Transforming Robot Learning from NVIDIA Research

The article discusses three neural innovations from NVIDIA Research that are enhancing robot learning capabilities, specifically focusing on bridging the gap between controlled simulations and real...

Assembly, Fine-tuning, GPT, Transformer, Warp

Rishabh Chadha

8 min read

Has Summary

NVIDIA

Intermediate

How to Run AI-Powered CAE Simulations

The article discusses the integration of AI-powered simulations in computer-aided engineering (CAE) to accelerate design processes.

DGL, Fine-tuning, NumPy, Stencil, VTK, YAML

Abouzar Ghasemi

12 min read

Includes Code

Has Summary

NVIDIA

Advanced

How Small Language Models Are Key to Scalable Agentic AI

The article discusses the significance of small language models (SLMs) in the development of scalable agentic AI, emphasizing their efficiency and cost-effectiveness compared to large language mode...

Fine-tuning, Hugging Face, JSON

Peter Belcak

8 min read

Has Summary

NVIDIA

Intermediate

Maximize Robotics Performance by Post-Training NVIDIA Cosmos Reason

NVIDIA Cosmos Reason is an open and customizable vision language model designed for robotics and physical AI, enabling robots to reason using prior knowledge and common sense.

Docker, Fine-tuning, Hugging Face

Tsung-Yi Lin

5 min read

Includes Code

Has Summary

NVIDIA

Advanced

Train a Reasoning-Capable LLM in One Weekend with NVIDIA NeMo

This article provides a comprehensive guide on how to train a reasoning-capable language model using NVIDIA NeMo in just 48 hours on a single GPU.

Fine-tuning, Hugging Face, JSON

Mehran Maghoumi

17 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Run Google DeepMind’s Gemma 3n on NVIDIA Jetson and RTX

The article discusses the general availability of Google DeepMind's Gemma 3n on NVIDIA RTX and Jetson platforms, highlighting its capabilities in multi-modal on-device deployment, including audio, ...

Fine-tuning, Hugging Face, Ollama

Anu Srivastava

4 min read

Includes Code

Has Summary

NVIDIA

Advanced

Fine-Tuning LLMOps for Rapid Model Evaluation and Ongoing Optimization

The article discusses the operational challenges of deploying large language models (LLMs) and introduces LLMOps as a framework for managing their lifecycle.

Azure, Fine-tuning, Git, JSON, Kubernetes, LLaMA, Microservices, MLflow

Liad Levi-Raz

12 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Build Efficient AI Agents Through Model Distillation With the NVIDIA Data Flywheel Blueprint

The article discusses the NVIDIA AI Blueprint for building efficient AI agents through model distillation, focusing on the challenges of scaling intelligent applications and managing inference cost...

Elasticsearch, Fine-tuning, YAML

Daniel Glogowski

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

A Fine-tuning–Free Approach for Rapidly Recovering LLM Compression Errors with EoRA

The article discusses Eigenspace Low-Rank Approximation (EoRA), a fine-tuning-free method developed by NVIDIA for compensating compression errors in large language models (LLMs).

Fine-tuning, Hugging Face, Python

Min-Hung Chen

8 min read

Includes Code

Has Summary

NVIDIA

Beginner

Curating Synthetic Datasets to Train Physical AI Models with NVIDIA Cosmos Reason

The article discusses NVIDIA Cosmos Reason, a world foundation model designed to enhance physical AI by curating synthetic datasets for training robots and autonomous vehicles.

Docker, Fine-tuning, Hugging Face

Tsung-Yi Lin

6 min read

Has Summary

NVIDIA

Intermediate

Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework

The article discusses the introduction of the AutoModel feature in the NVIDIA NeMo Framework, which allows users to run Hugging Face models with Day-0 support.

Fine-tuning, Hugging Face, Mistral, PyTorch, Transformer

Shashank Verma

5 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Accelerates Inference on Meta Llama 4 Scout and Maverick

NVIDIA has optimized Meta's Llama 4 Scout and Llama 4 Maverick models with its open-source software to achieve impressive performance metrics on Blackwell B200 GPUs.

Fine-tuning, Transformer

Anu Srivastava

4 min read

Has Summary

NVIDIA

Advanced

Deploying the NVIDIA AI Blueprint for Cost-Efficient LLM Routing

The article discusses the NVIDIA AI Blueprint for an LLM router, which provides a cost-efficient framework for dynamically routing prompts to the most suitable large language models (LLMs).

ChatGPT, Docker, Fine-tuning, Grafana, OpenAI API, Python, Rust

Arun Raman

7 min read

Has Summary

NVIDIA

Advanced

Boosting Q&A Accuracy with GraphRAG Using PyG and Graph Databases

The article discusses the implementation of GraphRAG, a graph-powered retrieval-augmented generation technique that enhances the accuracy of large language models (LLMs) in answering domain-specifi...

Fine-tuning, Neo4j, OpenAI API, PyTorch, PyTorch Geometric

Brian Shi

8 min read

Includes Code

Has Summary

NVIDIA

Advanced

Seamlessly Scale AI Across Cloud Environments with NVIDIA DGX Cloud Serverless Inference

The article discusses NVIDIA DGX Cloud Serverless Inference, an auto-scaling AI inference solution that simplifies the deployment and scaling of AI applications across multi-cloud and on-premises e...

Fine-tuning, gRPC, Helm, Serverless

Vishal Ganeriwala

9 min read

Has Summary

NVIDIA

Advanced

Vision Language Model Prompt Engineering Guide for Image and Video Understanding

This article provides a comprehensive guide on Vision Language Models (VLMs) and their evolution from single-image understanding to advanced video comprehension.

Fine-tuning, JSON, Large Language Models, Prompt Engineering

Shubham Agrawal

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Understanding the Language of Life’s Biomolecules Across Evolution at a New Scale with Evo 2

The article discusses the advancements in AI-driven biological research with the introduction of Evo 2, a foundation model that integrates genomic, RNA, and protein data across multiple life domain...

AWS, Fine-tuning, JSON, Transformer, Transformers, YAML

Kyle Tretina

9 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM

The article discusses enhancing translation quality through domain-specific fine-tuning using LoRA adapters and NVIDIA NIM.

Fine-tuning, Mistral, Remix

Cheng-Han (Hank) Du

7 min read

Includes Code

Has Summary

NVIDIA

Advanced

Advancing Physical AI with NVIDIA Cosmos World Foundation Model Platform

The article discusses the NVIDIA Cosmos World Foundation Model Platform, which accelerates the development of physical AI by enabling autonomous machines to perceive and interact with their environ...

Fine-tuning, Hugging Face, Rapids

Pranjali Joshi

13 min read

Has Summary

NVIDIA

Advanced

Fine-Tuning Small Language Models to Optimize Code Review Accuracy

The article discusses the fine-tuning of small language models (SLMs) to enhance code review accuracy, addressing challenges faced by enterprises in adopting large foundational models.

Docker, Fine-tuning, Generative AI, GPT, GPT-4, JSON, Python

Japinder Singh

14 min read

Includes Code

Has Summary

NVIDIA

Advanced

Insights, Techniques, and Evaluation for LLM-Driven Knowledge Graphs

The article discusses how integrating large language models (LLMs) with knowledge graphs enhances the extraction of structured insights from unstructured data, addressing challenges faced by tradit...

ChatGPT, Fine-tuning, JSON, NetworkX, Python, Shell

Rohan Rao

15 min read

Includes Code

Has Summary

NVIDIA

Intermediate

An Easy Introduction to Multimodal Retrieval-Augmented Generation for Video and Audio

This article provides an introduction to building a multimodal retrieval-augmented generation (RAG) system for video and audio content.

CLIP, Embedding, Fine-tuning

Tanay Varshney

11 min read

Has Summary

NVIDIA

Intermediate

Advancing Neuroscience Research with Visual Question Answering and Multimodal Retrieval

The article discusses how the IIT Madras Brain Centre is leveraging generative AI, specifically visual question answering (VQA) and multimodal retrieval, to enhance neuroscience research.

Embedding, Fine-tuning, Helm, Vector Database

Pralaypati Ta

7 min read

Has Summary

NVIDIA

Advanced

Exploring the Case of Super Protocol with Self-Sovereign AI and NVIDIA Confidential Computing

The article discusses the innovative approach of confidential and self-sovereign AI, emphasizing how Super Protocol leverages decentralized systems and NVIDIA Confidential Computing to enhance data...

Fine-tuning

Rob Nertney

14 min read

Has Summary

NVIDIA

Advanced

DataStax Announces New AI Development Platform, Built with NVIDIA AI

DataStax has launched a new AI development platform in collaboration with NVIDIA, designed to streamline the development, security, and optimization of AI applications.

Fine-tuning

Nicola Sessions

6 min read

Has Summary

NVIDIA

Advanced

Enhancing RAG Applications with NVIDIA NIM

The article discusses how NVIDIA NIM enhances Retrieval-Augmented Generation (RAG) applications, particularly in the veterinary field through the development of LAIKA, an AI copilot.

Docker, Fine-tuning, Mistral

Davide Tricarico

9 min read

Has Summary

NVIDIA

Intermediate

NVIDIA AI Workbench Simplifies Using GPUs on Windows

NVIDIA AI Workbench is a free development environment manager that simplifies the use of GPUs on Windows, macOS, and Ubuntu for data science, machine learning, and AI projects.

Docker, Fine-tuning, Git, GitLab, Gradio, Redis

Tyler Whitehouse

7 min read

Has Summary

NVIDIA

Intermediate

Developing Robust Georgian Automatic Speech Recognition with FastConformer Hybrid Transducer CTC BPE

The article discusses the development of a robust Automatic Speech Recognition (ASR) model for the Georgian language using the FastConformer Hybrid Transducer CTC BPE architecture.

Fine-tuning, Whisper

Sofia Kostandian

9 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Creating Synthetic Data Using Llama 3.1 405B

The article discusses the creation of synthetic data using the Llama 3.1 405B model, emphasizing its applications in enhancing model accuracy across various domains.

BERT, Fine-tuning

Tanay Varshney

14 min read

Has Summary

NVIDIA

Advanced

NVIDIA NeMo Accelerates LLM Innovation with Hybrid State Space Model Support

The article discusses NVIDIA's NeMo framework and its new support for Hybrid State Space Models (SSMs), which enhance the training and efficiency of large language models (LLMs).

Deep Learning, Fine-tuning, PyTorch, Transformer

Ashraf Eassa

6 min read

Has Summary

NVIDIA

Advanced

Unlock Gene Networks Using Limited Data with AI Model Geneformer

Geneformer is an AI model designed to learn gene network dynamics using limited data, leveraging transfer learning from extensive single-cell transcriptome datasets.

BERT, Fine-tuning, Python

Kyle Tretina

5 min read

Has Summary

NVIDIA

Intermediate

Understanding Diffusion Models: An Essential Guide for AEC Professionals

This article explores the transformative potential of diffusion models within the Architecture, Engineering, and Construction (AEC) industry, highlighting their ability to generate high-quality vis...

DALL-E, Diffusion Models, Fine-tuning, Generative AI, GPT, GPT-4, Midjourney, Stable Diffusion

Sama Bali

12 min read

Has Summary

NVIDIA

Intermediate

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

The article discusses the importance of fine-tuning AI models with synthetic data to enhance multi-camera tracking accuracy. It highlights the use of NVIDIA Isaac Sim and the Omni.Replicator.

Embedding, Fine-tuning, Microservices, ResNet, Supervised Learning, Transformer

Sameer Satish Pusegaonkar

13 min read

Includes Code

Has Summary

NVIDIA

Advanced

Deploy GPU-Optimized AI Software with One Click Using Brev.dev and NVIDIA NGC Catalog

The article discusses how Brev.dev simplifies the deployment of GPU-optimized AI software using NVIDIA's NGC catalog, enabling developers to launch AI solutions quickly and efficiently.

Fine-tuning, GPT, Hugging Face, Mistral, Python

Nirmal Kumar Juluru

6 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs

NVIDIA has launched the NVIDIA RTX AI Toolkit, a comprehensive suite of tools and SDKs designed for Windows application developers to customize, optimize, and deploy AI models.

Fine-tuning, GPT, Hugging Face, LangChain, LlamaIndex, Transformer

Jesse Clayton

8 min read

Has Summary

NVIDIA

Advanced

Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 2

This article provides a detailed guide on customizing Neural Machine Translation (NMT) models using NVIDIA NeMo, focusing on curating a custom dataset and fine-tuning the model.

Fine-tuning, JSON, Microservices, Python, TensorBoard

Zhiyong Ban

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo

The article introduces DRaFT+, an enhanced algorithm for fine-tuning text-to-image diffusion models, which aims to improve the alignment between input prompts and generated images.

CLIP, Diffusion Models, Fine-tuning, Reinforcement Learning, RLHF, Stable Diffusion

Ali Taghibakhshi

9 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud

The article discusses how Union.ai and NVIDIA DGX Cloud are transforming AI workflows by providing accessible, high-performance computing resources.

Azure, Azure Blob Storage, Fine-tuning, Google Cloud, Kubernetes, Large Language Models

Niels Bantilan

6 min read