NVIDIA logo

How NVIDIA Uses Fine-tuning

87 engineering articles about Fine-tuning from NVIDIA's engineering team

Articles

NVIDIA
Advanced
The article discusses how Painkiller RTX utilizes generative AI to enhance game assets by transforming legacy textures into high-quality Physically Based Rendering (PBR) materials.
Phillip Singh
14 min read
Has Summary
--
NVIDIA
Advanced
Kimi K2.5 is an advanced multimodal vision language model (VLM) developed by Kimi, optimized for various AI tasks.
Anu Srivastava
4 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the optimization of semiconductor defect classification using generative AI and vision foundation models (VFMs).
Tim Lin
11 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the use of AI Model Distillation to create efficient financial data workflows, focusing on the optimization of large language models (LLMs) for applications in quantitative fi...
--
NVIDIA
Advanced
The article discusses how NVIDIA's CorrDiff model leverages generative AI for downscaling weather predictions, significantly improving efficiency and reducing computational costs.
Alicia Sui
11 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article introduces CodonFM, a new state-of-the-art RNA foundation model developed by NVIDIA as part of the Clara open model family.
Kyle Gion
10 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses how the NVIDIA DGX Spark supercomputer enhances performance for intensive AI tasks, providing a local alternative to cloud computing.
Allen Bourgoyne
5 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses how to fine-tune and scale large language models (LLMs) using the open-source Unsloth framework on NVIDIA Blackwell GPUs.
Paul Abruzzo
5 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the development of an AI-powered log analysis solution using NVIDIA's Generative AI reference workflows.
Prashant Bhende
5 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the implementation of NVIDIA NV-Tesseract and NVIDIA NIM for smarter anomaly detection in semiconductor manufacturing.
Aditi Gautam
7 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses Neural Robot Dynamics (NeRD), a neural simulation framework designed to enhance robotics development by accurately predicting the dynamics of articulated robots.
Jie Xu
8 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses three neural innovations from NVIDIA Research that are enhancing robot learning capabilities, specifically focusing on bridging the gap between controlled simulations and real...
Rishabh Chadha
8 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses the integration of AI-powered simulations in computer-aided engineering (CAE) to accelerate design processes.
Abouzar Ghasemi
12 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the significance of small language models (SLMs) in the development of scalable agentic AI, emphasizing their efficiency and cost-effectiveness compared to large language mode...
Peter Belcak
8 min read
Has Summary
--
NVIDIA
Intermediate
NVIDIA Cosmos Reason is an open and customizable vision language model designed for robotics and physical AI, enabling robots to reason using prior knowledge and common sense.
Tsung-Yi Lin
5 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
This article provides a comprehensive guide on how to train a reasoning-capable language model using NVIDIA NeMo in just 48 hours on a single GPU.
Mehran Maghoumi
17 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the general availability of Google DeepMind's Gemma 3n on NVIDIA RTX and Jetson platforms, highlighting its capabilities in multi-modal on-device deployment, including audio, ...
Anu Srivastava
4 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the operational challenges of deploying large language models (LLMs) and introduces LLMOps as a framework for managing their lifecycle.
Liad Levi-Raz
12 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the NVIDIA AI Blueprint for building efficient AI agents through model distillation, focusing on the challenges of scaling intelligent applications and managing inference cost...
Daniel Glogowski
10 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses Eigenspace Low-Rank Approximation (EoRA), a fine-tuning-free method developed by NVIDIA for compensating compression errors in large language models (LLMs).
Min-Hung Chen
8 min read
Includes Code
Has Summary
--
NVIDIA
Beginner
The article discusses NVIDIA Cosmos Reason, a world foundation model designed to enhance physical AI by curating synthetic datasets for training robots and autonomous vehicles.
Tsung-Yi Lin
6 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses the introduction of the AutoModel feature in the NVIDIA NeMo Framework, which allows users to run Hugging Face models with Day-0 support.
Shashank Verma
5 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
NVIDIA has optimized Meta's Llama 4 Scout and Llama 4 Maverick models with its open-source software to improve their performance on Blackwell B200 GPUs.
Anu Srivastava
4 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the NVIDIA AI Blueprint for an LLM router, which provides a cost-efficient framework for dynamically routing prompts to the most suitable large language models (LLMs).
--
NVIDIA
Advanced
The article discusses the implementation of GraphRAG, a graph-powered retrieval-augmented generation technique that enhances the accuracy of large language models (LLMs) in answering domain-specifi...
Brian Shi
8 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses NVIDIA DGX Cloud Serverless Inference, an auto-scaling AI inference solution that simplifies the deployment and scaling of AI applications across multi-cloud and on-premises e...
Vishal Ganeriwala
9 min read
Has Summary
--
NVIDIA
Advanced
This article provides a comprehensive guide on Vision Language Models (VLMs) and their evolution from single-image understanding to advanced video comprehension.
Shubham Agrawal
11 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the advancements in AI-driven biological research with the introduction of Evo 2, a foundation model that integrates genomic, RNA, and protein data across multiple life domain...
Kyle Tretina
9 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses enhancing translation quality through domain-specific fine-tuning using LoRA adapters and NVIDIA NIM.
Cheng-Han (Hank) Du
7 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses the NVIDIA Cosmos World Foundation Model Platform, which accelerates the development of physical AI by enabling autonomous machines to perceive and interact with their environ...
Pranjali Joshi
13 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the fine-tuning of small language models (SLMs) to enhance code review accuracy, addressing challenges faced by enterprises in adopting large foundational models.
Japinder Singh
14 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses how integrating large language models (LLMs) with knowledge graphs enhances the extraction of structured insights from unstructured data, addressing challenges faced by tradit...
Rohan Rao
15 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
This article provides an introduction to building a multimodal retrieval-augmented generation (RAG) system for video and audio content.
Tanay Varshney
11 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses how the IIT Madras Brain Centre is leveraging generative AI, specifically visual question answering (VQA) and multimodal retrieval, to enhance neuroscience research.
Pralaypati Ta
7 min read
Has Summary
--
NVIDIA
Advanced
The article discusses the innovative approach of Confidential and self-sovereign AI, emphasizing how Super Protocol leverages decentralized systems and NVIDIA Confidential Computing to enhance data...
Rob Nertney
14 min read
Has Summary
--
NVIDIA
Advanced
DataStax has launched a new AI development platform in collaboration with NVIDIA, designed to streamline the development, security, and optimization of AI applications.
Nicola Sessions
6 min read
Has Summary
--
NVIDIA
Advanced
The article discusses how NVIDIA NIM enhances Retrieval-Augmented Generation (RAG) applications, particularly in the veterinary field through the development of LAIKA, an AI copilot.
Davide Tricarico
9 min read
Has Summary
--
NVIDIA
Intermediate
NVIDIA AI Workbench is a free development environment manager that simplifies the use of GPUs on Windows, macOS, and Ubuntu for data science, machine learning, and AI projects.
Tyler Whitehouse
7 min read
Has Summary
--
NVIDIA
Intermediate
The article discusses the development of a robust Automatic Speech Recognition (ASR) model for the Georgian language using the FastConformer Hybrid Transducer CTC BPE architecture.
Sofia Kostandian
9 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses the creation of synthetic data using the Llama 3.1 405B model, emphasizing its applications in enhancing model accuracy across various domains.
Tanay Varshney
14 min read
Has Summary
--
NVIDIA
Advanced
The article discusses NVIDIA's NeMo framework and its new support for Hybrid State Space Models (SSMs), which enhance the training and efficiency of large language models (LLMs).
Ashraf Eassa
6 min read
Has Summary
--
NVIDIA
Advanced
Geneformer is an AI model designed to learn gene network dynamics using limited data, leveraging transfer learning from extensive single-cell transcriptome datasets.
Kyle Tretina
5 min read
Has Summary
--
NVIDIA
Intermediate
This article explores the transformative potential of diffusion models within the Architecture, Engineering, and Construction (AEC) industry, highlighting their ability to generate high-quality vis...
--
NVIDIA
Intermediate
The article discusses the importance of fine-tuning AI models with synthetic data to enhance multi-camera tracking accuracy. It highlights the use of NVIDIA Isaac Sim and Omni.Replicator.
Sameer Satish Pusegaonkar
13 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article discusses how Brev.dev simplifies the deployment of GPU-optimized AI software using NVIDIA's NGC catalog, enabling developers to launch AI solutions quickly and efficiently.
Nirmal Kumar Juluru
6 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
NVIDIA has launched the NVIDIA RTX AI Toolkit, a comprehensive suite of tools and SDKs designed for Windows application developers to customize, optimize, and deploy AI models.
--
NVIDIA
Advanced
This article provides a detailed guide on customizing Neural Machine Translation (NMT) models using NVIDIA NeMo, focusing on curating a custom dataset and fine-tuning the model.
Zhiyong Ban
10 min read
Includes Code
Has Summary
--
NVIDIA
Advanced
The article introduces DRaFT+, an enhanced algorithm for fine-tuning text-to-image diffusion models, which aims to improve the alignment between input prompts and generated images.
Ali Taghibakhshi
9 min read
Includes Code
Has Summary
--
NVIDIA
Intermediate
The article discusses how Union.ai and NVIDIA DGX Cloud are transforming AI workflows by providing accessible, high-performance computing resources.
--
NVIDIA
Advanced
The article discusses the NVIDIA NeMo microservices, which simplify the development of custom generative AI models for enterprises.
Nirmal Kumar Juluru
5 min read
Has Summary
--