How NVIDIA Uses Docker

292 engineering articles about Docker from NVIDIA's engineering team

Other NVIDIA Technologies

Python(740)PyTorch(566)Deep Learning(505)TensorFlow(444)Kubernetes(251)AWS(202)

Other Companies Using Docker

Articles

Filter:

NVIDIA

Intermediate

Build AI-Ready Knowledge Systems Using 5 Essential Multimodal RAG Capabilities

The article discusses the importance of building AI-ready knowledge systems using Retrieval-Augmented Generation (RAG) capabilities.

Docker

Shruthii Sathyanarayanan

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

How to Build a Document Processing Pipeline for RAG with Nemotron

The article provides a comprehensive guide on building a document processing pipeline using NVIDIA Nemotron RAG, focusing on the extraction of structured data from complex documents like PDFs.

DockerEmbeddingHugging FaceJSONPythonRedistorchvision

Chia-Chih Chen

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

Accelerating Long-Context Model Training in JAX and XLA

The article discusses the integration of the NVSHMEM communication library into the Accelerated Linear Algebra (XLA) compiler to optimize long-context model training in JAX.

DockerJAXPython

Sevin Fide Varoglu

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

Build an AI Catalog System That Delivers Localized, Interactive Product Experiences

This article provides a comprehensive tutorial on building an AI-powered catalog enrichment system that enhances e-commerce product listings using NVIDIA's advanced models.

DockerFastAPIGenerative AIJSONPython

Antonio Martinez

10 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Multi-Agent Warehouse AI Command Layer Enables Operational Excellence and Supply Chain Intelligence

The article discusses the NVIDIA Multi-Agent Intelligent Warehouse (MAIW), an AI command layer designed to enhance operational efficiency and supply chain intelligence in automated warehouses.

DockerEmbeddingFastAPIGrafanaHelmJSONJWTOptunaPostgreSQLPrometheusReactRedisSQLTimescaleDB

Tarik Hammadou

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

Simplify Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena

The article introduces NVIDIA Isaac Lab-Arena, an open-source framework designed for efficient and scalable evaluation of generalist robot policies in simulation.

DockerHugging Face

Sangeeta Subramanian

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

Enabling Horizontal Autoscaling of Enterprise RAG Components on Kubernetes

This article discusses the implementation of horizontal autoscaling for Retrieval-Augmented Generation (RAG) components on Kubernetes, focusing on NVIDIA's microservices architecture.

DockerGrafanaHelmKubernetesMicroservicesPrometheus

Juana Nakfour

23 min read

Includes Code

Has Summary

NVIDIA

Intermediate

How to Enhance 3D Gaussian Reconstruction Quality for Simulation

The article discusses enhancing the quality of 3D Gaussian reconstruction for simulation, focusing on the use of NVIDIA's Fixer model to eliminate rendering artifacts.

Diffusion ModelsDockerHugging Face

Wonsik Han

7 min read

Includes Code

Has Summary

NVIDIA

Advanced

NVIDIA-Accelerated Mistral 3 Open Models Deliver Efficiency, Accuracy at Any Scale

The NVIDIA-accelerated Mistral 3 open model family offers developers and enterprises industry-leading accuracy, efficiency, and customization capabilities.

DockerHugging FaceMistralOllama

Anu Srivastava

6 min read

Has Summary

NVIDIA

Advanced

Build Efficient Financial Data Workflows with AI Model Distillation

The article discusses the use of AI Model Distillation to create efficient financial data workflows, focusing on the optimization of large language models (LLMs) for applications in quantitative fi...

Deep LearningDockerElasticsearchFine-tuningJSONKubernetesMicroservicesYAML

Dhruv Desai

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

Build and Run Secure, Data-Driven AI Agents

The article discusses the deployment of secure, data-driven AI agents using NVIDIA's AI-Q Research Assistant and Enterprise RAG Blueprints on AWS.

AWSDockerGitGrafanaHelmKubernetesPrometheusServerlessTerraform

Abdullahi Olaoye

8 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Building Scalable AI on Enterprise Data with NVIDIA Nemotron RAG and Microsoft SQL Server 2025

The article discusses the integration of NVIDIA Nemotron RAG with Microsoft SQL Server 2025, showcasing how this collaboration enables the development of scalable AI applications on enterprise data.

AzureDockerEmbeddingHTTPSSQLSQL Server

Uttara Kumar

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

How to Predict Biomolecular Structures Using the OpenFold3 NIM

The article discusses the advancements in biomolecular structure prediction using OpenFold3, a deep learning model integrated into the NVIDIA ecosystem.

DockerPython

Kyle Tretina

5 min read

Includes Code

Has Summary

NVIDIA

Intermediate

How Code Execution Drives Key Risks in Agentic AI Systems

The article discusses the security risks associated with AI-driven applications that generate and execute code autonomously.

AWSAWS EC2DockerPython

John Irwin

8 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Train an LLM on NVIDIA Blackwell with Unsloth—and Scale for Production

The article discusses how to fine-tune and scale large language models (LLMs) using the open-source Unsloth framework on NVIDIA Blackwell GPUs.

DockerFine-tuningHugging FacePython

Paul Abruzzo

5 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Training Federated AI Models to Predict Protein Properties

The article discusses the collaborative training of AI models to predict protein properties, specifically subcellular localization, using NVIDIA FLARE and the BioNeMo Framework.

DockerTensorBoard

Holger Roth

4 min read

Includes Code

Has Summary

NVIDIA

Advanced

Smarter Anomaly Detection in Semiconductor Manufacturing with NVIDIA NV-Tesseract and NVIDIA NIM

The article discusses the implementation of NVIDIA NV-Tesseract and NVIDIA NIM for smarter anomaly detection in semiconductor manufacturing.

DockerFine-tuningJSONKubernetes

Aditi Gautam

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Build a Real-Time Visual Inspection Pipeline with NVIDIA TAO 6 and NVIDIA DeepStream 8

This article discusses building a real-time visual inspection pipeline using NVIDIA TAO 6 and NVIDIA DeepStream 8, addressing challenges in defect detection and quality control.

DockerPythonResNetYAML

Varun Praveen

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Build a Retrieval-Augmented Generation (RAG) Agent with NVIDIA Nemotron

The article provides a comprehensive guide on building a Retrieval-Augmented Generation (RAG) agent using NVIDIA Nemotron, emphasizing the integration of external information to enhance text genera...

DockerEmbeddingHugging FaceLangChainPythonStreamlitVector Database

Edward Li

16 min read

Includes Code

Has Summary

NVIDIA

Advanced

Accelerate Autonomous Vehicle Development with the NVIDIA DRIVE AGX Thor Developer Kit

The article discusses the launch of the NVIDIA DRIVE AGX Thor Developer Kit, a powerful platform designed to accelerate the development of autonomous vehicles.

Deep LearningDocker

Abhinaw Priyadershi

8 min read

Has Summary

NVIDIA

Advanced

Deploying Your Omniverse Kit Apps at Scale

The article discusses NVIDIA Omniverse Kit App Streaming, a solution for deploying and streaming 3D applications built with NVIDIA's SDKs directly to browsers.

AWSAzureDockerHelmKubernetesLoad BalancerWebRTC

Ashley Goldstein

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Streamline CUDA-Accelerated Python Install and Packaging Workflows with Wheel Variants

The article discusses the introduction of Wheel Variants, a new Python packaging standard aimed at improving the installation and packaging workflows for CUDA-accelerated Python packages.

DockerJAXPythonPyTorchSciPy

Jonathan Dekhtiar

15 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Maximize Robotics Performance by Post-Training NVIDIA Cosmos Reason

NVIDIA Cosmos Reason is an open and customizable vision language model designed for robotics and physical AI, enabling robots to reason using prior knowledge and common sense.

DockerFine-tuningHugging Face

Tsung-Yi Lin

5 min read

Includes Code

Has Summary

NVIDIA

Advanced

Efficient Transforms in cuDF Using JIT Compilation

The article discusses how JIT compilation enhances the performance of transforms in cuDF, a GPU-accelerated library for data processing.

Docker

Basit Ayantunde

8 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Accelerates OpenAI gpt-oss Models Delivering 1.5 M TPS Inference on NVIDIA GB200 NVL72

NVIDIA has optimized OpenAI's gpt-oss models for accelerated inference performance on the NVIDIA GB200 NVL72 system, achieving up to 1. 5 million tokens per second (TPS).

DockerHugging FaceOllamaPythonTransformerTransformers

Anu Srivastava

6 min read

Includes Code

Has Summary

NVIDIA

Advanced

Serverless Distributed Data Processing with Apache Spark and NVIDIA AI on Azure

The article discusses the deployment of a serverless, distributed data processing architecture using Apache Spark and NVIDIA AI on Azure.

ApacheApache SparkAzureDockerEmbeddingHTTPSHugging FacePythonREST APIServerlessSQLSQL Server

Alexander Spiridonov

9 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Enhancing Multilingual Human-Like Speech and Voice Cloning with NVIDIA Riva TTS

The article discusses the advancements in multilingual human-like speech synthesis and voice cloning using NVIDIA Riva TTS.

DockerTransformer

Maggie Zhang

9 min read

Has Summary

NVIDIA

Intermediate

Improving Synthetic Data Augmentation and Human Action Recognition with SynthDa

The article discusses SynthDa, a modular synthetic data augmentation pipeline aimed at improving human action recognition in AI systems.

DockerPython

Meg Rajendran

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

How to Streamline Complex LLM Workflows Using NVIDIA NeMo-Skills

The article discusses how to streamline complex workflows for large language models (LLMs) using NVIDIA NeMo-Skills.

DockerHugging FacePython

Igor Gitman

10 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Run Multimodal Extraction for More Efficient AI Pipelines Using One GPU

This article discusses the challenges of extracting insights from multimodal documents and presents a solution using the NVIDIA NeMo Retriever extraction pipeline.

AWSDockerGrafanaPrometheusPython

Lior Cohen

8 min read

Includes Code

Has Summary

NVIDIA

Advanced

Chat With Your Enterprise Data Through Open-Source AI-Q NVIDIA Blueprint

The article discusses the AI-Q NVIDIA Blueprint, an open-source framework designed to help enterprises leverage their data through AI-powered agents.

DockerLangChainLlamaIndexPythonSemantic Kernel

Nicola Sessions

8 min read

Has Summary

NVIDIA

Advanced

Simplify LLM Deployment and AI Inference with a Unified NVIDIA NIM Workflow

The article discusses how NVIDIA NIM simplifies the deployment of large language models (LLMs) by providing a unified workflow that abstracts the complexities of model loading, backend selection, a...

DockerHugging FaceMistral

Mehran Maghoumi

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

Accelerate Decision Optimization Using Open Source NVIDIA cuOpt

The article discusses how NVIDIA cuOpt, an open-source GPU-accelerated optimization tool, enhances decision-making processes in businesses by efficiently solving complex linear programming (LP), mi...

ApacheDockerJSONPythonREST API

Gordana Neskovic

5 min read

Includes Code

Has Summary

NVIDIA

Advanced

Reproducing NVIDIA MLPerf v5.0 Training Scores for LLM Benchmarks

This article provides a comprehensive guide on reproducing NVIDIA's MLPerf v5. 0 training scores for LLM benchmarks, specifically focusing on Llama 2 70B LoRA fine-tuning and Llama 3.

DockerHugging FacePython

Michał Marcinkiewicz

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Streamline Trade Capture and Evaluation with Self-Correcting AI Workflows

The article discusses the integration of AI workflows in automating trade capture and evaluation processes, emphasizing the challenges of achieving high reliability with free-form text inputs.

DockerJSONLangChain

Alexander Sokol

10 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Supercharging Fraud Detection in Financial Services with Graph Neural Networks (Updated)

The article discusses the application of Graph Neural Networks (GNNs) in enhancing fraud detection within financial services.

ApacheApache SparkDockerGraph Neural NetworksJSONKubernetesNeural NetworksXGBoost

Naim

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

Profiling LLM Training Workflows on NVIDIA Grace Hopper

The article discusses the exponential growth of large language models (LLMs) and the importance of profiling LLM training workflows on the NVIDIA Grace Hopper architecture.

DockerGPTPythonPyTorch

Karin Sevegnani

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

NVIDIA Dynamo Adds GPU Autoscaling, Kubernetes Automation, and Networking Optimizations

NVIDIA Dynamo's v0.

AWSDockerKubernetesYAML

Amr Elmeleegy

7 min read

Has Summary

NVIDIA

Intermediate

Advance Video Analytics AI Agents Using the NVIDIA AI Blueprint for Video Search and Summarization

The article discusses the advancements in video analytics through the NVIDIA AI Blueprint for Video Search and Summarization (VSS), highlighting the integration of Vision Language Models (VLMs), La...

AWSAzureDockerHelmKubernetes

Adam Ryason

13 min read

Includes Code

Has Summary

NVIDIA

Beginner

Curating Synthetic Datasets to Train Physical AI Models with NVIDIA Cosmos Reason

The article discusses NVIDIA Cosmos Reason, a world foundation model designed to enhance physical AI by curating synthetic datasets for training robots and autonomous vehicles.

DockerFine-tuningHugging Face

Tsung-Yi Lin

6 min read

Has Summary

NVIDIA

Advanced

Simplify Setup and Boost Data Science in the Cloud using NVIDIA CUDA-X and Coiled

The article discusses how to leverage NVIDIA CUDA-X and Coiled to simplify data science workflows in the cloud, particularly for analyzing large datasets like NYC ride-share journeys.

AWSAzureDeep LearningDockerLessPandasPython

Jaya Venkatesh

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

Accelerate Deep Learning and LLM Inference with Apache Spark in the Cloud

The article discusses how to accelerate Deep Learning (DL) and Large Language Model (LLM) inference using Apache Spark in cloud environments.

ApacheApache SparkAWSAzureDeep LearningDockerJSONNumPyPythonPyTorchSemantic SearchTensorFlowTransformers

Rishi Chandra

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

LLM Inference Benchmarking Guide: NVIDIA GenAI-Perf and NIM

This article serves as a comprehensive guide for benchmarking Large Language Models (LLMs) using NVIDIA's GenAI-Perf tool alongside NVIDIA NIM.

DockerGenerative AIHugging FaceLarge Language ModelsOpenAI API

Vinh Nguyen

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Choosing Your First Local AI Project

The article discusses the transition of AI from centralized cloud systems to local development on professional workstations, emphasizing the advantages of enhanced data privacy, cost savings, and o...

DockerGit

Sama Bali

6 min read

Has Summary

NVIDIA

Advanced

Deploying the NVIDIA AI Blueprint for Cost-Efficient LLM Routing

The article discusses the NVIDIA AI Blueprint for an LLM router, which provides a cost-efficient framework for dynamically routing prompts to the most suitable large language models (LLMs).

ChatGPTDockerFine-tuningGrafanaOpenAI APIPythonRust

Arun Raman

7 min read

Has Summary

NVIDIA

Intermediate

Maximize AI Agent Performance with Data Flywheels Using NVIDIA NeMo Microservices

The article discusses how to enhance AI agent performance using NVIDIA NeMo microservices and a data flywheel strategy.

DockerHelmKubernetesMicroservices

Sylendran Arunagiri

10 min read

Has Summary

NVIDIA

Intermediate

Build an AI Agent with Expert Reasoning Capabilities Using the DeepSeek-R1 NIM

The article discusses how to build AI agents with expert reasoning capabilities using the DeepSeek-R1 NIM microservice.

DockerElevenLabsJSONKubernetesTransformer

Mehran Maghoumi

8 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Deploying NVIDIA Riva Multilingual ASR with Whisper and Canary Architectures While Selectively Deactivating

The article discusses the deployment of NVIDIA Riva's multilingual Automatic Speech Recognition (ASR) capabilities using Whisper and Canary architectures. It highlights the new features in Riva 2.

DockerPythonWhisper

Sven Chilton

12 min read

Includes Code

Has Summary

NVIDIA

Intermediate

JSON Lines Reading with pandas 100x Faster Using NVIDIA cuDF

The article discusses how to read JSON Lines data using NVIDIA's cuDF library, achieving performance improvements of up to 100 times faster than traditional pandas methods.

ApacheApache ArrowApache SparkDockerJSONPython

Karthikeyan Natarajan

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework

The article discusses model pruning and knowledge distillation as effective strategies for creating smaller, more efficient language models using the NVIDIA NeMo framework.

DockerHugging FaceMistral

Gomathy Venkata Krishnan

9 min read

Includes Code

Has Summary