How NVIDIA Uses GPT

134 engineering articles about GPT from NVIDIA's engineering team

Other NVIDIA Technologies

Python (740), PyTorch (566), Deep Learning (505), TensorFlow (444), Docker (292), Kubernetes (251)

Articles

NVIDIA

Advanced

Open Source AI Tool Upgrades Speed Up LLM and Diffusion Models on NVIDIA RTX PCs

The article discusses how recent upgrades to open source AI tools enhance the performance of small language models (SLMs) and diffusion models on NVIDIA RTX PCs.

Diffusion Models, GPT, Ollama, PyTorch

Annamalai Chockalingam

7 min read

Has Summary

NVIDIA

Intermediate

New Software and Model Optimizations Supercharge NVIDIA DGX Spark

The article discusses the latest software and model optimizations for NVIDIA DGX Spark, highlighting significant performance improvements in AI workflows.

GPT, Hugging Face, PyTorch

Allen Bourgoyne

5 min read

Has Summary

NVIDIA

Intermediate

Train Small Orchestration Agents to Solve Big Problems

The article discusses the development of small orchestration agents, specifically the ToolOrchestra method, which automates the selection and management of models and tools for task-solving in AI s...

Claude, GPT

Shizhe Diao

7 min read

Includes Code

Has Summary

NVIDIA

Beginner

Benchmarking LLMs on AI-Generated CUDA Code with ComputeEval 2025.2

The article discusses the benchmarking of AI coding assistants in writing efficient CUDA code using the ComputeEval framework.

Claude, GPT, Hugging Face

Daniel Rodriguez

2 min read

Has Summary

NVIDIA

Advanced

Democratizing Large-Scale Mixture-of-Experts Training with NVIDIA PyTorch Parallelism

The article discusses how NVIDIA's NeMo Automodel simplifies the training of large-scale mixture-of-experts (MoE) models in PyTorch, making it accessible to a broader audience.

GPT, Hugging Face, PyTorch, Transformer

Hemil Desai

7 min read

Includes Code

Has Summary

NVIDIA

Advanced

Advancing Explainable AI in Radiology Research with NVIDIA Clara Reason

The article discusses the advancements in Explainable AI for radiology through NVIDIA Clara Reason, focusing on the NV-Reason-CXR-3B model that enhances diagnostic transparency and mimics radiologi...

GPT, Hugging Face, PIL

Andriy Myronenko

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

How NVIDIA DGX Spark’s Performance Enables Intensive AI Tasks

The article discusses how the NVIDIA DGX Spark supercomputer enhances performance for intensive AI tasks, providing a local alternative to cloud computing.

Fine-tuning, GPT, Hugging Face, PyTorch, scikit-learn

Allen Bourgoyne

5 min read

Has Summary

NVIDIA

Advanced

Advancing Robotics Development with Neural Dynamics in Newton

The article discusses Neural Robot Dynamics (NeRD), a neural simulation framework designed to enhance robotics development by accurately predicting the dynamics of articulated robots.

Fine-tuning, GPT

Jie Xu

8 min read

Has Summary

NVIDIA

Intermediate

R²D²: Three Neural Breakthroughs Transforming Robot Learning from NVIDIA Research

The article discusses three neural innovations from NVIDIA Research that are enhancing robot learning capabilities, specifically focusing on bridging the gap between controlled simulations and real...

Assembly, Fine-tuning, GPT, Transformer, Warp

Rishabh Chadha

8 min read

Has Summary

NVIDIA

Advanced

How to Reduce KV Cache Bottlenecks with NVIDIA Dynamo

The article discusses how NVIDIA Dynamo can help reduce Key-Value (KV) Cache bottlenecks in large language model (LLM) inference by offloading cache data to more cost-effective storage solutions.

GPT, Grafana, Prometheus, Redis

Amr Elmeleegy

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Blackwell Ultra Sets New Inference Records in MLPerf Debut

The article discusses NVIDIA's Blackwell Ultra architecture, which sets new inference records in the MLPerf Inference v5.1 benchmark.

GPT, Stable Diffusion, Whisper

Zhihan Jiang

10 min read

Has Summary

NVIDIA

Advanced

Fine-Tuning gpt-oss for Accuracy and Performance with Quantization Aware Training

The article discusses fine-tuning the gpt-oss model for improved accuracy and performance through Quantization Aware Training (QAT) and Supervised Fine-Tuning (SFT).

GPT, Hugging Face, PyTorch, Transformer, Transformers

Eduardo Alvarez

7 min read

Includes Code

Has Summary

NVIDIA

Advanced

NVIDIA Hardware Innovations and Open Source Contributions Are Shaping AI

The article discusses how NVIDIA's hardware innovations, particularly the Blackwell architecture and NVFP4 precision, along with their open source contributions, are driving advancements in AI.

GPT, Hugging Face, JAX, Kubernetes, Python, PyTorch, Transformer

George Chellapa

8 min read

Has Summary

NVIDIA

Advanced

Streamlining Quantum Error Correction and Application Development with CUDA-QX 0.4

The article discusses the advancements in quantum error correction (QEC) and application development with the release of CUDA-QX 0.4.

GPT, Python

Shane Caldwell

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Introducing NVFP4 for Efficient and Accurate Low-Precision Inference

The article introduces NVFP4, a new 4-bit floating point format designed for efficient and accurate low-precision inference on NVIDIA's Blackwell architecture.

GPT, Hugging Face

Eduardo Alvarez

10 min read

Has Summary

NVIDIA

Advanced

Getting Started with Project G-Assist: Build a Twitch-Integrated Plug-in

Project G-Assist is an experimental AI assistant designed to help users control their RTX GPU and other PC settings using a natural language interface.

GPT, JSON, OAuth, Python

Sydney Altobell

7 min read

Includes Code

Has Summary

NVIDIA

Advanced

Transforming Quantum Education with AI Supercomputing and NVIDIA CUDA-Q Academic

The article discusses the integration of AI supercomputing with quantum computing education through the NVIDIA CUDA-Q platform.

GPT

Monica VanDieren

7 min read

Has Summary

NVIDIA

Advanced

Profiling LLM Training Workflows on NVIDIA Grace Hopper

The article discusses the exponential growth of large language models (LLMs) and the importance of profiling LLM training workflows on the NVIDIA Grace Hopper architecture.

Docker, GPT, Python, PyTorch

Karin Sevegnani

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

R²D²: Unlocking Robotic Assembly and Contact Rich Manipulation with NVIDIA Research

This article discusses NVIDIA's advancements in robotic assembly and contact-rich manipulation, highlighting innovative workflows and technologies that enhance flexibility, adaptability, and scalab...

Assembly, GPT

Oyindamola Omotuyi

8 min read

Has Summary

NVIDIA

Advanced

Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research

The article discusses the transformative role of domain-adapted large language models (LLMs) with reasoning capabilities in accelerating battery research.

Claude, Gemini, GPT, Kubernetes, LLaMA, scikit-learn

Rucha Apte

11 min read

Has Summary

NVIDIA

Intermediate

Developing an AI-Powered Tool for Automatic Citation Validation Using NVIDIA NIM

The article discusses the development of an AI-powered tool for automatic citation validation using NVIDIA NIM, aimed at improving the accuracy of citations in academic and AI-generated content.

Embedding, Generative AI, GPT, LangChain, Streamlit

Sebastian Haan

8 min read

Has Summary

NVIDIA

Intermediate

NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0

The article discusses the advancements of NVIDIA's Blackwell architecture, highlighting its significant performance improvements in MLPerf Inference v5.0.

GPT, Kong, ResNet, Stable Diffusion, Transformer, U-Net

Ashraf Eassa

9 min read

Has Summary

NVIDIA

Intermediate

NVIDIA Deep Learning Institute Releases New Generative AI Teaching Kit

NVIDIA has released a new Generative AI Teaching Kit aimed at enhancing education in generative AI technologies.

Deep Learning, Diffusion Models, Generative AI, GPT, Large Language Models, Transformer

Joe Bungo

7 min read

Has Summary

NVIDIA

Intermediate

Bring NVIDIA ACE AI Characters to Games with the New In-Game Inferencing SDK

The article discusses the integration of NVIDIA ACE AI characters into games using the new In-Game Inferencing SDK (NVIGI).

Embedding, GPT, Mistral, Whisper

Allyson Vasquez

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Optimize AI Inference Performance with NVIDIA Full-Stack Solutions

The article discusses how NVIDIA's full-stack solutions, including the newly renamed NVIDIA Dynamo Triton, optimize AI inference performance.

Generative AI, GPT, Stable Diffusion, Transformer

Nick Comly

9 min read

Has Summary

NVIDIA

Intermediate

GPU Memory Essentials for AI Performance

The article discusses the importance of GPU memory in enhancing AI performance, particularly for local AI model execution.

Generative AI, GPT

Sama Bali

6 min read

Has Summary

NVIDIA

Advanced

Evaluating GenMol as a Generalist Foundation Model for Molecular Generation

The article evaluates GenMol, a generalist foundation model for molecular generation, comparing it with SAFE-GPT.

BERT, Embedding, GPT, Oracle

Kyle Tretina

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

A Guide to Retrieval-Augmented Generation for AEC

This article provides an in-depth exploration of Retrieval-Augmented Generation (RAG) and its transformative potential for the Architecture, Engineering, and Construction (AEC) industry.

Embedding, Generative AI, GPT, Helm

Sama Bali

12 min read

Has Summary

NVIDIA

Intermediate

Deploy Agents, Assistants, and Avatars on NVIDIA RTX AI PCs with New Small Language Models

NVIDIA has introduced a series of small language models (SLMs) designed to enhance the capabilities of digital humans, allowing them to provide more relevant responses and understand visual inputs.

GPT, Mistral

Ike Nnoli

4 min read

Has Summary

NVIDIA

Advanced

Fine-Tuning Small Language Models to Optimize Code Review Accuracy

The article discusses the fine-tuning of small language models (SLMs) to enhance code review accuracy, addressing challenges faced by enterprises in adopting large foundational models.

Docker, Fine-tuning, Generative AI, GPT, GPT-4, JSON, Python

Japinder Singh

14 min read

Includes Code

Has Summary

NVIDIA

Advanced

NVIDIA Partners Accelerate Quantum Breakthroughs with AI Supercomputing

NVIDIA is advancing quantum computing through partnerships that integrate AI supercomputing with quantum hardware, aiming to overcome current technological challenges.

Artificial Intelligence, Generative AI, GPT, Solid, Transformers

Marwa Farag

7 min read

Has Summary

NVIDIA

Intermediate

NVIDIA Blackwell Doubles LLM Training Performance in MLPerf Training v4.1

The article discusses the significant performance improvements of the NVIDIA Blackwell platform in LLM training, showcasing its capabilities in the latest MLPerf Training v4.1 benchmarks.

GPT, Natural Language Processing, Transformer

Sukru Burc Eryilmaz

8 min read

Has Summary

NVIDIA

Intermediate

Developing a 172B LLM with Strong Japanese Capabilities Using NVIDIA Megatron-LM

The article discusses the development of a 172 billion parameter large language model (LLM) with strong Japanese capabilities using NVIDIA Megatron-LM.

Generative AI, Google Cloud, GPT, Hugging Face, PaLM, Transformer, V

Kazuki Fujii

6 min read

Includes Code

Has Summary

NVIDIA

Advanced

Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes

The article discusses how to scale Large Language Models (LLMs) using NVIDIA Triton and NVIDIA TensorRT-LLM in a Kubernetes environment.

AWS, Azure, Docker, Generative AI, GPT, Grafana, Helm, Hugging Face, Kubernetes, NGINX, Prometheus, Python, PyTorch, TensorFlow, Traefik

Maggie Zhang

16 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Contributes NVIDIA GB200 NVL72 Designs to Open Compute Project

NVIDIA has contributed the NVIDIA GB200 NVL72 designs to the Open Compute Project, enhancing the utility of design standards for modern data centers.

GPT, Python, PyTorch

Amr Elmeleegy

9 min read

Has Summary

NVIDIA

Advanced

Advanced RAG Techniques for Telco O-RAN Specifications Using NVIDIA NIM Microservices

The article discusses advanced Retrieval-Augmented Generation (RAG) techniques applied to telecommunications standards, specifically O-RAN, using NVIDIA NIM microservices.

GPT, GPT-4, LangChain, Microservices, Mistral, Streamlit

Amparo Canaveras

7 min read

Has Summary

NVIDIA

Advanced

Advancing Quantum Algorithm Design with GPTs

The article discusses the integration of generative pre-trained transformers (GPTs) into quantum algorithm design, specifically through the Generative Quantum Eigensolver (GQE) technique.

GPT, PyTorch

Mark Wolf

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA GH200 Grace Hopper Superchip Delivers Outstanding Performance in MLPerf Inference v4.1

The article discusses the performance of the NVIDIA GH200 Grace Hopper Superchip in the latest MLPerf Inference v4.1.

GPT, Oracle, Retrieval Augmented Generation

Amr Elmeleegy

6 min read

Has Summary

NVIDIA

Intermediate

Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer

The article discusses the implementation of post-training quantization (PTQ) for large language models (LLMs) using NVIDIA NeMo and NVIDIA TensorRT Model Optimizer.

GPT, Python

Jan Lasek

10 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Blackwell Platform Sets New LLM Inference Records in MLPerf Inference v4.1

The article discusses NVIDIA's Blackwell platform, which has set new records in the MLPerf Inference v4.1 benchmarks for large language model (LLM) inference.

BERT, Generative AI, GPT, Mistral, ResNet, Stable Diffusion, Transformer, U-Net

Ashraf Eassa

12 min read

Has Summary

NVIDIA

Advanced

LLM Research Rewrites the Role of AI in Safeguarding Sustainable Systems

The article discusses how Large Language Models (LLMs) are being utilized to enhance the monitoring and safeguarding of critical infrastructure systems.

GPT, Mistral

Michelle Horton

3 min read

Has Summary

NVIDIA

Intermediate

Leverage the Latest Open Models for Synthetic Data Generation with NVIDIA Nemotron-4-340B

The article discusses the introduction of NVIDIA's Nemotron-4-340B family of models designed for synthetic data generation (SDG), emphasizing their application in creating high-quality training dat...

GPT, GPT-4

Chris Alexiuk

8 min read

Has Summary

NVIDIA

Intermediate

Interactive AI Tool Delivers Immersive Video Content to Blind and Low-Vision Viewers

The article discusses a new AI-powered system called SPICA designed to enhance video accessibility for blind and low-vision viewers.

GPT, GPT-4

Michelle Horton

4 min read

Has Summary

NVIDIA

Advanced

Writer Releases Domain-Specific LLMs for Healthcare and Finance

Writer has launched two domain-specific AI models, Palmyra-Med 70B and Palmyra-Fin 70B, enhancing NVIDIA NIM's capabilities in healthcare and finance.

Claude, Gemini, GPT, GPT-4, PaLM

Sam Julien

5 min read

Has Summary

NVIDIA

Advanced

Automating Telco Network Design using NVIDIA NIM and NVIDIA NeMo

The article discusses how Infosys has automated the generation of TOSCA templates for telecom network design using NVIDIA NIM and NVIDIA NeMo.

Azure, Embedding, GPT, GPT-4, Hugging Face, LangChain, Mistral, React, YAML

Balamurugan Natarajan

6 min read

Includes Code

Has Summary

NVIDIA

Advanced

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

The article discusses the new functionalities of NVIDIA Megatron-Core, an open-source library designed to enhance the efficiency of training generative AI models.

AWS, Azure, BERT, CLIP, Gemini, Generative AI, GPT, Hugging Face, Mistral, Python, PyTorch, T5

Erin Ho

10 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Understanding Diffusion Models: An Essential Guide for AEC Professionals

This article explores the transformative potential of diffusion models within the Architecture, Engineering, and Construction (AEC) industry, highlighting their ability to generate high-quality vis...

DALL-E, Diffusion Models, Fine-tuning, Generative AI, GPT, GPT-4, Midjourney, Stable Diffusion

Sama Bali

12 min read

Has Summary

NVIDIA

Intermediate

Building Cyber Language Models to Unlock New Cybersecurity Capabilities

The article discusses the development of specialized cyber language models designed to enhance cybersecurity capabilities by effectively processing and generating machine logs.

AWS, Azure, GPT, JSON

Gorkem Batmaz

12 min read

Includes Code

Has Summary

NVIDIA

Advanced

Deploy GPU-Optimized AI Software with One Click Using Brev.dev and NVIDIA NGC Catalog

The article discusses how Brev.dev simplifies the deployment of GPU-optimized AI software using NVIDIA's NGC catalog, enabling developers to launch AI solutions quickly and efficiently.

Fine-tuning, GPT, Hugging Face, Mistral, Python

Nirmal Kumar Juluru

6 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

The article introduces DoRA, a high-performing alternative to Low-Rank Adaptation (LoRA) for fine-tuning pretrained models.

ChatGPT, GPT, GPT-4, Hugging Face

Min-Hung Chen

5 min read

Has Summary