How NVIDIA Uses Google Cloud

116 engineering articles about Google Cloud from NVIDIA's engineering team

Other NVIDIA Technologies

Python (740), PyTorch (566), Deep Learning (505), TensorFlow (444), Docker (292), Kubernetes (251)

Articles

NVIDIA

Intermediate

NVFP4 Trains with Precision of 16-Bit and Speed and Efficiency of 4-Bit

The article discusses NVIDIA's NVFP4, a new 4-bit precision format for training large language models (LLMs) that enhances efficiency and scalability while maintaining accuracy.

Google Cloud, Mistral, Transformer

Kirthi Devleker

9 min read

Has Summary

NVIDIA

Intermediate

Optimizing Vector Search for Indexing and Real-Time Retrieval with NVIDIA cuVS

The article discusses the advancements in NVIDIA cuVS, a GPU-accelerated vector search library designed for high-performance indexing and low-latency retrieval.

Apache, Elasticsearch, Google Cloud, Java, Oracle, Python, Rust, scikit-learn, Vertex AI

Corey Nolet

7 min read

Has Summary

NVIDIA

Intermediate

RAPIDS Brings Zero-Code-Change Acceleration, IO Performance Gains, and Out-of-Core XGBoost

The article discusses the latest enhancements in RAPIDS, including zero-code-change acceleration for Python machine learning, significant IO performance improvements, and out-of-core XGBoost capabi...

Apache, Azure, Azure Blob Storage, Dask, Gemini, Google Cloud, Google Cloud Storage, LightGBM, NetworkX, Polars, Python, scikit-learn, XGBoost

Nick Becker

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

Spotlight: Build Scalable and Observable AI Ready for Production with Iguazio’s MLRun and NVIDIA NIM

The article discusses the collaboration between Iguazio and NVIDIA, focusing on how their combined technologies, MLRun and NVIDIA NIM, enable organizations to build scalable and observable AI solut...

AWS, Azure, Google Cloud, Kubernetes, Serverless

Amit Bleiweiss

6 min read

Has Summary

NVIDIA

Intermediate

Kaggle Grandmasters Unveil Winning Strategies for Data Science Superpowers

Kaggle Grandmasters David Austin, Chris Deotte, and Ruchi Bhatia shared insights on their winning strategies for data science competitions at the Google Cloud Next conference.

Google Cloud, LightGBM, MVP, scikit-learn, XGBoost

Jenn Yonemitsu

9 min read

Has Summary

NVIDIA

Intermediate

AI for a Greener Future: Its Power is in Our Hands

The article discusses the role of AI in promoting sustainability and addressing climate challenges.

AWS, Google Cloud

Michelle Horton

6 min read

Has Summary

NVIDIA

Intermediate

Delivering NVIDIA Accelerated Computing for Enterprise AI Workloads with Rafay

The article discusses the increasing demand for NVIDIA accelerated computing in enterprise AI workloads and how Rafay's platform-as-a-service (PaaS) model addresses the challenges of building self-...

AWS, Azure, Google Cloud, Kubernetes

Matheen Raza

7 min read

Has Summary

NVIDIA

Advanced

NVIDIA NeMo Retriever Delivers Accurate Multimodal PDF Data Extraction 15x Faster

The article discusses the advancements in NVIDIA's NeMo Retriever, which enables accurate multimodal PDF data extraction at a speed 15 times faster than traditional methods.

AWS, AWS SageMaker, Azure, Embedding, Google Cloud

Ruchika Kharwar

10 min read

Has Summary

NVIDIA

Advanced

Measure and Improve AI Workload Performance with NVIDIA DGX Cloud Benchmarking

The article discusses the importance of measuring and improving AI workload performance using NVIDIA DGX Cloud Benchmarking.

AWS, Azure, Google Cloud, Oracle, Transformer

Emily Potyraj

7 min read

Has Summary

NVIDIA

Intermediate

High-Performance Remote IO With NVIDIA KvikIO

The article discusses optimizing high-performance remote I/O operations using NVIDIA KvikIO for data analysis workloads on cloud object storage services.

Apache, AWS, Azure, Azure Blob Storage, Dask, Google Cloud, Google Cloud Storage, Python

Tom Augspurger

8 min read

Includes Code

Has Summary

NVIDIA

Advanced

Continued Pretraining of State-of-the-Art LLMs for Sovereign AI and Regulated Industries with

The article discusses the continued pretraining of the Colosseum 355B large language model (LLM) by Domyn, leveraging NVIDIA DGX Cloud infrastructure.

AWS, Azure, Crystal, Google Cloud, Transformer, YAML

Martin Cimmino

16 min read

Includes Code

Has Summary

NVIDIA

Advanced

Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain

This article provides a comprehensive guide on creating a custom Slackbot LLM agent using NVIDIA NIM and LangChain.

AWS, Azure, DynamoDB, Generative AI, Google Cloud, LangChain, PostgreSQL

Xhoni Shollaj

9 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Rapidly Create Real-Time Physics Digital Twins with NVIDIA Omniverse Blueprints

The article discusses the creation of real-time physics digital twins using NVIDIA Omniverse Blueprints, highlighting their importance in computer-aided engineering (CAE) and their application in v...

Azure, Google Cloud, Helm, Kubernetes, Oracle, Warp

John Linford

7 min read

Has Summary

NVIDIA

Intermediate

Spotlight: Dataloop Accelerates Multimodal Data Preparation Pipelines for LLMs with NVIDIA NIM

The article discusses how the partnership between NVIDIA and Dataloop is transforming the preparation of multimodal datasets for large language models (LLMs).

AWS, Azure, Google Cloud, Kubernetes, Mistral, Whisper

Amit Bleiweiss

9 min read

Has Summary

NVIDIA

Intermediate

Developing a 172B LLM with Strong Japanese Capabilities Using NVIDIA Megatron-LM

The article discusses the development of a 172 billion parameter large language model (LLM) with strong Japanese capabilities using NVIDIA Megatron-LM.

Generative AI, Google Cloud, GPT, Hugging Face, PaLM, Transformer, V

Kazuki Fujii

6 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Enhanced Security and Streamlined Deployment of AI Agents with NVIDIA AI Enterprise

The article discusses the advancements in AI agents facilitated by NVIDIA AI Enterprise, emphasizing enhanced security, streamlined deployment, and management of AI pipelines.

DGL, Embedding, Google Cloud, Helm, Kubernetes, Mistral, Python, PyTorch, TensorFlow

Charu Chaubal

5 min read

Has Summary

NVIDIA

Intermediate

Scale High-Performance AI Inference with Google Kubernetes Engine and NVIDIA NIM

The article discusses the integration of NVIDIA NIM with Google Kubernetes Engine (GKE) to enhance AI inference capabilities.

Google Cloud, Kubernetes, Microservices, PyTorch

Charlie Huang

6 min read

Includes Code

Has Summary

NVIDIA

Advanced

Memory Efficiency, Faster Initialization, and Cost Estimation with NVIDIA Collective Communications Library 2.

The article discusses the NVIDIA Collective Communications Library (NCCL) 2.

Google Cloud

Giuseppe Congiu

8 min read

Includes Code

Has Summary

NVIDIA

Advanced

Google Cloud Run Adds Support for NVIDIA L4 GPUs, NVIDIA NIM, and Serverless AI Inference Deployments at Scale

The article discusses the integration of NVIDIA L4 GPUs and NVIDIA NIM microservices with Google Cloud Run, enabling enterprises to deploy AI-enabled applications more efficiently.

Google Cloud, Google Compute Engine, Kubernetes, Mistral, OpenAI API, Serverless, Vertex AI

Uttara Kumar

6 min read

Includes Code

Has Summary

NVIDIA

Advanced

Revolutionizing Data Center Efficiency with the NVIDIA Grace Family

The article discusses the NVIDIA Grace family of CPUs, designed to enhance data center efficiency amidst rising data processing demands.

Azure, Fortran, Google Cloud, Java, Microservices, Oracle, Python, Rust

Ashraf Eassa

15 min read

Includes Code

Has Summary

NVIDIA

Advanced

Faster Insights from Luminary Cloud’s Engineering Simulations with NVIDIA GPUs

The article discusses how Luminary Cloud leverages NVIDIA GPUs to enhance engineering simulations, making them faster and more efficient.

Google Cloud

Ian Pegler

7 min read

Has Summary

NVIDIA

Intermediate

Transforming Financial Analysis with NVIDIA NIM

The article discusses how NVIDIA NIM can transform financial analysis by enabling faster and more accurate extraction of insights from earnings call transcripts.

Google Cloud, JSON, LangChain, Microservices, Mistral, Natural Language Processing, Python, PyTorch

Guilherme Pombo

13 min read

Includes Code

Has Summary

NVIDIA

Advanced

Optimize Processes for Large Spaces with the Multi-Camera Tracking Workflow

This article introduces the multi-camera tracking workflow developed by NVIDIA, aimed at optimizing processes in large spaces such as warehouses and airports.

AWS, Azure, Elasticsearch, Google Cloud, Grafana, Helm, Kubernetes, Microservices

Monika Jhuria

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud

The article discusses how Union.ai and NVIDIA DGX Cloud are transforming AI workflows by providing accessible, high-performance computing resources.

Azure, Azure Blob Storage, Fine-tuning, Google Cloud, Kubernetes, Large Language Models

Niels Bantilan

6 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Generate Stunning Images with Stable Diffusion XL on the NVIDIA AI Inference Platform

The article discusses how to generate stunning images using Stable Diffusion XL on the NVIDIA AI Inference Platform, highlighting the challenges of deploying diffusion models at scale and how NVIDI...

Deep Learning, Diffusion Models, Generative AI, Google Cloud, PIL, PyTorch, Stable Diffusion, TensorFlow, U-Net

Amr Elmeleegy

13 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Boost Meeting Productivity with AI-Powered Note-Taking and Summarization

The article discusses how AI-powered note-taking and summarization can enhance meeting productivity by leveraging a cloud-native microservice architecture.

AWS, Azure, Google Cloud, Helm, Large Language Models

Mohamed Elshenawy

6 min read

Has Summary

NVIDIA

Intermediate

Getting Started with Large Language Models for Enterprise Solutions

The article discusses the application of Large Language Models (LLMs) in enterprise solutions, highlighting their capabilities in enhancing productivity across various industries.

ChatGPT, Embedding, Generative AI, Google Cloud, GPT, Large Language Models, Mistral, Retrieval Augmented Generation, RLHF, Stable Diffusion

Erik Pounds

13 min read

Has Summary

NVIDIA

Intermediate

Power Your Business with NVIDIA AI Enterprise 4.0 for Production-Ready Generative AI

The article discusses NVIDIA AI Enterprise 4.0, a comprehensive solution designed to support enterprises in developing and deploying generative AI applications.

AWS, Azure, Generative AI, Google Cloud, Graph Neural Networks, Kubernetes, Machine Learning, Neural Networks, Python, Retrieval Augmented Generation

Phoebe Lee

4 min read

Has Summary

NVIDIA

Advanced

How to Build a Distributed Inference Cache with NVIDIA Triton and Redis

This article discusses how to build a distributed inference cache using NVIDIA Triton and Redis, highlighting the benefits and drawbacks of local versus distributed caching.

Caching, Docker, Google Cloud, Redis

Steve Lorello

12 min read

Includes Code

Has Summary

NVIDIA

Advanced

How to Deploy NVIDIA Riva Speech and Translation AI in the Public Cloud

The article provides a comprehensive guide on deploying NVIDIA Riva Speech and Translation AI in public cloud environments.

AWS, Azure, Docker, Google Cloud, gRPC, Helm, Kubernetes, Python, Terraform, Traefik

Sven Chilton

15 min read

Includes Code

Has Summary

NVIDIA

Advanced

Streamline Generative AI Development with NVIDIA NeMo on GPU-Accelerated Google Cloud

The article discusses how NVIDIA NeMo can streamline the development of generative AI applications on GPU-accelerated Google Cloud.

BERT, Dask, Fine-tuning, Generative AI, Google Cloud, GPT, Hugging Face, Python, Redis, Reinforcement Learning, T5, Transformer

Chintan Patel

9 min read

Has Summary

NVIDIA

Advanced

Develop and Deploy Scalable Generative AI Models Seamlessly with NVIDIA AI Workbench

The article discusses the NVIDIA AI Workbench, a unified toolkit designed to simplify the development and deployment of scalable generative AI models.

AWS, Azure, Fine-tuning, Generative AI, Git, Google Cloud, Gradio, Hugging Face, Python, Stable Diffusion, TensorFlow

Tyler Whitehouse

10 min read

Has Summary

NVIDIA

Advanced

Access the Latest in Vision AI Model Development Workflows with NVIDIA TAO Toolkit 5.0

The article discusses the release of NVIDIA TAO Toolkit 5.0, which provides a low-code framework for accelerating vision AI model development.

AutoML, Azure, CDN, Google Cloud, Kubernetes, ResNet, Transformer, Transformers, Vertex AI

Chintan Shah

13 min read

Has Summary

NVIDIA

Beginner

How to Train a Defect Detection Model Using Synthetic Data with NVIDIA Omniverse Replicator

The article discusses the process of training a defect detection model using synthetic data generated by NVIDIA Omniverse Replicator.

Google Cloud, JSON

Akhil Docca

8 min read

Has Summary

NVIDIA

Advanced

SDKs Accelerating Industry 5.0, Data Pipelines, Computational Science, and More Featured at NVIDIA GTC 2023

At NVIDIA GTC 2023, NVIDIA showcased significant updates to its AI software suite aimed at accelerating computing across various domains.

Apache, Apache Spark, AWS, Azure, BERT, Computer Vision, Deep Learning, Google Cloud, GPT, Hugging Face, Machine Learning, Python, PyTorch, Redis, TensorFlow

Siddharth Sharma

10 min read

Has Summary

NVIDIA

Intermediate

MONAI Reaches 1 Million Download Milestone Driven by Research Breakthroughs and Clinical Adoption

MONAI, an open-source medical imaging AI framework, has surpassed 1 million downloads, showcasing its impact on research and clinical applications.

Azure, Generative AI, Google Cloud, Oracle

Michael Zephyr

3 min read

Has Summary

NVIDIA

Intermediate

Supercharging AI Video and AI Inference Performance with NVIDIA L4 GPUs

The article discusses the introduction of NVIDIA L4 Tensor Core GPUs, highlighting their enhanced performance for AI video and inference tasks compared to the previous T4 generation.

Google Cloud, OpenCV

Abhishek Verma

9 min read

Has Summary

NVIDIA

Intermediate

Catapulting Enterprises to the Leading Edge of AI with NVIDIA AI Enterprise 3.1

The article discusses NVIDIA AI Enterprise 3.1, highlighting its role in accelerating enterprise adoption of AI through a comprehensive suite of tools and frameworks.

Apache, Apache Spark, AWS, Azure, ClearML, Generative AI, Google Cloud, Kubernetes, Oracle

Phoebe Lee

4 min read

Has Summary

NVIDIA

Intermediate

Smarter Retail Data Analytics with GPU Accelerated Apache Spark Workloads on Google Cloud Dataproc

The article discusses how retailers can enhance their data analytics capabilities using GPU-accelerated Apache Spark workloads on Google Cloud Dataproc.

Apache, Apache Spark, Google Cloud, Google Cloud Storage, JSON, PySpark, Python, Shell, SQL

Saurav Agarwal

12 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Machine Learning in Practice: Deploy an ML Model on Google Cloud Platform

This article provides a comprehensive guide on deploying machine learning models on Google Cloud Platform (GCP).

AutoML, AWS, Azure, Flask, Google Cloud, Google Cloud Functions, Google Cloud Storage, HTML, Iris, Machine Learning, Pandas, Python, scikit-learn, Serverless, Vertex AI

Kurtis Pykes

10 min read

Includes Code

Has Summary

NVIDIA

Beginner

Machine Learning in Practice: Build an ML Model

This article focuses on the practical aspects of building and training a machine learning (ML) model using Python, specifically utilizing the Iris Dataset.

Dask, Google Cloud, Iris, Machine Learning, Python, scikit-learn

Kurtis Pykes

5 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Machine Learning in Practice: ML Workflows

This article provides an overview of machine learning workflows, detailing the stages involved in developing and deploying machine learning models to deliver business value.

AWS, Azure, Django, FastAPI, Flask, Google Cloud, Google Cloud Functions, Machine Learning, Serverless, Streamlit

Kurtis Pykes

6 min read

Has Summary

NVIDIA

Intermediate

Simplifying and Accelerating Machine Learning Predictions in Apache Beam with NVIDIA TensorRT

This article discusses the integration of NVIDIA TensorRT with Apache Beam SDK to streamline and enhance machine learning predictions at scale.

Apache, Deep Learning, Docker, Google Cloud, Google Cloud Storage, Google Compute Engine, Machine Learning, Python, PyTorch, TensorFlow, torchvision

Alexander Zhurkevich

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Saving Apache Spark Big Data Processing Costs on Google Cloud Dataproc

The article discusses how organizations can reduce costs and improve performance in big data processing using Apache Spark on Google Cloud Dataproc with the RAPIDS Accelerator.

Apache, Apache Spark, Google Cloud, SQL

Karthikeyan Rajendran

8 min read

Has Summary

NVIDIA

Intermediate

MONAI Drives Medical AI on Google Cloud with Medical Imaging Suite

The article discusses the integration of MONAI, the Medical Open Network for AI, into the Google Cloud Medical Imaging Suite, which enhances medical imaging workflows through AI and ML technologies.

Deep Learning, Google Cloud, Python, PyTorch

Brad Genereaux

5 min read

Has Summary

NVIDIA

Advanced

New SDKs Accelerating AI Research, Computer Vision, Data Science, and More

NVIDIA has announced significant updates to its AI software suite, including JAX, NVIDIA CV-CUDA, and NVIDIA RAPIDS, aimed at accelerating AI research, computer vision, and data science.

Apache, Apache Spark, Computer Vision, Dask, Deep Learning, DGL, Google Cloud, GPT, JAX, Kubernetes, Neural Networks, NumPy, PyTorch, PyTorch Geometric, SQL

Siddharth Sharma

7 min read

Has Summary

NVIDIA

Intermediate

Democratizing and Accelerating Genome Sequencing Analysis with NVIDIA Clara Parabricks v4.0

The article discusses the release of NVIDIA Clara Parabricks v4.0.

Docker, Google Cloud, Oracle, PyTorch, TensorFlow

Harry Clifford

7 min read

Has Summary

NVIDIA

Intermediate

How MONAI Fuels Open Research for Medical AI Workflows

The article discusses how MONAI, the Medical Open Network for AI, empowers medical researchers by providing an open-source framework for developing AI workflows in healthcare.

AutoML, AWS, Google Cloud, Modal, PyTorch, Transformers

Prerna Dogra

5 min read

Has Summary

NVIDIA

Advanced

Faster Text Classification with Naive Bayes and GPUs

The article discusses the advantages of using Naive Bayes (NB) classifiers for text classification tasks, particularly when leveraging GPU acceleration through RAPIDS cuML.

Dask, Google Cloud, NumPy, Python, scikit-learn, SciPy

Mickael Ide

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Building a Computer Vision Application to Recognize Human Activities

This article discusses building a computer vision application to recognize human activities using NVIDIA AI software and Google Cloud Vertex AI.

Computer Vision, Docker, Google Cloud, Vertex AI

Abhishek Sawarkar

8 min read

Includes Code

Has Summary