How NVIDIA Uses Google Cloud
116 engineering articles about Google Cloud from NVIDIA's engineering team
Other NVIDIA Technologies
Other Companies Using Google Cloud
Articles
Filter:
The article discusses NVIDIA's NVFP4, a new 4-bit precision format for training large language models (LLMs) that enhances efficiency and scalability while maintaining accuracy.
Kirthi Devleker
9 min read
Has Summary
--
The article discusses the advancements in NVIDIA cuVS, a GPU-accelerated vector search library designed for high-performance indexing and low-latency retrieval.
Corey Nolet
7 min read
Has Summary
--
The article discusses the latest enhancements in RAPIDS, including zero-code-change acceleration for Python machine learning, significant IO performance improvements, and out-of-core XGBoost capabi...
ApacheAzureAzure Blob StorageDaskGeminiGoogle CloudGoogle Cloud StorageLightGBMNetworkXPolarsPythonscikit-learnXGBoost
Nick Becker
9 min read
Includes Code
Has Summary
--
The article discusses the collaboration between Iguazio and NVIDIA, focusing on how their combined technologies, MLRun and NVIDIA NIM, enable organizations to build scalable and observable AI solut...
Amit Bleiweiss
6 min read
Has Summary
--
Kaggle Grandmasters David Austin, Chris Deotte, and Ruchi Bhatia shared insights on their winning strategies for data science competitions at the Google Cloud Next conference.
Jenn Yonemitsu
9 min read
Has Summary
--
The article discusses the role of AI in promoting sustainability and addressing climate challenges.
Michelle Horton
6 min read
Has Summary
--
The article discusses the increasing demand for NVIDIA accelerated computing in enterprise AI workloads and how Rafay's platform-as-a-service (PaaS) model addresses the challenges of building self-...
Matheen Raza
7 min read
Has Summary
--
The article discusses the advancements in NVIDIA's NeMo Retriever, which enables accurate multimodal PDF data extraction at a speed 15 times faster than traditional methods.
Ruchika Kharwar
10 min read
Has Summary
--
The article discusses the importance of measuring and improving AI workload performance using NVIDIA DGX Cloud Benchmarking.
Emily Potyraj
7 min read
Has Summary
--
The article discusses optimizing high-performance remote I/O operations using NVIDIA KvikIO for data analysis workloads on cloud object storage services.
Tom Augspurger
8 min read
Includes Code
Has Summary
--
The article discusses the continued pretraining of the Colosseum 355B large language model (LLM) by Domyn, leveraging NVIDIA DGX Cloud infrastructure.
Martin Cimmino
16 min read
Includes Code
Has Summary
--
This article provides a comprehensive guide on creating a custom Slackbot LLM agent using NVIDIA NIM and LangChain.
Xhoni Shollaj
9 min read
Includes Code
Has Summary
--
The article discusses the creation of real-time physics digital twins using NVIDIA Omniverse Blueprints, highlighting their importance in computer-aided engineering (CAE) and their application in v...
John Linford
7 min read
Has Summary
--
The article discusses how the partnership between NVIDIA and Dataloop is transforming the preparation of multimodal datasets for large language models (LLMs).
Amit Bleiweiss
9 min read
Has Summary
--
The article discusses the development of a 172 billion parameter large language model (LLM) with strong Japanese capabilities using NVIDIA Megatron-LM.
Kazuki Fujii
6 min read
Includes Code
Has Summary
--
The article discusses the advancements in AI agents facilitated by NVIDIA AI Enterprise, emphasizing enhanced security, streamlined deployment, and management of AI pipelines.
Charu Chaubal
5 min read
Has Summary
--
The article discusses the integration of NVIDIA NIM with Google Kubernetes Engine (GKE) to enhance AI inference capabilities.
Charlie Huang
6 min read
Includes Code
Has Summary
--
The article discusses the NVIDIA Collective Communications Library (NCCL) 2.
Giuseppe Congiu
8 min read
Includes Code
Has Summary
--
The article discusses the integration of NVIDIA L4 GPUs and NVIDIA NIM microservices with Google Cloud Run, enabling enterprises to deploy AI-enabled applications more efficiently.
Uttara Kumar
6 min read
Includes Code
Has Summary
--
The article discusses the NVIDIA Grace family of CPUs, designed to enhance data center efficiency amidst rising data processing demands.
Ashraf Eassa
15 min read
Includes Code
Has Summary
--
The article discusses how Luminary Cloud leverages NVIDIA GPUs to enhance engineering simulations, making them faster and more efficient.
Ian Pegler
7 min read
Has Summary
--
The article discusses how NVIDIA NIM can transform financial analysis by enabling faster and more accurate insights extraction from earnings call transcripts.
Guilherme Pombo
13 min read
Includes Code
Has Summary
--
This article introduces the multi-camera tracking workflow developed by NVIDIA, aimed at optimizing processes in large spaces such as warehouses and airports.
Monika Jhuria
11 min read
Includes Code
Has Summary
--
The article discusses how Union. ai and NVIDIA DGX Cloud are transforming AI workflows by providing accessible, high-performance computing resources.
Niels Bantilan
6 min read
Includes Code
Has Summary
--
The article discusses how to generate stunning images using Stable Diffusion XL on the NVIDIA AI Inference Platform, highlighting the challenges of deploying diffusion models at scale and how NVIDI...
Amr Elmeleegy
13 min read
Includes Code
Has Summary
--
The article discusses how AI-powered note-taking and summarization can enhance meeting productivity by leveraging a cloud-native microservice architecture.
Mohamed Elshenawy
6 min read
Has Summary
--
The article discusses the application of Large Language Models (LLMs) in enterprise solutions, highlighting their capabilities in enhancing productivity across various industries.
ChatGPTEmbeddingGenerative AIGoogle CloudGPTLarge Language ModelsMistralRetrieval Augmented GenerationRLHFStable Diffusion
Erik Pounds
13 min read
Has Summary
--
The article discusses NVIDIA AI Enterprise 4. 0, a comprehensive solution designed to support enterprises in developing and deploying generative AI applications.
AWSAzureGenerative AIGoogle CloudGraph Neural NetworksKubernetesMachine LearningNeural NetworksPythonRetrieval Augmented Generation
Phoebe Lee
4 min read
Has Summary
--
This article discusses how to build a distributed inference cache using NVIDIA Triton and Redis, highlighting the benefits and drawbacks of local versus distributed caching.
Steve Lorello
12 min read
Includes Code
Has Summary
--
The article provides a comprehensive guide on deploying NVIDIA Riva Speech and Translation AI in public cloud environments.
Sven Chilton
15 min read
Includes Code
Has Summary
--
The article discusses how NVIDIA NeMo can streamline the development of generative AI applications on GPU-accelerated Google Cloud.
BERTDaskFine-tuningGenerative AIGoogle CloudGPTHugging FacePythonRedisReinforcement LearningT5Transformer
Chintan Patel
9 min read
Has Summary
--
The article discusses the NVIDIA AI Workbench, a unified toolkit designed to simplify the development and deployment of scalable generative AI models.
Tyler Whitehouse
10 min read
Has Summary
--
The article discusses the release of NVIDIA TAO Toolkit 5. 0, which provides a low-code framework for accelerating vision AI model development.
Chintan Shah
13 min read
Has Summary
--
The article discusses the process of training a defect detection model using synthetic data generated by NVIDIA Omniverse Replicator.
Akhil Docca
8 min read
Has Summary
--
At NVIDIA GTC 2023, NVIDIA showcased significant updates to its AI software suite aimed at accelerating computing across various domains.
ApacheApache SparkAWSAzureBERTComputer VisionDeep LearningGoogle CloudGPTHugging FaceMachine LearningPythonPyTorchRedisTensorFlow
Siddharth Sharma
10 min read
Has Summary
--
MONAI, an open-source medical imaging AI framework, has surpassed 1 million downloads, showcasing its impact on research and clinical applications.
Michael Zephyr
3 min read
Has Summary
--
The article discusses the introduction of NVIDIA L4 Tensor Core GPUs, highlighting their enhanced performance for AI video and inference tasks compared to the previous T4 generation.
Abhishek Verma
9 min read
Has Summary
--
The article discusses NVIDIA AI Enterprise 3. 1, highlighting its role in accelerating enterprise adoption of AI through a comprehensive suite of tools and frameworks.
Phoebe Lee
4 min read
Has Summary
--
The article discusses how retailers can enhance their data analytics capabilities using GPU-accelerated Apache Spark workloads on Google Cloud Dataproc.
Saurav Agarwal
12 min read
Includes Code
Has Summary
--
This article provides a comprehensive guide on deploying machine learning models on Google Cloud Platform (GCP).
AutoMLAWSAzureFlaskGoogle CloudGoogle Cloud FunctionsGoogle Cloud StorageHTMLIrisMachine LearningPandasPythonscikit-learnServerlessVertex AI
Kurtis Pykes
10 min read
Includes Code
Has Summary
--
This article focuses on the practical aspects of building and training a machine learning (ML) model using Python, specifically utilizing the Iris Dataset.
Kurtis Pykes
5 min read
Includes Code
Has Summary
--
This article provides an overview of machine learning workflows, detailing the stages involved in developing and deploying machine learning models to deliver business value.
Kurtis Pykes
6 min read
Has Summary
--
This article discusses the integration of NVIDIA TensorRT with Apache Beam SDK to streamline and enhance machine learning predictions at scale.
ApacheDeep LearningDockerGoogle CloudGoogle Cloud StorageGoogle Compute EngineMachine LearningPythonPyTorchTensorFlowtorchvision
Alexander Zhurkevich
11 min read
Includes Code
Has Summary
--
The article discusses how organizations can reduce costs and improve performance in big data processing using Apache Spark on Google Cloud Dataproc with the RAPIDS Accelerator.
Karthikeyan Rajendran
8 min read
Has Summary
--
The article discusses the integration of MONAI, the Medical Open Network for AI, into the Google Cloud Medical Imaging Suite, which enhances medical imaging workflows through AI and ML technologies.
Brad Genereaux
5 min read
Has Summary
--
NVIDIA has announced significant updates to its AI software suite, including JAX, NVIDIA CV-CUDA, and NVIDIA RAPIDS, aimed at accelerating AI research, computer vision, and data science.
ApacheApache SparkComputer VisionDaskDeep LearningDGLGoogle CloudGPTJAXKubernetesNeural NetworksNumPyPyTorchPyTorch GeometricSQL
Siddharth Sharma
7 min read
Has Summary
--
The article discusses the release of NVIDIA Clara Parabricks v4.
Harry Clifford
7 min read
Has Summary
--
The article discusses how MONAI, the Medical Open Network for AI, empowers medical researchers by providing an open-source framework for developing AI workflows in healthcare.
Prerna Dogra
5 min read
Has Summary
--
The article discusses the advantages of using Naive Bayes (NB) classifiers for text classification tasks, particularly when leveraging GPU acceleration through RAPIDS cuML.
Mickael Ide
11 min read
Includes Code
Has Summary
--
This article discusses building a computer vision application to recognize human activities using NVIDIA AI software and Google Cloud Vertex AI.
Abhishek Sawarkar
8 min read
Includes Code
Has Summary
--