How NVIDIA Uses Dask

76 engineering articles about Dask from NVIDIA's engineering team

Other NVIDIA Technologies

Python (740), PyTorch (566), Deep Learning (505), TensorFlow (444), Docker (292), Kubernetes (251)

Other Companies Using Dask

Uber (2)

Articles

NVIDIA

Advanced

How to Accelerate Community Detection in Python Using GPU-Powered Leiden

The article discusses the importance of community detection algorithms, particularly the Leiden algorithm, in analyzing large-scale graph data using GPU acceleration via cuGraph.

Dask, igraph, NetworkX, Python

Rick Ratzel

8 min read

Includes Code

Has Summary

NVIDIA

Advanced

Train with Terabyte-Scale Datasets on a Single NVIDIA Grace Hopper Superchip Using XGBoost 3.0

The article discusses the advancements in XGBoost 3.0, particularly its ability to train with terabyte-scale datasets on a single NVIDIA Grace Hopper Superchip.

Dask, SHAP, XGBoost

Dante Gama Dessavre

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

RAPIDS Adds GPU Polars Streaming, a Unified GNN API, and Zero-Code ML Speedups

RAPIDS version 25.

Dask, Polars, Python, PyTorch, scikit-learn

Brian Tepera

6 min read

Includes Code

Has Summary

NVIDIA

Advanced

How to Work with Data Exceeding VRAM in the Polars GPU Engine

This article discusses strategies for processing large datasets that exceed GPU VRAM using the Polars GPU engine, specifically focusing on Unified Virtual Memory (UVM) and multi-GPU streaming execu...

Dask, Polars

Jamil Semaan

4 min read

Has Summary

NVIDIA

Intermediate

Driving Toward Billion-Cell Analysis and Biological Breakthroughs with RAPIDS-singlecell

The article discusses the advancements in single-cell analysis facilitated by RAPIDS-singlecell, an open-source tool that leverages GPU acceleration to handle large datasets efficiently.

Dask, NumPy, Python, SciPy

TJ Chen

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

RAPIDS Brings Zero-Code-Change Acceleration, IO Performance Gains, and Out-of-Core XGBoost

The article discusses the latest enhancements in RAPIDS, including zero-code-change acceleration for Python machine learning, significant IO performance improvements, and out-of-core XGBoost capabi...

Apache, Azure, Azure Blob Storage, Dask, Gemini, Google Cloud, Google Cloud Storage, LightGBM, NetworkX, Polars, Python, scikit-learn, XGBoost

Nick Becker

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl

The article discusses the development of the Nemotron-CC dataset, a high-quality trillion-token dataset for pretraining large language models (LLMs) using Common Crawl data.

Dask, HTML

Nirmal Kumar Juluru

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

High-Performance Remote IO With NVIDIA KvikIO

The article discusses optimizing high-performance remote I/O operations using NVIDIA KvikIO for data analysis workloads on cloud object storage services.

Apache, AWS, Azure, Azure Blob Storage, Dask, Google Cloud, Google Cloud Storage, Python

Tom Augspurger

8 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Enhancing Generative AI Model Accuracy with NVIDIA NeMo Curator

The article discusses the significance of high-quality data in enhancing the accuracy of generative AI models, focusing on the capabilities of NVIDIA NeMo Curator for data curation and processing.

Dask, Generative AI, JSON

Nirmal Kumar Juluru

5 min read

Has Summary

NVIDIA

Intermediate

Accelerating GPU Analytics Using RAPIDS and Ray

The article discusses how to accelerate GPU analytics using RAPIDS and Ray, two powerful frameworks for distributed data science and AI applications.

Dask, Python

Peter Entschev

4 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models

The article discusses the introduction of new NVIDIA NeMo Curator classifier models that enhance training data quality for generative AI.

BERT, Dask, Hugging Face

Tom Balough

10 min read

Includes Code

Has Summary

NVIDIA

Advanced

RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs

RAPIDS 24.

AWS, AWS S3, Dask, Polars, Rapids

Nick Becker

7 min read

Includes Code

Has Summary

NVIDIA

Intermediate

NVIDIA Deep Learning Institute Releases New Data Science Teaching Kit for Educators

The NVIDIA Deep Learning Institute has launched the Accelerated Data Science Teaching Kit, aimed at educators to enhance data science education.

Dask, Deep Learning, Machine Learning, NetworkX, Neural Networks, Polars, Python

Joe Bungo

3 min read

Has Summary

NVIDIA

Advanced

Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask

The article discusses best practices for multi-GPU data analysis using RAPIDS with Dask, emphasizing the need for efficient memory management and accelerated networking.

Dask, Pandas, Python, PyTorch, XGBoost

Ben Zaitlen

5 min read

Includes Code

Has Summary

NVIDIA

Advanced

Processing High-Quality Vietnamese Language Data with NVIDIA NeMo Curator

This article discusses the use of NVIDIA NeMo Curator for processing high-quality Vietnamese language data, highlighting the challenges faced by large language models (LLMs) in non-English language...

Dask, Embedding, Hugging Face, Python, YAML

Hoang Nguyen

16 min read

Includes Code

Has Summary

NVIDIA

Advanced

Mastering LLM Techniques: Text Data Processing

The article discusses techniques for processing text data to optimize the performance of Large Language Models (LLMs).

BERT, Dask, Transformer

Amit Bleiweiss

13 min read

Has Summary

NVIDIA

Advanced

Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator

The article discusses the process of streamlining data processing for Domain Adaptive Pretraining (DAPT) of large language models (LLMs) using NVIDIA NeMo Curator.

Dask, Perl, Python

Mehran Maghoumi

16 min read

Includes Code

Has Summary

NVIDIA

Advanced

Curating Custom Datasets for LLM Parameter-Efficient Fine-Tuning with NVIDIA NeMo Curator

The article discusses how to curate custom datasets for parameter-efficient fine-tuning of large language models (LLMs) using NVIDIA NeMo Curator.

Dask, JSON, Python

Mehran Maghoumi

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator

The article discusses the importance of data curation in training large language models (LLMs), particularly for low-resourced languages.

Dask, YAML

Arham Mehta

12 min read

Includes Code

Has Summary

NVIDIA

Advanced

Curating Custom Datasets for LLM Training with NVIDIA NeMo Curator

The article discusses the importance of data curation in training large language models (LLMs) and introduces NVIDIA NeMo Curator, an open-source framework designed for creating high-quality datase...

Dask, GPT, GPT-4, Hugging Face, JSON

Mehran Maghoumi

14 min read

Includes Code

Has Summary

NVIDIA

Intermediate

RAPIDS on Databricks: A Guide to GPU-Accelerated Data Processing

This article provides a comprehensive guide on leveraging RAPIDS for GPU-accelerated data processing on Databricks.

Apache, Apache Spark, Dask, Python, Rapids, SQL, XGBoost

Sheilah Kirui

10 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Scale and Curate High-Quality Datasets for LLM Training with NVIDIA NeMo Curator

The article discusses the NVIDIA NeMo Curator framework, an open-source tool designed to streamline the data curation process for training large language models (LLMs).

Apache, Dask, Hugging Face, JSON

Mehran Maghoumi

6 min read

Has Summary

NVIDIA

Advanced

Unlocking Multi-GPU Model Training with Dask XGBoost

The article discusses how to optimize multi-GPU model training using Dask and XGBoost, addressing common challenges such as out-of-memory errors.

Dask, Python, XGBoost

Jiwei Liu

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Streamline Generative AI Development with NVIDIA NeMo on GPU-Accelerated Google Cloud

The article discusses how NVIDIA NeMo can streamline the development of generative AI applications on GPU-accelerated Google Cloud.

BERT, Dask, Fine-tuning, Generative AI, Google Cloud, GPT, Hugging Face, Python, Redis, Reinforcement Learning, T5, Transformer

Chintan Patel

9 min read

Has Summary

NVIDIA

Intermediate

Pro Tips for Building Multilingual Recommender Systems

This article provides insights into building multilingual recommender systems, focusing on a two-stage candidate reranker approach.

Dask, Hugging Face

Chris Deotte

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Unlocking the Power of Enterprise-Ready LLMs with NVIDIA NeMo

The article discusses NVIDIA NeMo, an end-to-end platform designed to facilitate the development and deployment of enterprise-ready large language models (LLMs).

ChatGPT, Dask, Embedding, Hugging Face, Redis, RLHF

Amanda Saunders

9 min read

Has Summary

NVIDIA

Intermediate

Curating Trillion-Token Datasets: Introducing NVIDIA NeMo Data Curator

The article introduces the NVIDIA NeMo Data Curator, a scalable tool designed for curating trillion-token multilingual datasets for training large language models (LLMs).

Dask, GPT, LLaMA, Python

Joseph Jennings

8 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Debugging a Mixed Python and C Language Stack

This article discusses the challenges of debugging in a mixed Python and C language stack, particularly in the context of the RAPIDS project.

Dask, Docker, Numba, Python

Peter Entschev

18 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Accelerated Data Analytics: Speed Up Data Exploration with RAPIDS cuDF

The article discusses how NVIDIA's RAPIDS cuDF can significantly accelerate data analytics workflows, particularly in exploratory data analysis (EDA).

Apache, Apache Spark, Dask, Machine Learning, Pandas, Python

Prachi Goel

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Accelerated Data Analytics: Faster Time Series Analysis with RAPIDS cuDF

The article discusses how RAPIDS cuDF can significantly accelerate time series data analysis, providing speed improvements of up to 40x compared to traditional pandas workflows.

Apache, Apache Spark, Dask, Pandas

Prachi Goel

9 min read

Includes Code

Has Summary

NVIDIA

Advanced

Maximizing Performance with Massively Parallel Hash Maps on GPUs

This article discusses the optimization of hash maps for GPU acceleration, focusing on their memory access patterns and performance benefits.

Dask, Natural Language Processing

Daniel Juenger

18 min read

Includes Code

Has Summary

NVIDIA

Beginner

Machine Learning in Practice: Build an ML Model

This article focuses on the practical aspects of building and training a machine learning (ML) model using Python, specifically utilizing the Iris Dataset.

Dask, Google Cloud, Iris, Machine Learning, Python, scikit-learn

Kurtis Pykes

5 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Accelerating Digital Pathology Workflows Using cuCIM and NVIDIA GPUDirect Storage

The article discusses how NVIDIA's cuCIM and GPUDirect Storage can significantly enhance digital pathology workflows by improving input/output performance and image processing tasks.

Apache, Dask, Python

Gregory Lee

9 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Optimizing Fraud Detection in Financial Services with Graph Neural Networks and NVIDIA GPUs

The article discusses how Graph Neural Networks (GNNs) and NVIDIA GPUs can optimize fraud detection in financial services.

AWS, Dask, DGL, Graph Neural Networks, Neural Networks, Python, PyTorch, PyTorch Geometric, XGBoost

Ashish Sardana

21 min read

Includes Code

Has Summary

NVIDIA

Advanced

New SDKs Accelerating AI Research, Computer Vision, Data Science, and More

NVIDIA has announced significant updates to its AI software suite, including JAX, NVIDIA CV-CUDA, and NVIDIA RAPIDS, aimed at accelerating AI research, computer vision, and data science.

Apache, Apache Spark, Computer Vision, Dask, Deep Learning, DGL, Google Cloud, GPT, JAX, Kubernetes, Neural Networks, NumPy, PyTorch, PyTorch Geometric, SQL

Siddharth Sharma

7 min read

Has Summary

NVIDIA

Advanced

Accelerating ETL on KubeFlow with RAPIDS

The article discusses how to accelerate ETL processes on KubeFlow using RAPIDS, a data science framework that leverages GPUs for improved performance.

Dask, Docker, Kubernetes, NumPy, Pandas, Python, scikit-learn, YAML

Jacob Tomlinson

12 min read

Includes Code

Has Summary

NVIDIA

Advanced

Faster Text Classification with Naive Bayes and GPUs

The article discusses the advantages of using Naive Bayes (NB) classifiers for text classification tasks, particularly when leveraging GPU acceleration through RAPIDS cuML.

Dask, Google Cloud, NumPy, Python, scikit-learn, SciPy

Mickael Ide

11 min read

Includes Code

Has Summary

NVIDIA

Advanced

Optimizing Access to Parquet Data with fsspec

The article discusses the optimization of accessing Parquet data using the fsspec library, particularly through the new fsspec.parquet module.

Dask, Google Cloud, Google Cloud Storage, Python

Rick Zamora

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Insider’s Guide to GTC: Cybersecurity, Data Center, Data Science, and Networking

The article provides an overview of the upcoming GTC event, highlighting key sessions focused on Cybersecurity, Data Center, Data Science, and Networking.

Apache, Apache Spark, Azure, Dask, Machine Learning

Michelle Horton

5 min read

Has Summary

NVIDIA

Intermediate

Natural Language Processing First Steps: How Algorithms Understand Text

This article introduces the foundational techniques for preparing text data for Natural Language Processing (NLP) using vectorization, hashing, and tokenization.

Dask, Natural Language Processing, scikit-learn

Edward Krueger

10 min read

Has Summary

NVIDIA

Intermediate

Accelerated Portfolio Construction with Numba and Dask in Python

This article discusses how to accelerate portfolio construction algorithms using Numba and Dask in Python, achieving up to 800x speed improvements on GPUs.

Dask, Numba, NumPy, Python, PyTorch, TensorFlow

Yi Dong

8 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Accelerating Interpretable Machine Learning for Diversified Portfolio Construction

The article discusses how Munich Re Markets leverages interpretable machine learning to enhance portfolio construction strategies in the Life and Pension industry.

Dask, Machine Learning, Numba, Python, SHAP, XGBoost

Jochen Papenbrock

10 min read

Has Summary

NVIDIA

Intermediate

Analyzing the RNA-Sequence of 1.3M Mouse Brain Cells with RAPIDS on NVIDIA GPUs

This article discusses the analysis of RNA sequencing data from 1.3 million mouse brain cells using RAPIDS on NVIDIA GPUs.

Dask, Google Cloud, NumPy

Corey Nolet

7 min read

Has Summary

NVIDIA

Beginner

Zero to RAPIDS in Minutes with NVIDIA GPUs + Saturn Cloud

The article discusses how to leverage NVIDIA GPUs and the Saturn Cloud platform to accelerate data science workflows using RAPIDS.

Dask, Docker, NumPy, Python, PyTorch, scikit-learn, SciPy, TensorFlow, XGBoost

Jacob Schmitt

8 min read

Includes Code

Has Summary

NVIDIA

Advanced

Input and Output Configurability in RAPIDS cuML

The article discusses the input and output configurability of the RAPIDS cuML machine learning library, highlighting its support for various data formats and the benefits of using GPU memory for pe...

Dask, Machine Learning, Numba, NumPy, Pandas, Python, PyTorch, scikit-learn, TensorFlow

Dante Gama Dessavre

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Accelerating XGBoost on GPU Clusters with Dask

The article discusses how to accelerate XGBoost on GPU clusters using Dask, highlighting the new Dask interface introduced in XGBoost 1.4.

Dask, Machine Learning, scikit-learn, SHAP, XGBoost

Belen Tegegn

11 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Accelerating Sequential Python User-Defined Functions with RAPIDS on GPUs for 100X Speedups

This article discusses how to accelerate sequential Python User-Defined Functions (UDFs) using RAPIDS on GPUs, achieving speedups of up to 100x.

Apache, Apache Spark, Dask, Docker, Numba, Python

Vibhu Jawa

5 min read

Includes Code

Has Summary

NVIDIA

Intermediate

Data Science - Top Resources from GTC 21

The article discusses how accelerated data science can enhance data analytics workflows by leveraging NVIDIA technologies, significantly improving performance and reducing costs.

Dask, Machine Learning

Chase Hooley

3 min read

Has Summary

NVIDIA

Intermediate

High-Performance Python Communication with UCX-Py

The article discusses UCX-Py, an accelerated networking library that enhances communication performance for Python applications, particularly in the context of GPU and distributed computing.

Dask, Fiber, Pandas, Python, scikit-learn

Belen Tegegn

9 min read

Includes Code

Has Summary

NVIDIA

Intermediate

On-Demand Technical Sessions: Develop and Deploy AI Solutions in the Cloud Using NVIDIA NGC

The article discusses on-demand technical sessions from GTC '21 that focus on developing and deploying AI solutions in the cloud using NVIDIA NGC.

AWS, Azure, Dask, Helm, Machine Learning, Transfer Learning

Chintan Patel

2 min read

Has Summary