How NVIDIA Uses Pandas
42 engineering articles about Pandas from NVIDIA's engineering team
Other NVIDIA Technologies
Other Companies Using Pandas
Articles
Filter:
The article discusses how to leverage NVIDIA CUDA-X and Coiled to simplify data science workflows in the cloud, particularly for analyzing large datasets like NYC ride-share journeys.
The article discusses how feature engineering, particularly using NVIDIA cuDF-pandas for GPU acceleration, can significantly enhance model accuracy in Kaggle competitions involving tabular data.
The article discusses how the NVIDIA RAPIDS Accelerator for Apache Spark enables zero code change for GPU-accelerated data processing, enhancing the performance of Apache Spark ML applications.
The article highlights significant advancements in NVIDIA technologies throughout 2024, focusing on NVIDIA NIM, breakthroughs in large language models (LLMs), and optimizations in data science.
Michelle Horton
3 min read
Has Summary
--
The article discusses how Unified Virtual Memory (UVM) enhances the performance of pandas through the RAPIDS cuDF library, enabling GPU acceleration without code changes.
Prem Sagar Gali
5 min read
Includes Code
Has Summary
--
The article discusses best practices for multi-GPU data analysis using RAPIDS with Dask, emphasizing the need for efficient memory management and accelerated networking.
The article discusses how NVIDIA and ArangoDB have enhanced the performance and scalability of graph analytics for NetworkX users without requiring code changes.
The article discusses how RAPIDS AI can accelerate predictive maintenance in manufacturing by leveraging advanced data analytics to minimize downtime and optimize maintenance schedules.
Amarnath Mohan
11 min read
Includes Code
Has Summary
--
This article provides a comprehensive guide on encoding and compression techniques for string data in the Parquet format using RAPIDS.
The article discusses the integration of RAPIDS cuDF into Google Colab, enabling developers to accelerate pandas code execution by up to 50 times on GPU instances.
The article highlights the top data science sessions from NVIDIA GTC 2024, focusing on GPU-accelerated tools and best practices for data scientists.
Belen Tegegn
2 min read
Has Summary
--
The article discusses the announcement of RAPIDS cuDF at NVIDIA GTC 2024, which enables GPU acceleration for 9. 5 million pandas users without any code changes.
The article discusses how to accelerate NetworkX, a popular Python library for graph analytics, using NVIDIA GPUs through the RAPIDS cuGraph project.
The article discusses the importance of data visualization in uncovering insights from large datasets and introduces RAPIDS, a suite of GPU-accelerated libraries that enhance data analytics workflo...
Allan Enemark
9 min read
Includes Code
Has Summary
--
The article discusses how GPU-accelerated data analytics can enhance machine learning (ML) projects by improving speed and scalability.
Jay Rodge
14 min read
Includes Code
Has Summary
--
The article discusses the integration of distributed deep learning with Apache Spark 3. 4, highlighting new built-in APIs for both distributed model training and inference.
Lee Yang
6 min read
Includes Code
Has Summary
--
The article discusses how NVIDIA's RAPIDS cuDF can significantly accelerate data analytics workflows, particularly in exploratory data analysis (EDA).
Prachi Goel
11 min read
Includes Code
Has Summary
--
The article discusses how RAPIDS cuDF can significantly accelerate time series data analysis, providing speed improvements of up to 40x compared to traditional pandas workflows.
Prachi Goel
9 min read
Includes Code
Has Summary
--
This article provides a comprehensive guide on deploying machine learning models on Google Cloud Platform (GCP).
AutoMLAWSAzureFlaskGoogle CloudGoogle Cloud FunctionsGoogle Cloud StorageHTMLIrisMachine LearningPandasPythonscikit-learnServerlessVertex AI
Kurtis Pykes
10 min read
Includes Code
Has Summary
--
The article discusses how to accelerate ETL processes on KubeFlow using RAPIDS, a data science framework that leverages GPUs for improved performance.
Jacob Tomlinson
12 min read
Includes Code
Has Summary
--
The article explores NVIDIA TensorRT and its TensorRT Engine Explorer (TREx) tool, designed to optimize deep-learning inference performance by providing insights into engine execution plans and pro...
The article highlights key data science sessions at the NVIDIA GTC conference, showcasing innovative approaches and technologies in the field.
This article discusses the importance of efficient memory layouts and memory pools in machine learning frameworks to enhance interoperability and performance.
Christian Hundt
9 min read
Includes Code
Has Summary
--
The article discusses how to create an interactive visualization dashboard using Plotly Dash and RAPIDS, capable of handling datasets with over 300 million rows.
The article discusses the input and output configurability of the RAPIDS cuML machine learning library, highlighting its support for various data formats and the benefits of using GPU memory for pe...
Dante Gama Dessavre
11 min read
Includes Code
Has Summary
--
The article discusses the Gauss rank transformation technique, which significantly enhances the training of neural networks by converting input data into a Gaussian distribution.
The article discusses advancements in AutoML using NVIDIA GPUs and RAPIDS, highlighting how AutoGluon simplifies the process of achieving state-of-the-art machine learning accuracy while significan...
AutoMLAWSAWS EC2CatBoostDeep LearningGoogle AutoMLLightGBMMachine LearningPandasPythonscikit-learnXGBoost
Carol McDonald
15 min read
Includes Code
Has Summary
--
The article discusses UCX-Py, an accelerated networking library that enhances communication performance for Python applications, particularly in the context of GPU and distributed computing.
Belen Tegegn
9 min read
Includes Code
Has Summary
--
The article discusses the advancements in Natural Language Processing (NLP) and text processing using RAPIDS, emphasizing performance improvements in string processing with cuDF and cuML.
Vibhu Jawa
6 min read
Includes Code
Has Summary
--
This article discusses the winning solution by NVIDIA's team in the Booking.
Deep LearningGRUMachine LearningNatural Language ProcessingPandasPyTorchSpringTensorFlowTransformerTransformers
Carol McDonald
20 min read
Includes Code
Has Summary
--
Cloudera and NVIDIA have partnered to enhance data analytics and AI capabilities at scale, enabling organizations to process large datasets efficiently without modifying existing code.
Scott McClellan
4 min read
Has Summary
--
This article serves as a beginner's guide to using GPU-accelerated DataFrames with Python Pandas through the RAPIDS cuDF library.
Tom Drabas
8 min read
Includes Code
Has Summary
--
The article discusses the NVIDIA Tools Extension API (NVTX), an annotation tool designed for profiling code in Python and C/C++.
This article serves as an introductory guide to the RAPIDS ecosystem, focusing on GPU-accelerated DataFrames in Python through cuDF.
ApacheApache ArrowAWSAWS S3AzureBERTDeep LearningJSONMachine LearningNetworkXNumPyPandasPythonscikit-learnSQL
Tom Drabas
7 min read
Includes Code
Has Summary
--
The article discusses the integration of RAPIDS and whylogs for monitoring high-performance machine learning models.
Bernease Herman
6 min read
Includes Code
Has Summary
--
The article discusses the application of machine learning (ML) to predict loan delinquencies, emphasizing the importance of model explainability and the benefits of GPU acceleration in enhancing pr...
Mark J. Bennett
15 min read
Includes Code
Has Summary
--
The article discusses the acceleration of single-cell genomic analysis using RAPIDS, a suite of open-source libraries that leverage GPU acceleration to enhance data science workflows.
Avantika Lal
6 min read
Includes Code
Has Summary
--
The article discusses the development of an interactive COVID-19 visualization dashboard using modified 2010 US Census data, leveraging Plot. ly Dash and RAPIDS cuDF for GPU acceleration.
This article introduces a framework within RAPIDS cuDF that allows the compilation of Python user-defined functions (UDFs) into native CUDA kernels, leveraging the Numba compiler and Jitify library.
NVIDIA has released new containers on NGC to assist developers using NVIDIA Jetson Developer Kits.
Nefi Alarcon
1 min read
Has Summary
--
The article discusses the GPU Open Analytics Initiative (GOAI), which aims to create open frameworks for GPU-accelerated data analytics.
The GPU Open Analytics Initiative (GOAI) aims to create common data frameworks that enhance data science on GPUs, enabling seamless data interchange among applications.
You've reached the end! All 42 articles loaded.