How NVIDIA Uses XGBoost
79 engineering articles about XGBoost from NVIDIA's engineering team
Other NVIDIA Technologies
Other Companies Using XGBoost
Articles
Filter:
The article discusses Project Aether, a tool developed by NVIDIA to facilitate the migration of CPU-based Apache Spark workloads to GPU-accelerated environments on Amazon EMR.
Navin Kumar
6 min read
Includes Code
Has Summary
--
The article discusses the integration of XGBoost with Polars DataFrames, emphasizing the benefits of GPU acceleration for machine learning workflows.
Jiaming Yuan
7 min read
Includes Code
Has Summary
--
This article provides insights into GPU-accelerating machine learning model training using CUDA-X Data Science, focusing on tree-based models like XGBoost, LightGBM, and CatBoost.
The article presents a comprehensive playbook developed through extensive experience in Kaggle competitions, detailing seven effective modeling techniques for handling tabular data.
The article discusses the advancements in XGBoost 3. 0, particularly its ability to train with terabyte-scale datasets on a single NVIDIA Grace Hopper Superchip.
This article discusses seven drop-in replacements for popular Python libraries that can significantly speed up data science workflows by leveraging GPU acceleration.
Jamil Semaan
8 min read
Includes Code
Has Summary
--
The article discusses the introduction of cuda-cccl, a Python library that provides high-level building blocks for NVIDIA CUDA kernel fusion, enabling developers to write efficient algorithms witho...
Ashwin Srinath
5 min read
Includes Code
Has Summary
--
NVIDIA utilizes data science and machine learning to enhance chip manufacturing processes, focusing on optimizing workflows through the use of CUDA-X libraries like cuDF and cuML.
Divyansh Jain
8 min read
Includes Code
Has Summary
--
The article discusses the enhancements in the Forest Inference Library (FIL) within NVIDIA cuML 25. 04, focusing on its capabilities for fast inference of tree-based models.
Dante Gama Dessavre
10 min read
Includes Code
Has Summary
--
The article discusses the application of Graph Neural Networks (GNNs) in enhancing fraud detection within financial services.
Naim
10 min read
Includes Code
Has Summary
--
The article discusses the latest enhancements in RAPIDS, including zero-code-change acceleration for Python machine learning, significant IO performance improvements, and out-of-core XGBoost capabi...
ApacheAzureAzure Blob StorageDaskGeminiGoogle CloudGoogle Cloud StorageLightGBMNetworkXPolarsPythonscikit-learnXGBoost
Nick Becker
9 min read
Includes Code
Has Summary
--
The article discusses how Atgenomix SeqsLab leverages NVIDIA technologies to enhance health omics analysis for precision medicine.
Yu-Ting Lin
9 min read
Has Summary
--
The article discusses the use of GPU acceleration to enhance performance in Apache Spark applications, highlighting the challenges of migrating workloads from CPUs to GPUs.
Matt Ahrens
9 min read
Includes Code
Has Summary
--
Kaggle Grandmasters David Austin, Chris Deotte, and Ruchi Bhatia shared insights on their winning strategies for data science competitions at the Google Cloud Next conference.
Jenn Yonemitsu
9 min read
Has Summary
--
The article discusses how feature engineering, particularly using NVIDIA cuDF-pandas for GPU acceleration, can significantly enhance model accuracy in Kaggle competitions involving tabular data.
The article discusses the strategies employed by the winners of the NVIDIA hackathon at ODSC West, focusing on how they utilized RAPIDS Python APIs to enhance machine learning workflows.
The article discusses the integration of CUDA-accelerated Homomorphic Encryption into Federated XGBoost, enhancing data privacy and security in federated learning environments.
Ziyue Xu
10 min read
Includes Code
Has Summary
--
The article discusses best practices for multi-GPU data analysis using RAPIDS with Dask, emphasizing the need for efficient memory management and accelerated networking.
The article discusses how NVIDIA RAPIDS can enhance causal inference on large datasets by leveraging GPU acceleration, specifically through the integration of the cuML library with the DoubleML fra...
Nick Becker
4 min read
Includes Code
Has Summary
--
The article discusses the practical implementation of Federated XGBoost using NVIDIA FLARE, highlighting its capabilities for concurrent training, fault tolerance, and experiment tracking.
Yuan-Ting Hsieh
5 min read
Includes Code
Has Summary
--
The article introduces five new technical courses offered by NVIDIA aimed at enhancing skills in AI and data science.
ApacheApache ArrowApache SparkComputer VisionNatural Language ProcessingPrompt EngineeringPyTorchTransformerTransformersXGBoost
Rachel Ho
4 min read
Has Summary
--
This article provides a comprehensive guide on leveraging RAPIDS for GPU-accelerated data processing on Databricks.
The article discusses the rapid adoption of federated learning (FL) and the new features introduced in NVIDIA FLARE 2. 4.
AWSAzureFederated LearningGPTGraph Neural NetworksgRPCHugging FaceMachine LearningNeural NetworksPyTorchXGBoost
Chester Chen
15 min read
Includes Code
Has Summary
--
The article discusses the integration of Metaflow and NVIDIA Triton Inference Server for developing and deploying machine learning models.
Eddie Mattia
12 min read
Includes Code
Has Summary
--
The article discusses the collaboration between H2O. ai and NVIDIA to enhance AI applications in financial services through generative AI and predictive analytics.
The article discusses how to optimize multi-GPU model training using Dask and XGBoost, addressing common challenges such as out-of-memory errors.
The article discusses how GPU-accelerated data analytics can enhance machine learning (ML) projects by improving speed and scalability.
Jay Rodge
14 min read
Includes Code
Has Summary
--
The article discusses the application of federated learning to traditional machine learning methods, highlighting its advantages in communication efficiency and the ability to train models collabor...
Kris Kersten
3 min read
Has Summary
--
This article discusses the use of time-series models, specifically autoregressive recursive neural networks and XGBoost, for predicting credit defaults.
Jiwei Liu
11 min read
Includes Code
Has Summary
--
The article discusses the new capability of XGBoost 1. 7 to handle categorical features without manual encoding, which simplifies the training and inference processes for machine learning models.
Chris Jarrett
5 min read
Includes Code
Has Summary
--
The article discusses NVIDIA FLARE 2. 2, an open-source platform for federated learning that introduces new features aimed at reducing development time and enhancing deployment efficiency.
The article discusses the importance of explainability in machine learning models, particularly through the use of SHAP (SHapley Additive Explanations) and its GPU-accelerated variant, GPUTreeShap.
Parul Pandey
14 min read
Includes Code
Has Summary
--
The article discusses how Graph Neural Networks (GNNs) and NVIDIA GPUs can optimize fraud detection in financial services.
Ashish Sardana
21 min read
Includes Code
Has Summary
--
The article discusses the challenges of deploying AI models in production and how NVIDIA Triton Inference Server addresses these challenges.
Shankar Chandrasekaran
11 min read
Includes Code
Has Summary
--
At GTC 2022, NVIDIA unveiled significant advancements in AI frameworks, focusing on real-time speech AI, cybersecurity, and medical AI applications.
Erik Pounds
11 min read
Has Summary
--
The article discusses how AT&T leveraged GPUs to optimize their data pipelines, focusing on improving speed, cost, and efficiency across various stages of the data-to-AI pipeline.
Mark Austin
9 min read
Includes Code
Has Summary
--
NVIDIA AI Enterprise 2. 1 has been released, providing an optimized software suite for deploying and scaling AI applications across various environments.
Phoebe Lee
4 min read
Has Summary
--
The article discusses Accelerated WEKA, a project that integrates GPU acceleration into the WEKA machine learning software using RAPIDS libraries.
Albert Bifet
11 min read
Has Summary
--
This article provides a comprehensive step-by-step guide for building a machine learning application using RAPIDS, a suite of open-source software libraries that leverage GPU acceleration.
Paul Mahler
10 min read
Includes Code
Has Summary
--
The article discusses a new partnership between NVIDIA and Google Cloud that simplifies the deployment of Jupyter Notebooks on Google Cloud using a one-click deploy feature in the NVIDIA NGC catalo...
Chintan Patel
3 min read
Has Summary
--
The article discusses the challenges of deploying AI workloads at scale and how Bottlerocket, a Linux-based container-optimized OS developed by AWS, can be utilized with NVIDIA-powered Amazon EC2 i...
Uttara Kumar
3 min read
Has Summary
--
The article discusses the deployment of tree-based models like XGBoost and LightGBM using the NVIDIA Triton Inference Server, emphasizing its capabilities for real-time serving and GPU acceleration.
William Hicks
7 min read
Has Summary
--
The article discusses the European Union's Artificial Intelligence Act and its implications for high-risk AI systems, particularly in credit risk management.
Jochen Papenbrock
12 min read
Has Summary
--
The NVIDIA Deep Learning Institute (DLI) offers hands-on training for building AI applications focused on anomaly detection.
Josh Wyatt
5 min read
Has Summary
--
This article explores the comparison between deep learning and machine learning models for predicting default risk, emphasizing the importance of explainability in model predictions.
Emanuel Scoullos
17 min read
Includes Code
Has Summary
--
The article discusses the latest innovations in Machine Learning, Graphics, HPC, and IoT showcased at AWS re:Invent, highlighting collaborations between NVIDIA and AWS.
Geoff Murase
3 min read
Has Summary
--
The article discusses NVIDIA's vision for achieving multi-Million-X speedups in computational performance, which could significantly enhance data-intensive research across various fields.
Joseph Chandler
6 min read
Has Summary
--
The article discusses the NVIDIA Triton Inference Server, an open-source platform designed for fast and scalable AI model deployment.
AWSAzureBERTDockerGoogle CloudGPTKubernetesLightGBMPythonPyTorchTensorFlowTransformerTransformersVertex AIXGBoost
Shankar Chandrasekaran
11 min read
Has Summary
--
NVIDIA GTC is set to showcase over 500 sessions from November 8-11, focusing on the latest advancements in AI and deep learning.
Jay Rodge
4 min read
Has Summary
--
The article discusses how Munich Re Markets leverages interpretable machine learning to enhance portfolio construction strategies in the Life and Pension industry.