How NVIDIA Uses Numba
36 engineering articles about Numba from NVIDIA's engineering team
Other NVIDIA Technologies
Articles
Filter:
The article discusses Autodesk Research's development of the Accelerated Lattice Boltzmann (XLB) library, which enhances computational fluid dynamics (CFD) performance using NVIDIA's Warp and GH200...
The article discusses the significant updates in CUDA Toolkit 13.
Jonathan Bentz
18 min read
Includes Code
Has Summary
--
The article discusses how to leverage GPU acceleration for algorithmic trading simulations using Numba, highlighting the significant performance improvements achievable—over 100x faster simulations.
The article discusses Numbast, a tool that enables Python developers to write CUDA kernels similarly to C++.
The article discusses how to build a zero-copy AI sensor processing pipeline using OpenCV within the NVIDIA Holoscan SDK.
Meiran Peng
7 min read
Includes Code
Has Summary
--
The article discusses the advancements in single-cell RNA sequencing analysis using the RAPIDS-singlecell library, which leverages GPU acceleration to significantly enhance performance.
Severin Dicks
13 min read
Includes Code
Has Summary
--
This article discusses the challenges of debugging in a mixed Python and C language stack, particularly in the context of the RAPIDS project.
The article discusses RAPIDS RAFT, a library designed to optimize machine learning and data analytics on GPUs by providing reusable computational patterns.
Corey Nolet
11 min read
Includes Code
Has Summary
--
The article discusses the increasing computational demands for AI processing at the edge and introduces the NVIDIA Holoscan SDK v0.
The article discusses the importance of multidimensional image processing for enhanced image analysis, particularly in fields like medical imaging and remote sensing.
Michael Boone
5 min read
Has Summary
--
The article discusses the improved interoperability between NVIDIA Vision Programming Interface (VPI) and PyTorch, focusing on how VPI can enhance object detection and tracking in computer vision a...
This article provides a comprehensive guide on writing ray tracing applications in Python using the Numba extension for PyOptiX.
The article discusses the latest enhancements in user-defined functions (UDFs) within the NVIDIA cuDF API, highlighting how these improvements can accelerate the development process and enhance per...
This article provides a weekly roundup of the latest software updates, learning resources, events, and notable news from NVIDIA. Key highlights include the release of nvCOMP v2. 2.
Michelle Horton
3 min read
Has Summary
--
This article discusses how to accelerate portfolio construction algorithms using Numba and Dask in Python, achieving up to 800x speed improvements on GPUs.
The article discusses how Munich Re Markets leverages interpretable machine learning to enhance portfolio construction strategies in the Life and Pension industry.
Machine Learning Frameworks Interoperability, Part 3: Zero-Copy in Action using an E2E Pipeline
This article discusses the implementation of an end-to-end pipeline utilizing zero-copy techniques for efficient data transfer across various machine learning frameworks.
Christian Hundt
7 min read
Has Summary
--
This article discusses the importance of efficient memory layouts and memory pools in machine learning frameworks to enhance interoperability and performance.
Christian Hundt
9 min read
Includes Code
Has Summary
--
The article discusses the input and output configurability of the RAPIDS cuML machine learning library, highlighting its support for various data formats and the benefits of using GPU memory for pe...
Dante Gama Dessavre
11 min read
Includes Code
Has Summary
--
This article discusses how to accelerate Blender Python using CUDA, focusing on the generation of synthetic data and the performance improvements achieved through matrix multiplication optimization.
This article discusses how to accelerate sequential Python User-Defined Functions (UDFs) using RAPIDS on GPUs, achieving speedups of up to 100x.
The article introduces rapidAligner, a CUDA-accelerated library designed for efficient alignment of time series data.
Christian Hundt
10 min read
Has Summary
--
The article discusses cuCIM, a new RAPIDS library designed for accelerated n-dimensional image processing and image I/O on GPUs.
AlbumentationsApacheDaskDeep LearningITKJavaJAXNumbaNumPyOpenCVPythonPyTorchscikit-imageSciPySimpleITK
Gigon Bae
6 min read
Includes Code
Has Summary
--
The article discusses NVIDIA's efforts to unify the CUDA Python ecosystem, enhancing the developer experience by providing standardized low-level interfaces for accessing CUDA APIs from Python.
This article serves as a beginner's guide to using GPU-accelerated DataFrames with Python Pandas through the RAPIDS cuDF library.
Tom Drabas
8 min read
Includes Code
Has Summary
--
The article discusses cuSignal, a library designed to accelerate signal processing using GPU technology.
The article discusses the challenges of memory allocation in NVIDIA CUDA and introduces the RAPIDS Memory Manager (RMM) as a solution.
Mark Harris
23 min read
Includes Code
Has Summary
--
The GTC HPC conference, occurring from October 5-9, will highlight advancements in High-Performance Computing (HPC) and GPU technologies.
This article introduces a framework within RAPIDS cuDF that allows the compilation of Python user-defined functions (UDFs) into native CUDA kernels, leveraging the Numba compiler and Jitify library.
The article discusses Numba, a just-in-time compiler used in RAPIDS cuDF, which transforms user-defined Python functions into CUDA kernels.
This article discusses how to enhance the performance of Python for exotic option pricing using GPU libraries.
NVIDIA is offering discounted training through its Deep Learning Institute during the GTC Digital event, providing developers, researchers, data scientists, and students with hands-on, instructor-l...
Nefi Alarcon
2 min read
Has Summary
--
This article introduces GPU accelerated Python for financial services, focusing on the challenges and solutions in quantitative finance.
The article discusses how the NVIDIA DGX-2 system, utilizing accelerated Python libraries and NVIDIA CUDA-X AI software, has achieved a groundbreaking 6,000x acceleration in backtesting trading alg...
The article discusses the GPU Open Analytics Initiative (GOAI), which aims to create open frameworks for GPU-accelerated data analytics.
The article introduces Numba, a Python compiler that enables high-performance computing by compiling Python code for execution on CUDA-capable GPUs and multicore CPUs.
Mark Harris
8 min read
Includes Code
Has Summary
--
You've reached the end! All 36 articles loaded.