#
C++ Programming Tutorials & Engineering Articles
41 C++ tutorials, guides, and engineering insights from NVIDIA, Meta, and ClickHouse
Companies Using This
C++ Articles & Tutorials
Filter:
The article discusses the significant updates in CUDA Toolkit 13.
Jonathan Bentz
18 min read
Includes Code
Has Summary
--
This article explores profit and loss modeling on GPUs using ISO C++ language parallelism, building on previous work with Black-Scholes simulations.
Ioana Boier
9 min read
Includes Code
Has Summary
--
Logarithm is a serverless, multitenant logging engine developed internally at Meta to enhance AI training workflows and services.
Partha Kanuparthy
14 min read
Has Summary
--
This article explores the implementation and optimization of hash tables in ClickHouse, focusing on zero-cost abstractions in modern C++.
NVIDIA has released CUDA Toolkit 12. 0, marking its first major update in years, focusing on enhanced programming models and application acceleration through new hardware capabilities.
The article discusses the author's experiments in building ClickHouse, a fast columnar SQL DBMS, on an Android phone.
This article discusses multi-GPU programming using Standard Parallel C++, focusing on the advantages of utilizing parallelism in C++ for accelerated computing.
The article discusses MatX, a GPU-accelerated numerical computing C++ library that allows developers to write high-performance code with a Python-like syntax.
The article discusses the advantages of using standard language parallelism for accelerated computing on NVIDIA platforms, emphasizing the productivity and portability of programming with ISO C++, ...
This article discusses the implementation and usage of sstable-to-arrow, a tool designed to convert SSTable data from Cassandra into Arrow format for GPU-based analytics.
NVIDIA has announced the availability of the Arm HPC Developer Kit, which integrates hardware and software for high-performance computing, AI, and scientific applications on Arm server platforms.
Facebook has increasingly embraced Rust, a programming language known for its performance and safety, by joining the Rust Foundation and expanding its internal Rust development team.
NVIDIA has announced the release of the HPC SDK version 21. 3, which is now available for free download.
The NVIDIA HPC SDK 20. 11 update introduces new features and enhancements for high-performance computing developers, including support for automatic GPU acceleration and new libraries.
libcu++, NVIDIA's open-source C++ Standard Library, facilitates seamless integration between CPU and GPU code. The latest version, 1. 4.
Nefi Alarcon
1 min read
Has Summary
--
The article discusses the challenges of memory allocation in NVIDIA CUDA and introduces the RAPIDS Memory Manager (RMM) as a solution.
Mark Harris
23 min read
Includes Code
Has Summary
--
The article discusses the NVIDIA Ampere architecture and the new data movement controls introduced in CUDA 11.
The NVIDIA HPC SDK is a comprehensive suite of compilers, libraries, and tools designed for high-performance computing (HPC) developers, enabling them to program across the entire HPC platform.
The article discusses how NVIDIA's NVC++ compiler enables GPU acceleration of Standard C++ code without the need for language extensions or non-standard libraries.
David Olsen
19 min read
Includes Code
Has Summary
--
The PGI Community Edition 19. 10 introduces support for NVIDIA V100 Tensor Cores in CUDA Fortran, along with enhancements in C++17 language features and OpenACC 2. 6.
The maps people use on many of our apps to shop, find jobs, support causes, and more are all powered by OpenStreetMap (OSM). OSM is a community-driven project built by mappers all over the world wh…
Saurav Mohapatra
18 min read
Includes Code
--
The article discusses the open-sourcing of F14, a 14-way probing hash table designed for improved performance and memory efficiency in C++.
Nathan Bronson
21 min read
Includes Code
Has Summary
--
The article discusses how SafetyCulture utilizes Envoy and Lua for edge routing, focusing on routing traffic based on application criteria and user characteristics.
The article discusses the latest release of PGI Compilers & Tools, version 19. 1, which enhances support for high-performance computing applications.
This article compares the performance of Cloudflare Workers against AWS Lambda for CPU-bound tasks, particularly focusing on cryptography workloads.
Zack Bloom
6 min read
Has Summary
--
The article announces the release of the PGI Community Edition version 17. 10, highlighting its features and improvements for high-performance computing applications.
PGI 17. 7 introduces enhanced support for Tesla V100 GPUs, OpenACC directives, and CUDA Fortran, aimed at improving performance in high-performance computing (HPC) applications.
The article discusses the development of the Zero protocol, aimed at enhancing mobile connection security and performance for Facebook's Android and iOS apps.
Kyle Nekritz
17 min read
Includes Code
Has Summary
--
CUDA 8 introduces significant enhancements to the CUDA compiler toolchain, focusing on compile time improvements, extended lambda support, and runtime compilation features.
Jaydeep Marathe
17 min read
Includes Code
Has Summary
--
Wangle is an asynchronous C++ networking and RPC library designed to facilitate the development of protocols, servers, and clients in a clean and scalable manner.
CUDA 8 introduces significant advancements in NVIDIA's parallel computing platform, including support for the Pascal GPU architecture, enhanced Unified Memory capabilities, and new profiling tools.
Mark Harris
17 min read
Includes Code
Has Summary
--
The article discusses the integration of modern C++ features into CUDA 7. 5, particularly focusing on the Hemi 2 library, which simplifies portable parallel programming for GPUs.
Mark Harris
12 min read
Includes Code
Has Summary
--
The article introduces the NVIDIA OpenACC Toolkit, designed to facilitate GPU computing for scientists and researchers who may lack extensive programming experience.
The article discusses the new features introduced in CUDA Toolkit 7.
Mark Harris
10 min read
Includes Code
Has Summary
--
The article discusses the development and application of RapidCheck, a C++ framework for property-based testing, which automates the generation of test cases to improve software reliability.
The article discusses the implementation of Futures for C++11 at Facebook, highlighting the benefits of asynchronous programming and how it improves performance and code readability.
Hans Fugal
11 min read
Includes Code
Has Summary
--
The article discusses the development of the Moments app, designed to facilitate photo sharing among friends.
Ashwin Bharambe
10 min read
Includes Code
Has Summary
--
The article recaps the @Scale 2014 mobile track, highlighting the challenges and innovations in mobile application development.
Nick Schrock
5 min read
Has Summary
--
The article introduces mcrouter, a memcached protocol router developed by Facebook to enhance the scalability of memcached deployments.
The article discusses the development and open-sourcing of Flint, Facebook's custom linting tool designed for C++ code.
Andrei Alexandrescu
13 min read
Includes Code
Has Summary
--
The article provides three optimization tips for C++ programming, focusing on reducing operation strength, minimizing array writes, and leveraging efficient algorithms.
Andrei Alexandrescu
10 min read
Has Summary
--
You've reached the end! All 41 articles loaded.