#

Caching Programming Tutorials & Engineering Articles

157 Caching tutorials, guides, and engineering insights from Netflix, Pinterest, Shopify, and more

Caching Articles & Tutorials

Filter:
ClickHouse logo
ClickHouse
Intermediate
The article discusses how PeerDB facilitates large-scale PostgreSQL migrations, specifically achieving a 1TB migration in just 2 hours.
15 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses the collaboration between NVIDIA and Black Forest Labs to optimize the FLUX. 2 text-to-image model for NVIDIA Blackwell Data Center GPUs.
Sandro Cavallari
8 min read
Includes Code
Has Summary
--
OpenAI logo
OpenAI
Advanced
OpenAI details how they scaled PostgreSQL to support 800 million ChatGPT users, achieving millions of queries per second through a single-primary architecture with nearly 50 read replicas across mu...
Bohan Zhang
13 min read
Has Summary
--
Slack logo
Slack
Advanced
Slack's build pipeline team reduced build times for Quip and Slack Canvas from 60 minutes to as little as 10 minutes by applying classic software engineering principles—separation of concerns, cach...
David Reed
19 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Advanced
Netflix redesigned Maestro's internal workflow engine, replacing the legacy Conductor 2. x-based stateless worker model with a custom stateful actor model built on Java 21 virtual threads.
Netflix Technology Blog
25 min read
Has Summary
--
Netflix logo
Netflix
Advanced
This article discusses how Netflix built a resilient data platform using a Write-Ahead Log (WAL) to address data consistency, reliability, and operational efficiency challenges at scale.
Netflix Technology Blog
15 min read
Includes Code
Has Summary
--
Pinterest logo
Pinterest
Intermediate
The article discusses Pinterest's journey in enhancing developer experience through the creation of PinConsole, an Internal Developer Platform built on Backstage.
Pinterest Engineering
15 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the evolution of Netflix's Tudum architecture, transitioning from a CQRS model utilizing Kafka to a more efficient system based on RAW Hollow.
Netflix Technology Blog
8 min read
Has Summary
--
ClickHouse logo
ClickHouse
Intermediate
The article discusses the transition of ClickHouse Cloud to a fully stateless compute architecture, enabled by the introduction of a Shared Catalog.
Tom Schreiber
21 min read
Includes Code
Has Summary
--
ClickHouse logo
ClickHouse
Intermediate
This article discusses the development of a distributed cache for ClickHouse Cloud, aimed at providing low-latency access to hot data across compute nodes.
Tom Schreiber
23 min read
Includes Code
Has Summary
--
Google logo
Google
Beginner
The article discusses the introduction of implicit caching support in Gemini 2. 5 models, enabling developers to benefit from significant cost savings without needing to create an explicit cache.
Logan Kilpatrick
2 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses the importance of structuring application prompts to enhance the security of key-value (KV) caching in large language model (LLM) applications.
Joseph Lucas
11 min read
Includes Code
Has Summary
--
Shopify logo
Shopify
Beginner
The article discusses Mobile Bridge, a framework developed by Shopify to enhance WebViews in their mobile app, making them feel more native.
Mauricio de Meirelles
8 min read
Has Summary
--
Palantir logo
Palantir
Intermediate
The article discusses the requirements and best practices for deploying AI in production within the insurance underwriting sector.
NVIDIA logo
NVIDIA
Intermediate
The article introduces new KV cache reuse optimizations in NVIDIA TensorRT-LLM, focusing on improving memory management and throughput for large language models (LLMs).
John Thomson
7 min read
Includes Code
Has Summary
--
NVIDIA logo
NVIDIA
Intermediate
The article discusses the implementation of data-efficient knowledge distillation using NVIDIA NeMo-Aligner during supervised fine-tuning (SFT).
Anna Shors
5 min read
Has Summary
--
NVIDIA logo
NVIDIA
Advanced
The article discusses how NVIDIA TensorRT-LLM enhances the inference throughput of Meta's Llama 3. 3 70B model by up to 3x through optimizations like speculative decoding and KV caching.
Anjali Shah
8 min read
Includes Code
Has Summary
--
ClickHouse logo
ClickHouse
Beginner
ClickHouse version 24.
The ClickHouse Team
15 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
Netflix's TimeSeries Data Abstraction Layer is designed to efficiently store and query vast amounts of temporal event data with low latency.
Netflix Technology Blog
22 min read
Includes Code
Has Summary
--
Pinterest logo
Pinterest
Advanced
The article discusses the Structured DataStore (SDS), a unified multi-model data management platform developed by Pinterest.
Pinterest Engineering
18 min read
Includes Code
Has Summary
--
Uber logo
Uber
Intermediate
The article discusses Preon, a microservice developed by Uber for intelligent and efficient query analysis using the Presto SQL engine.
Gurmeet Singh
13 min read
Has Summary
--
Pinterest logo
Pinterest
Intermediate
The article discusses Pinterest's implementation of feature caching in their recommender systems using Cachelib, an in-process caching engine developed by Meta Open Source.
Pinterest Engineering
11 min read
Has Summary
--
Anthropic logo
Anthropic
Advanced
The article introduces Contextual Retrieval, a method that enhances Retrieval-Augmented Generation (RAG) by improving the retrieval step through Contextual Embeddings and Contextual BM25.
11 min read
Includes Code
Has Summary
--
Meta logo
Meta
Intermediate
The article discusses how Meta has optimized the deployment of its AI-generated image animation feature to serve billions of users efficiently.
Gaurav Sharma
11 min read
Has Summary
--
Uber logo
Uber
Advanced
This article discusses Uber's migration of its Apache Hadoop-based data lake to Google Cloud Storage (GCS) and the security measures implemented during this transition.
Matt Mathew, Alexander Gulko, Lei Sun, KK Sriramadhesikan, Alan Cao, Omkar Kakade
20 min read
Includes Code
Has Summary
--
Pinterest logo
Pinterest
Advanced
The article discusses Pinterest's adoption of TiDB as a replacement for HBase, detailing the motivations, selection methodology, and the journey of integrating TiDB into their infrastructure.
Pinterest Engineering
19 min read
Has Summary
--
Cloudflare logo
Cloudflare
Beginner
The article announces the General Availability of AI Gateway, a unified interface for managing and scaling generative AI workloads.
Kathy Liao
6 min read
Includes Code
Has Summary
--
Pinterest logo
Pinterest
Intermediate
The article discusses Pinterest's transition from HBase, its first NoSQL datastore, to a new serving architecture with a unified storage service.
Pinterest Engineering
7 min read
Has Summary
--
Google logo
Google
Intermediate
The article recaps the Google I/O 2024 event, highlighting advancements in AI technologies aimed at making AI accessible for developers.
Notion logo
Notion
Intermediate
Notion has significantly improved the launch speed of its Android app, making it more than twice as fast compared to the beginning of 2023.
Karn Saheb
11 min read
Includes Code
Has Summary
--
Uber logo
Uber
Intermediate
The article delves into Uber's comprehensive accounting data testing strategies, emphasizing the importance of precision and integrity in financial processes.
Onkar Singh, Harsha Aditya Ravuri, Viswanath Ramakkagari, Aditya Gopisetti, Hari Srinivasan
16 min read
Has Summary
--
Uber logo
Uber
Advanced
The article discusses how Uber serves over 40 million reads per second from its online storage using an integrated caching solution called CacheFront.
Preetham Narayanareddy, Eli Pozniansky, Zurab Kutsia, Afshin Salek, Piyush Patel
19 min read
Has Summary
--
Cloudflare logo
Cloudflare
Advanced
Cloudflare celebrated its 13th birthday with a series of announcements aimed at enhancing its services for customers and the broader internet community.
NVIDIA logo
NVIDIA
Advanced
This article discusses how to build a distributed inference cache using NVIDIA Triton and Redis, highlighting the benefits and drawbacks of local versus distributed caching.
Steve Lorello
12 min read
Includes Code
Has Summary
--
Cursor logo
Cursor
Intermediate
The article discusses the concept of 'prompt design' and draws parallels between prompting in AI and web design.
Arvid
6 min read
Includes Code
Has Summary
--
Pinterest logo
Pinterest
Intermediate
The article discusses Pacer, Pinterest's new asynchronous computing platform designed to address the limitations of its predecessor, Pinlater.
Pinterest Engineering
9 min read
Has Summary
--
Uber logo
Uber
Intermediate
This article discusses Uber's implementation of a local caching solution for HDFS DataNodes to optimize performance while adopting high-density HDDs.
Chen Liang, Jing Zhao, Yangjun Zhang, Junyan Guo, Fengnan Li
19 min read
Has Summary
--
Cloudflare logo
Cloudflare
Intermediate
The article announces Cohort #2 of the Workers Launchpad, highlighting the success of the first cohort and introducing 25 new startups selected for the program.
Cloudflare logo
Cloudflare
Advanced
The article discusses how Cloudflare is transitioning its architecture to utilize Cloudflare Workers, aiming to enhance the performance, robustness, and developer experience of its products.
Richard Boulton
23 min read
Includes Code
Has Summary
--
Cloudflare logo
Cloudflare
Advanced
The article discusses the modernization of the build system for Cloudflare Pages, introducing a new beta version that supports updated tools and languages, including Node. js, Python, and Ruby.
Greg Brimble
8 min read
Includes Code
Has Summary
--
ClickHouse logo
ClickHouse
Intermediate
This article discusses the optimization of LZ4 decompression in ClickHouse, highlighting the challenges and solutions to improve performance.
Alexey Milovidov
37 min read
Includes Code
Has Summary
--
LinkedIn logo
LinkedIn
Advanced
The article discusses how LinkedIn reduced the upload of Apache Spark application dependencies by 99% through the implementation of a user-level caching mechanism.
LinkedIn Engineering Team
10 min read
Has Summary
--
ClickHouse logo
ClickHouse
Intermediate
The article introduces the ClickHouse Query Cache, a new feature designed to enhance performance by caching the results of expensive SELECT queries.
Robert Schulze
10 min read
Includes Code
Has Summary
--
ClickHouse logo
ClickHouse
Beginner
ClickHouse Release 23. 1 introduces significant enhancements including 17 new features, 17 performance optimizations, and 78 bug fixes.
The ClickHouse Team
9 min read
Includes Code
Has Summary
--
Shopify logo
Shopify
Advanced
This article discusses the transition from Ruby's Marshal serialization to MessagePack for caching in Rails applications.
Chris Salzberg
19 min read
Includes Code
Has Summary
--
Shopify logo
Shopify
Advanced
The article discusses the critical role of caching in Rails applications and the inherent risks associated with using Ruby's Marshal for serialization.
Chris Salzberg
12 min read
Includes Code
Has Summary
--
Uber logo
Uber
Intermediate
The article discusses Uber's implementation of Alluxio local caching to enhance the performance of Presto, a data analytics engine.
Chen Liang, Beinan Wang
12 min read
Has Summary
--
Cloudflare logo
Cloudflare
Advanced
This article discusses the integration of Cloudflare Workers with micro-frontends, presenting a fragments architecture that enhances web application performance and scalability.
Uber logo
Uber
Advanced
The article discusses uBuild, Uber's platform for building container images efficiently and securely.
Rasmus Vestergaard, Andreas Lykke
12 min read
Has Summary
--
Shopify logo
Shopify
Intermediate
The article summarizes key talks from RailsConf 2022, highlighting insights from Shopify engineers on various topics related to Ruby on Rails, performance improvements, and open-source contribution...