We made Workers KV up to 3x faster — here’s the data

Thomas Gauvin

Cloudflare

•

Thomas Gauvin

•9 min read•advanced•

--

•View Original

HTMLRust

Overview

The article discusses significant performance improvements to Cloudflare's Workers KV, achieving up to 3x faster hot reads and reducing overall KV operation latency by up to 20 milliseconds. It details the architectural changes made to optimize latency and enhance caching strategies, ultimately benefiting various Cloudflare services reliant on KV.

What You'll Learn

1

How to optimize Workers KV for reduced latency

2

Why bypassing Front Line layers can save processing time

3

How to implement tiered caching strategies for better performance

4

When to utilize in-memory caching for high-frequency keys

Prerequisites & Requirements

Understanding of caching concepts and distributed systems
Familiarity with Cloudflare Workers and KV(optional)

Key Questions Answered

How much faster are KV hot reads after the optimization?

KV hot reads are now up to 3x faster, with all KV operations improved by up to 20 milliseconds. This enhancement is crucial for reducing latency in applications that rely on Workers KV for data retrieval.

What architectural changes were made to Workers KV?

The architecture was optimized by bypassing the Front Line layers for direct connections between Workers and KV, and implementing a tiered cache system to minimize requests to centralized storage backends, significantly improving performance.

What is the impact of tiered caching on KV operations?

With tiered caching, approximately 30% of requests are resolved in regional caches instead of reaching centralized storage backends, which reduces latency and improves response times for users across different regions.

How does the in-memory cache improve performance for Workers KV?

The in-memory cache allows the KV Worker to resolve the hottest keys, which account for nearly half of all requests, in under a millisecond, eliminating the need for additional network hops and significantly speeding up response times.

Key Statistics & Figures

KV hot reads speed improvement

up to 3x faster

This improvement directly enhances the performance of applications relying on Workers KV.

Latency reduction for KV operations

up to 20 ms

This reduction contributes to overall faster response times for users.

Requests resolved in tiered cache

~30%

This statistic highlights the effectiveness of the tiered caching strategy in reducing load on centralized storage.

Response time for hottest keys

under 1 ms

This performance is achieved through the use of in-memory caching for frequently accessed keys.

Technologies & Tools

Backend

Cloudflare Workers

Used to execute serverless functions that interact with Workers KV.

Database

Kv

Key-value storage used for configuration, routing, and asset serving.

Caching

Tiered Cache

Caching mechanism to improve read performance by resolving requests closer to users.

Key Actionable Insights

1
Implement direct connections from Workers to KV Workers to reduce latency.
By bypassing unnecessary processing layers, you can achieve faster read operations, which is especially beneficial for applications with high traffic and low latency requirements.

2
Utilize tiered caching to improve cache hit rates and reduce load on storage backends.
This strategy is effective for applications that experience variable traffic patterns, as it allows for faster data retrieval from regional caches.

3
Consider using in-memory caching for frequently accessed keys.
This approach can drastically improve performance for high-frequency requests, making it suitable for applications that require real-time data access.

Common Pitfalls

1

Over-relying on centralized storage backends can lead to increased latency.

This happens when applications do not utilize caching effectively, resulting in unnecessary network calls that slow down response times. To avoid this, implement tiered caching and in-memory strategies.

Related Concepts

Caching Strategies

Distributed Systems Architecture

Performance Optimization Techniques