Meta

Reel Friends: Building Social Discovery that Scales to Billions

Building a social discovery system that efficiently surfaces Reels watched and reacted to by friends while scaling to billions of users.

caching distributed-systems
5 min
Cloudflare

When DNSSEC goes wrong: how we responded to the .de TLD outage

When DENIC published invalid DNSSEC signatures for the .de TLD, DNS resolvers like 1.1.1.1 faced a critical decision: reject all .de domain queries due to signature validation failures or serve potentially stale cached responses to maintain availability.

caching distributed-systems
4 min
Netflix

Stop Answering the Same Question Twice: Interval-Aware Caching for Druid at Netflix Scale

At massive scale (10+ trillion rows, 15M events/second), repeated identical queries were consuming excessive resources and degrading query latency.

caching databases
5 min
Cloudflare

Introducing Flagship: feature flags built for the age of AI

Third-party feature flag services introduce unacceptable latency for applications requiring sub-millisecond flag evaluation at global scale.

caching distributed-systems
4 min
Cloudflare

Redirects for AI Training enforces canonical content

AI crawlers were ingesting deprecated and non-canonical content despite soft directives like robots.txt, which called for a way to enforce canonical versions without modifying origin infrastructure.

caching security
4 min
Cloudflare

Shared Dictionaries: compression that keeps up with the agentic web

Web pages are growing larger and slower to load as dynamic content increases, and they need compression techniques that can adapt to modern agentic web patterns.

caching api-design
3 min
AWS

Build a multi-tenant configuration system with tagged storage patterns

Building a scalable multi-tenant configuration service that maintains strict tenant isolation while supporting real-time updates without cache staleness or downtime.

caching storage-systems
5 min
Cloudflare

Why we're rethinking cache for the AI era

CDN cache systems were designed for human traffic patterns but struggle with the distinct access patterns of AI bot traffic, which now represents over 10 billion requests per week and threatens cache efficiency.

caching distributed-systems
4 min
LinkedIn

Optimizing LinkedIn Sales Navigator’s search pipeline with Spark

LinkedIn Sales Navigator's search pipeline had latency issues as query complexity and data volume grew.

search caching
3 min
Cloudflare

Launching Cloudflare’s Gen 13 servers: trading cache for cores for 2x edge compute performance

Cloudflare needed to significantly increase edge compute throughput per server but faced a tradeoff where high-core-count CPUs came with smaller per-core L3 cache, risking latency penalties for cache-dependent workloads.

distributed-systems caching
4 min