Scaling oncology patient support: How New York Cancer and Blood Specialists transformed customer experience with AWS and Pronetx, now part of Caylent
NYCBS needed to modernize their patient engagement and contact center infrastructure to improve patient enrollment and streamline communication with oncology patients.
Sitar-agent: Building a reliable dynamic configuration sidecar at scale
Reliably delivering configuration changes to thousands of Airbnb service instances in Kubernetes, with changes occurring multiple times per minute at scale.
When history fails you, borrow from geography
Building reliable forecasting models for marketplace demand when historical data is unavailable or unreliable due to unprecedented market shocks.
How we reduced core unit boot time from hours to minutes
Firmware updates were causing core servers to take four hours to reboot, creating operational inefficiency and extended downtime.
Your AI bill is out of control. Cloudflare can fix it now.
Uncontrolled spending on API calls to multiple AI providers due to lack of visibility and budget enforcement mechanisms.
Experimenting with TPUs, GKE Managed DRANET, and Multi-cluster Inference Gateway
Ensuring high availability and service continuity when AI inference workloads fail in one region while maintaining access to the service across multiple regions.
Scaling AI Agents: A Step-by-Step Guide to Deploying ADK on GKE Autopilot
Moving AI agents built with Google's Agent Development Kit from local prototypes to production-ready, scalable infrastructure.
Lights Out, Systems On: Validating Instant Power Loss Readiness
Meta needed to validate and ensure their data center infrastructure could survive instantaneous power loss without data corruption or service degradation.
Coding Is No Longer the Constraint: Scaling Developer Experience to Teams and Agents at Spotify
Scaling developer productivity and experience when coding is no longer the primary bottleneck, requiring infrastructure and tooling that enable both human teams and AI agents to work effectively.
How we built Cloudflare's data platform and an AI agent on top of it
Cloudflare needed to unify fragmented analytics data across its global edge network and enable intelligent querying of that data at scale.
Iran's Internet is partially restored, Cloudflare Radar data shows
How to detect and monitor large-scale Internet shutdowns and measure the extent of network restoration in real-time across a country.
Beyond code generation: rethinking engineering productivity in the age of AI agents
How to transition from code-generation AI tools that only assist engineers to autonomous agentic systems capable of executing complete, scoped engineering tasks independently.
Supercharge your integration workflow with the Google Pay & Wallet Developer MCP server
Developers integrating with Google Pay & Wallet APIs experienced friction by having to context-switch between their IDE and external documentation/tools to validate implementations and manage accounts.
A Guide to AI Cold Starts on Cloud Run
Managing startup latencies up to 20 seconds for AI workloads on Cloud Run serverless GPUs, which causes poor user experience and is driving developers back to traditional container orchestration.
From Silos to Service Topology: Why Netflix Built a Real-Time Service Map
Netflix needed a real-time, dynamic way for engineers to understand service dependencies and troubleshoot issues quickly across their complex distributed microservices infrastructure.