Cloudflare

Announcing Claude Compliance API support with Cloudflare CASB

Security teams needed visibility into Claude Enterprise API usage across their organization, along with compliance monitoring, without leaving their existing security infrastructure.

security api-design
3 min
Cloudflare

Announcing Claude Managed Agents on Cloudflare

Enabling developers to deploy and scale autonomous agent workflows globally while maintaining security isolation and control over access to private backend systems.

distributed-systems security
4 min
Dropbox

Introducing Nova, our internal platform for coding agents

Enabling engineers to run multiple concurrent coding sessions and integrating AI agents into automated internal workflows at scale.

microservices api-design
3 min
Google

A Smarter Google AI Edge Gallery: MCP integration, notifications, and session continuity

Enable on-device AI models to coordinate complex tasks across external data sources while maintaining persistent user context and proactive engagement without relying solely on cloud connectivity.

api-design ml-systems
5 min
Google

A2UI v0.9: The New Standard for Portable, Framework-Agnostic Generative UI

AI agents needed a standardized way to generate UI components that work across different platforms and frameworks without being tightly coupled to any specific technology stack.

api-design real-time-systems
5 min
Google

Accelerating on-device AI: A look at Arm and Google AI Edge optimization

Enabling efficient execution of generative AI models on edge devices with limited computational resources while maintaining acceptable latency and performance.

ml-systems api-design
5 min
Google

Agents CLI in Agent Platform: create to production in one CLI

Developers face high context overhead and token waste when scaffolding AI agents locally and struggle to bridge the gap between development environments and production-grade deployment on Google Cloud.

api-design microservices
5 min
Google

All the news from the Google I/O 2026 Developer keynote

How to enable developers to build applications powered by autonomous AI agents rather than traditional assistive AI interfaces.

api-design sdks
5 min
Google

An important update: Transitioning Gemini CLI to Antigravity CLI

Google needed to unify fragmented AI terminal tooling by consolidating the community-focused Gemini CLI into a more scalable, agent-first platform capable of handling complex multi-agent workflows.

api-design microservices
5 min
Google

Announcing ADK for Kotlin and ADK for Android 0.1.0: Building AI Agents on Android and Beyond

Developers needed a way to build AI agent workflows that could run on Android devices and backend systems without reinventing the core agentic logic across different platforms.

api-design sdks
3 min
Google

Announcing Genkit Middleware: Intercept, extend, and harden your agentic apps

Developers need a way to reliably control, monitor, and extend AI model generation calls in production agentic applications without modifying core business logic.

api-design ml-systems
5 min
Google

Blazing fast on-device GenAI with LiteRT-LM

Running large language models efficiently on mobile and edge devices while preserving multimodal and agentic capabilities without requiring server-side inference.

ml-systems mobile-platforms
5 min
Google

Build Long-running AI agents that pause, resume, and never lose context with ADK

Building production-grade AI agents that can maintain context and state across long-running enterprise workflows spanning days or weeks without losing information during idle periods or server restarts.

api-design distributed-systems
5 min
Google

Building real-world on-device AI with LiteRT and NPU

Mobile developers faced performance and battery inefficiency when running AI models on CPU/GPU, limiting real-time AI applications on edge devices.

api-design ml-systems
5 min
Google

Building with Gemini Embedding 2: Agentic multimodal RAG and beyond

Developers needed a unified embedding model capable of processing interleaved multimodal inputs (text, images, video, audio, documents) in a single semantic space for tasks like retrieval-augmented generation and visual search.

api-design ml-systems
5 min
Google

Empowering Service Providers and Hardware Partners with Gemini for Home

How can Google enable third-party service providers and hardware manufacturers to build intelligent smart home experiences without requiring deep AI/ML expertise or significant R&D investment?

api-design ml-systems
5 min
Google

Google Tensor SDK Beta with LiteRT

Developers needed a unified way to build, deploy, and run high-performance machine learning models directly on edge devices (Google Pixel TPU) with reliable fallback mechanisms.

ml-systems api-design
5 min
Google

New enhancements for merchant initiated transactions with the Google Pay API

Merchants needed greater flexibility and control when initiating payment transactions for recurring subscriptions, deferred payments, and automatic reloads while maintaining user transparency.

api-design
5 min
Google

One Year of Innovation: Celebrating 100k Members in the Google Cloud x NVIDIA Developer Community

Developers needed accessible infrastructure, resources, and structured learning pathways to effectively build and optimize AI applications using GPUs and large language models at scale.

api-design ml-systems
5 min
Google

Production-Ready AI Agents: 5 Lessons from Refactoring a Monolith

Converting a brittle, monolithic sales research AI prototype into a production-ready agent by eliminating silent failures, fragile parsing, and a lack of observability.

microservices observability
5 min
Google

Speeding Up AI: Bringing Google Colossus to PyTorch via GCSFS and Rapid Bucket

AI training pipelines were bottlenecked by slow data I/O when accessing training datasets stored in Google Cloud, limiting throughput and increasing total training time.

storage-systems ml-systems
5 min
Google Cloud

Agent Factory Recap: How Gemma 4 Taught Itself Physics

How to deploy high-intelligence AI models with agentic capabilities to consumer hardware and mobile devices without requiring cloud infrastructure.

ml-systems distributed-systems
5 min
Google Cloud

Create Expert Content: Deploying a Multi-Agent System with Terraform and Cloud Run

Automating the transformation of raw community signals into reliable technical guidance at scale using multiple specialized agents.

microservices api-design
5 min
Google Cloud

Gemini Live Agent Challenge: Announcing the winners and highlights

How to enable developers to build multimodal AI agents that can process and respond to real-time audio, video, and text, moving beyond traditional text-based interfaces.

real-time-systems api-design
5 min
Google Cloud

Introducing Gemini Enterprise Agent Platform, powering the next wave of agents

Building safe, reliable, and autonomous agents that can act independently across multiple enterprise systems while maintaining security, governance, and reliability guardrails.

ml-systems security
5 min
Google Cloud

Introducing the Builders Hub from the Google Developer Program

Developers lose productivity navigating fragmented tooling across multiple consoles, documentation sites, and services to manage their projects and stay informed.

api-design general
5 min
Google Cloud

Level Up Your Agents: Announcing Google's Official Skills Repository

AI agents built on Google Cloud need access to accurate, current, and grounded information about Google's products and APIs to function effectively.

api-design ml-systems
5 min
Google Cloud

Securing Your Gemini and Google API Keys

Developers using Google's AI APIs (Gemini and Google APIs) are exposing their API keys to unauthorized access, leading to account compromise, token theft, and service abuse.

security api-design
5 min
Google Cloud

Shipping features to production just got easier with new feature flags in AppLifecycle Manager

Development teams struggle to safely deploy code to production while managing the risk of releasing features to all users simultaneously, especially as AI accelerates code generation faster than safe deployment practices can keep up.

devops observability
5 min
Google Cloud

What Google I/O '26 means for developing agents on Google Cloud

Developers needed a unified, secure way to build AI agents locally and deploy them to Google Cloud with standardized protocols and tooling.

api-design microservices
5 min
Airbnb

Viaduct 1.0 and the future of Airbnb’s data mesh

Airbnb needed to transition Viaduct from an internal-only data mesh tool to a production-ready, community-driven platform with a stable public API.

api-design distributed-systems
5 min
Stripe

Five vertical SaaS insights from Sessions 2026

Vertical SaaS platforms needed to expand their service offerings beyond pure software to include integrated payments, financial services, and agentic commerce capabilities to build more defensible and durable businesses.

api-design distributed-systems
3 min
Netflix

Evaluating Netflix Show Synopses with LLM-as-a-Judge

Netflix needed to automatically evaluate the quality and relevance of show synopses at scale to improve member discovery and engagement.

ml-systems api-design
5 min
Netflix

State of Routing in Model Serving

Netflix needed to design a domain-independent traffic routing system for their ML model serving infrastructure that could handle personalized experiences at scale across multiple domains while maintaining high availability.

microservices load-balancing
5 min
Spotify

Building a Natural Language Interface to the Spotify Ads API with Claude Code Plugins

Making the Spotify Ads API accessible to non-technical users and reducing friction in ad campaign management by enabling natural language interaction instead of requiring direct API integration.

api-design microservices
4 min
Stripe

10 things we learned building for the first generation of agentic commerce

Building reliable payment and commerce systems that can handle autonomous AI agents as buyers, which introduce new failure modes and consistency requirements not present in traditional e-commerce.

api-design distributed-systems
4 min
Stripe

Analyzing first-party fraud trends: Account, free trial, and refund abuse

Detecting and preventing first-party fraud at scale across a payment network where legitimate users abuse policies through multiple accounts, free trial cycling, and refund exploitation.

ml-systems security
4 min
Stripe

Everything we announced at Sessions 2026

How to make payment infrastructure more programmable while maintaining reliability across a global distributed network and enabling new use cases like AI economic infrastructure.

api-design distributed-systems
3 min
Stripe

Giving agents the ability to pay

Enable autonomous agents to programmatically access payment instruments and execute transactions without requiring human intervention or direct card/account access.

api-design security
4 min
Stripe

How Stripe Radar helps prevent free trial abuse

Detecting and preventing fraudulent behavior in free trial signups, such as repeated trial abuse and missed cancellations, at scale with high accuracy.

ml-systems api-design
4 min
Stripe

How agents, digital wallets, and trust are rewriting checkout

Understanding and optimizing the checkout conversion funnel across diverse ecommerce businesses to identify what drives successful transactions in modern online payment flows.

api-design real-time-systems
4 min
Stripe

Insights from Shoptalk 2026: How agents are changing retail

How to integrate AI agents into ecommerce platforms to enable seamless product discovery and checkout across embedded and third-party surfaces.

api-design real-time-systems
4 min
Stripe

Introducing the Machine Payments Protocol

Enable autonomous agents and machines to initiate and complete payments programmatically over the internet without requiring human intermediation.

api-design distributed-systems
4 min
Stripe

Testing the impact of Adaptive Pricing across 1.5M subscription checkout sessions

How to automatically localize subscription pricing across 150+ countries while measuring the business impact of dynamic pricing on conversion and lifetime value.

api-design observability
4 min
Stripe

Three of the biggest fraud trends from MRC Vegas 2026

Detecting and preventing sophisticated fraud attacks while minimizing friction for legitimate users in payment systems.

api-design security
4 min
Cloudflare

Agents can now create Cloudflare accounts, buy domains, and deploy

How to enable autonomous agents to programmatically create Cloudflare accounts, purchase domains, and deploy infrastructure without manual dashboard interaction or credential handling.

api-design security
4 min
Cloudflare

Introducing Dynamic Workflows: durable execution that follows the tenant

Enable multi-tenant platforms to execute millions of unique, durable workflows without incurring significant idle infrastructure costs.

distributed-systems microservices
4 min
Cloudflare

Building the agentic cloud: everything we launched during Agents Week 2026

How to enable developers to build and deploy AI agents at scale across a distributed edge computing network while maintaining security and providing necessary infrastructure tools.

distributed-systems security
4 min
Cloudflare

Moving past bots vs. humans

Traditional bot detection mechanisms are becoming ineffective as AI assistants and privacy proxies blur the distinction between legitimate users and automated abuse.

security api-design
4 min
Cloudflare

Orchestrating AI Code Review at scale

Cloudflare needed to scale code review processes across their engineering organization while maintaining code quality and security standards without overwhelming human reviewers.

ml-systems api-design
3 min
Cloudflare

The AI engineering stack we built internally — on the platform we ship

Cloudflare needed to build an internal AI engineering stack that could handle massive scale (20 million requests, 241 billion tokens) while dogfooding their own platform products.

api-design ml-systems
4 min
Airbnb

Privacy-first connections: Empowering social experiences at Airbnb

How can Airbnb enable social features and community connections while maintaining strict user privacy and giving users control over their personal data sharing?

security api-design
5 min
Cloudflare

AI Search: the search primitive for your agents

Providing a scalable, efficient search infrastructure that allows AI agents to dynamically create search instances and perform semantic queries across uploaded documents without managing underlying indexing complexity.

search ml-systems
4 min
Cloudflare

Add voice to your agent

Enabling developers to build conversational agents with real-time voice capabilities without requiring complex infrastructure setup.

real-time-systems api-design
4 min
Cloudflare

Agents that remember: introducing Agent Memory

AI agents lack persistent memory mechanisms to retain context, learn from interactions, and improve decision-making over time.

storage-systems ml-systems
3 min
Cloudflare

Artifacts: versioned storage that speaks Git

Providing agents, developers, and automations with scalable, Git-compatible versioned storage that can handle tens of millions of repositories without forcing them to manage infrastructure.

storage-systems api-design
4 min
Cloudflare

Browser Run: give your agents a browser

AI agents needed a way to interact with browsers at scale while maintaining visibility and control over automated actions, requiring higher concurrency and real-time debugging capabilities.

real-time-systems ml-systems
3 min
Cloudflare

Cloudflare Email Service: now in public beta. Ready for your agents

Enabling AI agents to send, receive, and process email natively as a multi-channel communication medium without requiring developers to build custom email infrastructure.

api-design microservices
4 min
Cloudflare

Cloudflare’s AI Platform: an inference layer designed for agents

Developers needed a unified way to access multiple AI model providers without managing separate integrations and API contracts for each one.

api-design microservices
4 min
Cloudflare

Deploy Postgres and MySQL databases with PlanetScale + Workers

Enabling serverless applications to connect to managed relational databases without managing infrastructure or dealing with connection pooling complexities.

databases api-design
3 min
Cloudflare

Introducing Agent Lee - a new interface to the Cloudflare stack

Users had to manually navigate multiple tabs and interfaces within the Cloudflare dashboard to troubleshoot issues and manage their infrastructure, creating friction in the workflow.

api-design security
4 min
Cloudflare

Introducing Flagship: feature flags built for the age of AI

Third-party feature flag services introduce unacceptable latency for applications requiring sub-millisecond flag evaluation at global scale.

caching distributed-systems
4 min
Cloudflare

Introducing the Agent Readiness score. Is your site agent-ready?

Website owners needed a way to measure and understand how well their sites support AI agents and web crawlers for indexing and integration.

api-design observability
4 min
Cloudflare

Project Think: building the next generation of AI agents on Cloudflare

Building a scalable platform for deploying AI agents at the edge that can think, act, and persist state across distributed Cloudflare infrastructure.

distributed-systems ml-systems
3 min
Cloudflare

Redirects for AI Training enforces canonical content

AI crawlers were ingesting deprecated and non-canonical content despite soft directives like robots.txt, requiring a way to enforce canonical versions without modifying origin infrastructure.

caching security
4 min
Cloudflare

Register domains wherever you build: Cloudflare Registrar API now in beta

Developers needed a programmatic way to register and manage domains without leaving their development workflow or switching between multiple tools and platforms.

api-design
4 min
Cloudflare

Securing non-human identities: automated revocation, OAuth, and scoped permissions

Developers lack effective mechanisms to prevent unauthorized access when API credentials are accidentally exposed or compromised.

security api-design
4 min
Cloudflare

Shared Dictionaries: compression that keeps up with the agentic web

Web pages are growing larger and slower to load due to increased dynamic content, requiring better compression techniques that can adapt to modern agentic web patterns.

caching api-design
3 min
Cloudflare

How we built Organizations to help enterprises manage Cloudflare at scale

Cloudflare needed to enable enterprise customers to manage multiple accounts and resources under a unified organizational structure with centralized authorization and access control.

api-design security
4 min
AWS

Architecting for agentic AI development on AWS

AI agents struggle to iterate rapidly on system design and codebases due to architectural patterns that limit their ability to understand, modify, and validate applications effectively.

microservices serverless
5 min
Cloudflare

How we use Abstract Syntax Trees (ASTs) to turn Workflows code into visual diagrams

How to automatically convert TypeScript workflow code into visual step diagrams for users to understand and interact with their workflows in the dashboard.

api-design
3 min
AWS

Announcing the updated AWS Well-Architected Generative AI Lens

Organizations building generative AI workloads on AWS lacked comprehensive architectural guidance covering responsible AI, data architecture, and emerging patterns like agentic workflows, leading to poorly architected AI systems.

ml-systems api-design
4 min
AWS

BASF Digital Farming builds a STAC-based solution on Amazon EKS

BASF Digital Farming needed a scalable way to catalog, discover, and serve large volumes of spatiotemporal geospatial data (satellite imagery, crop data) for their xarvio crop optimization platform, and their existing infrastructure struggled with the scale and query patterns of this data.

microservices storage-systems
4 min
AWS

Building an AI gateway to Amazon Bedrock with Amazon API Gateway

Enterprises adopting Amazon Bedrock need centralized governance over AI model access, including authorization controls, usage quotas, and auditing, but lack a standardized gateway pattern to enforce these policies at scale.

api-design rate-limiting
4 min
AWS

How Convera built fine-grained API authorization with Amazon Verified Permissions

Convera needed to implement fine-grained authorization for their API platform, where coarse-grained access controls were insufficient to manage complex permission requirements across API resources and actions.

api-design security
3 min
AWS

She architects: Bringing unique perspectives to innovative solutions at AWS

The article addresses the challenge of diverse representation and perspectives in cloud architecture roles, exploring how a lack of varied viewpoints can limit innovation in technical solution design.

api-design
5 min
Airbnb

GraphQL Data Mocking at Scale with LLMs and @generateMock

Writing and maintaining valid, realistic mock data for GraphQL testing and prototyping is tedious; existing approaches like random value generation and field-level stubbing lack domain context, resulting in unconvincing and brittle test data that doesn't scale across a large schema.

api-design ml-systems
5 min
Airbnb

Pay As a Local

Airbnb relied primarily on card payments across 220+ global markets, but many users preferred local payment methods, causing checkout friction, reduced accessibility, and lower adoption in key markets.

api-design microservices
5 min
Cloudflare

A QUICker SASE client: re-building Proxy Mode

The Cloudflare One SASE client's Proxy Mode relied on user-space TCP stacks for tunneling traffic, introducing significant overhead that limited throughput and increased latency for end users.

distributed-systems api-design
4 min
Cloudflare

AI Security for Apps is now generally available

Organizations struggle to discover and secure AI-powered applications across their infrastructure, especially shadow AI deployments that teams spin up without central oversight, creating security blind spots.

security api-design
4 min
Cloudflare

Active defense: introducing a stateful vulnerability scanner for APIs

Standard defensive security tools miss logic flaws and vulnerabilities in APIs because they lack understanding of stateful API interactions and business logic flows.

security api-design
3 min
Cloudflare

Announcing Cloudflare Account Abuse Protection: prevent fraudulent attacks from bots and humans

Traditional bot-blocking approaches are insufficient for preventing account abuse (e.g., credential stuffing, fake account creation) because sophisticated attacks increasingly involve human-like behavior or actual humans, bypassing conventional bot detection.

security rate-limiting
3 min
Cloudflare

Fixing request smuggling vulnerabilities in Pingora OSS deployments

Cloudflare's open-source Pingora proxy had request smuggling vulnerabilities when deployed as an ingress proxy, allowing attackers to exploit HTTP parsing discrepancies to bypass security controls and route malicious requests.

security api-design
3 min
Cloudflare

From the endpoint to the prompt: a unified data security vision in Cloudflare One

Organizations face fragmented data security across endpoints, network traffic, cloud applications, and AI prompts, making it difficult to enforce consistent data loss prevention (DLP) policies as data flows through diverse channels including RDP sessions and AI copilots.

security api-design
3 min
Cloudflare

Slashing agent token costs by 98% with RFC 9457-compliant error responses

AI agents hitting Cloudflare error pages received heavyweight HTML responses that consumed excessive tokens and required brittle parsing, making automated error handling inefficient and costly.

api-design ml-systems
4 min
Cloudflare

Standing up for the open Internet: why we appealed Italy’s "Piracy Shield" fine

Italy's 'Piracy Shield' system forces Internet infrastructure providers like Cloudflare to block content at the network level without proper oversight or due process, leading to disproportionate overblocking of legitimate content.

security api-design
4 min
Dropbox

Engineering VP Josh Clemm on how we use knowledge graphs, MCP, and DSPy in Dash

Enterprise search and AI assistant products like Dropbox Dash need to connect disparate data sources and optimize AI-driven retrieval, but naively querying across siloed data with LLMs leads to poor relevance and brittle prompt engineering.

search ml-systems
3 min