Automating contract intelligence with Doczy.ai™ on AWS
Organizations need to extract structured, actionable insights from unstructured contract documents at scale to automate critical business processes.
When history fails you, borrow from geography
Building reliable forecasting models for marketplace demand when historical data is unavailable or unreliable due to unprecedented market shocks.
Bringing Gemma 4 12B to your Laptop: Unlocking Local, Agentic Workflows with Google AI Edge
Enabling efficient execution of large language models (12B parameters) on resource-constrained devices like laptops with limited RAM while maintaining multimodal and agentic capabilities.
Gemma 4 12B: The Developer Guide
Running high-performance multimodal AI models efficiently on consumer devices without the computational overhead of traditional visual and audio encoders.
Introducing the Google Colab CLI
Developers and AI agents needed a way to seamlessly execute code on remote GPU-powered Colab runtimes without context-switching between local terminals and web interfaces.
Connecting AI agents with unstructured data using Google Cloud Storage MCP Servers
Enterprises need to integrate unstructured data from Google Cloud Storage into AI agent systems while maintaining security, standardization, and efficient context retrieval at scale.
Experimenting with TPUs, GKE Managed DRANET, and Multi-cluster Inference Gateway
Ensuring high availability and service continuity for AI inference workloads when one region fails, so the service remains reachable from other regions.
Scaling AI Agents: A Step-by-Step Guide to Deploying ADK on GKE Autopilot
Moving AI agents built with Google's Agent Development Kit from local prototypes to production-ready, scalable infrastructure.
Rethinking risk in the age of AI
Detecting and preventing fraudulent payments in real time as AI tools become more sophisticated and lower the barrier to attack.
The future of agentic commerce is here
How to enable AI agents to autonomously execute commerce transactions while maintaining Stripe's reliability and payment processing standards.
How we built Cloudflare's data platform and an AI agent on top of it
Cloudflare needed to unify fragmented analytics data across its global edge network and enable intelligent querying of that data at scale.
Beyond code generation: rethinking engineering productivity in the age of AI agents
How to transition from AI code-generation tools that only assist engineers to agentic systems that execute complete, scoped engineering tasks autonomously.
How the community trained Gemma to "Think" with Tunix and TPUs
How to enable developers with limited compute budgets to transform small base language models into capable reasoning engines through efficient training techniques.
A Guide to AI Cold Starts on Cloud Run
Managing startup latencies of up to 20 seconds for AI workloads on Cloud Run serverless GPUs, which degrade the user experience and are driving developers back to traditional container orchestration.
SilverTorch: Index as Model — A New Retrieval Paradigm for Recommendation Systems
Meta needed to improve the throughput and compute efficiency of retrieval systems for recommendation engines that process user-generated content at massive scale.
Expanding Stripe Radar to protect more of your business
Detecting and preventing fraud across heterogeneous payment methods and merchant platforms while identifying emerging fraud patterns like multi-account abuse and pay-as-you-go abuse.