Archives — Distributed Readings

Google ↗

Building with Gemini Embedding 2: Agentic multimodal RAG and beyond

Developers needed a unified embedding model capable of processing interleaved multimodal inputs (text, images, video, audio, documents) in a single semantic space for tasks like retrieval-augmented generation and visual search.

api-design ml-systems

5 min

Netflix ↗

Powering Multimodal Intelligence for Video Search

Netflix needed to efficiently extract and surface key moments from hundreds or thousands of hours of raw video footage for editorial teams to accelerate the creative content production process.

ml-systems search

5 min

Meta ↗

Modernizing the Facebook Groups Search to Unlock the Power of Community Knowledge

Facebook Groups Search was unreliable at helping users discover and validate community content most relevant to their search queries.

search ml-systems

5 min

Cloudflare ↗

AI Search: the search primitive for your agents

Providing a scalable, efficient search infrastructure that allows AI agents to dynamically create search instances and perform semantic queries across uploaded documents without managing underlying indexing complexity.

search ml-systems

4 min

LinkedIn ↗

Driving data enhancement & recruitment success with LinkedIn’s unified integrations

LinkedIn's recruitment platform needed richer data signals to improve candidate matching and recruiter success rates.

search databases

3 min

LinkedIn ↗

Optimizing LinkedIn Sales Navigator’s search pipeline with Spark

LinkedIn Sales Navigator's search pipeline had latency issues as query complexity and data volume grew.

search caching

3 min

LinkedIn ↗

Reimagining LinkedIn’s search tech stack

LinkedIn's legacy search infrastructure couldn't scale to handle growing query volumes and evolving relevance requirements across its platform.

search distributed-systems

3 min

Airbnb ↗

Academic Publications & Airbnb Tech: 2025 Year in Review

Airbnb needed to advance its AI, data science, and machine learning capabilities across multiple domains (NLP, optimization, measurement science) to improve its travel and living platform, requiring solutions to challenges in search ranking, recommendation, experimentation, and large-scale data processing.

ml-systems search

5 min

Airbnb ↗

Recommending Travel Destinations to Help Users Explore

Airbnb users in the early trip planning stage often lack a clear travel destination, making it difficult to provide relevant recommendations and convert exploratory browsing into bookings.

ml-systems search

5 min

Dropbox ↗

Engineering VP Josh Clemm on how we use knowledge graphs, MCP, and DSPy in Dash

Enterprise search and AI assistant products like Dropbox Dash need to connect disparate data sources and optimize AI-driven retrieval, but naively querying across siloed data with LLMs leads to poor relevance and brittle prompt engineering.

search ml-systems

3 min

Dropbox ↗

How Dash uses context engineering for smarter AI

Dropbox Dash's AI agent struggled with effectiveness when naively providing all available context to the model, leading to degraded performance as irrelevant information diluted the signal needed for accurate, agentic AI responses.

ml-systems search

3 min

Dropbox ↗

How we optimized Dash's relevance judge with DSPy

Manual prompt engineering for Dropbox Dash's relevance judge was unreliable, hard to measure, and costly—making it difficult to systematically improve task performance in production.

ml-systems search

3 min

Dropbox ↗

Inside the feature store powering real-time AI in Dropbox Dash

Dropbox Dash needs to rank and retrieve relevant context across a user's work in real time, requiring low-latency access to precomputed and real-time features for AI-driven search and recommendation models.

ml-systems real-time-systems

3 min

Dropbox ↗

Using LLMs to amplify human labeling and improve Dash search relevance

Dash's search ranking models required large volumes of high-quality labeled relevance data to train effectively, but human labeling alone was too slow and expensive to scale to the needed coverage.

search ml-systems

3 min

Dropbox ↗

With Mobius Labs' Aana models, we're bringing deeper multimodal understanding to Dropbox Dash

Dropbox Dash needed deeper understanding of multimodal content (photos and videos) across user files, but processing diverse media types at Dropbox's scale posed efficiency and architectural challenges.

ml-systems search

3 min

Netflix ↗

The AI Evolution of Graph Search at Netflix

Netflix's Graph Search platform for federated enterprise data required users to write structured queries, limiting accessibility and ease of use despite the system being scalable and configurable.

search ml-systems

5 min