Spotify

Better Experiments with LLM Evals — A funnel, not a fork

Spotify needed to efficiently evaluate and validate LLM-generated outputs at scale during experimentation, without manual review becoming a bottleneck.

ml-systems observability
4 min
Spotify

Background Coding Agents: Supercharging Downstream Consumer Dataset Migrations (Honk, Part 4)

Spotify needed to migrate thousands of downstream datasets when source datasets changed structure, without manually updating each consumer application.

data-pipelines microservices
4 min