Spotify’s New Experimentation Platform (Part 1)

Johan Rydberg
9 min readadvanced
--
View Original

Overview

Spotify's new Experimentation Platform aims to enhance the way product teams conduct A/B testing and experiments. This article details the limitations of the previous system (ABBA) and introduces the new architecture designed to improve efficiency, reduce event logging, and facilitate better analysis.

What You'll Learn

1

How to utilize Remote Configuration to manage user experience dynamically

2

Why reducing event logging is critical for system performance

3

When to implement custom metrics for A/B testing analysis

Prerequisites & Requirements

  • Understanding of A/B testing concepts
  • Familiarity with SQL for metrics analysis(optional)

Key Questions Answered

What were the limitations of Spotify's previous A/B testing system, ABBA?
ABBA had several limitations, including a rigid 1-1 mapping between experiments and feature flags, which made restarting experiments cumbersome. Additionally, it generated a high volume of event logs that contributed to processing costs and lacked sufficient metrics for comprehensive analysis.
How does the new Experimentation Platform improve upon the old system?
The new Experimentation Platform introduces Remote Configuration, a Metrics Catalog, and an Experiment Planner, which collectively enhance flexibility, reduce event logging, and allow for more sophisticated analysis of experiments, addressing the shortcomings of ABBA.
What is the role of Remote Configuration in Spotify's new system?
Remote Configuration replaces the feature-flagging service with a properties model, allowing dynamic control over user experiences. It enables the adjustment of various properties like UI elements without needing to redeploy the application, streamlining experimentation.
What metrics does the Metrics Catalog provide for A/B testing?
The Metrics Catalog aggregates raw metric data and combines it with user experiment group information, allowing for efficient querying and analysis of experiment results. This setup supports real-time data access for better decision-making.

Key Statistics & Figures

Percentage of total event volume attributed to A/B testing logs
25%
The previous A/B testing system generated logs that accounted for nearly a quarter of Spotify's total event volume, leading to increased processing costs.

Technologies & Tools

Backend
Remote Configuration
Manages user experience by controlling configurable properties.
Data Management
Metrics Catalog
Stores and serves metrics for analysis in the Experimentation Platform.
Backend
Planout
Used for executing policies that determine property values for users.

Key Actionable Insights

1
Implement Remote Configuration to enhance user experience management across different platforms.
By using Remote Configuration, teams can dynamically adjust UI elements and features without redeploying code, which significantly reduces development time and improves user engagement.
2
Focus on reducing event logging to optimize system performance and reduce costs.
As event logging can consume significant resources, minimizing unnecessary logs can lead to improved system efficiency and lower operational costs, especially in high-traffic environments.
3
Adopt a consistent methodology for analyzing A/B tests to improve decision-making.
Standardizing how experiments are analyzed ensures that insights are reliable and actionable, fostering a data-driven culture within product teams.

Common Pitfalls

1
Failing to manage the volume of event logging can lead to system performance issues.
As seen with the previous ABBA system, excessive event logging can overwhelm processing capabilities and inflate costs. Teams should implement strategies to minimize unnecessary logs.
2
Inconsistent analysis methodologies can lead to unreliable insights from experiments.
Without a standardized approach to analyzing A/B tests, teams may draw incorrect conclusions, hindering product development. Establishing a consistent framework is essential.

Related Concepts

A/B Testing Methodologies
Experimentation Best Practices
Data-driven Decision Making