Site Speed Monitoring in A/B Testing and Feature Ramp-up

Jiahui QI (JOY)
9 min readintermediate
--
View Original

Overview

The article discusses the importance of site speed monitoring during A/B testing and feature ramp-up at LinkedIn. It highlights the challenges of predicting site speed impacts and outlines the methodologies used to collect, visualize, and analyze performance data effectively.

What You'll Learn

1

How to monitor site speed impacts during A/B testing

2

Why real-time data is crucial for anomaly detection

3

When to use daily vs. real-time site speed data

Prerequisites & Requirements

  • Understanding of A/B testing methodologies
  • Familiarity with Real User Monitoring (RUM) tools(optional)

Key Questions Answered

How does LinkedIn monitor site speed during feature rollouts?
LinkedIn uses Real User Monitoring (RUM) data to track site speed metrics during A/B testing. By comparing performance metrics between experimental and control groups, developers can assess the impact of new features on site speed and make informed decisions about further rollouts.
What are the benefits of using daily and real-time data for site speed monitoring?
Daily data processed in Hadoop provides reliable insights due to lower variance, while real-time data processed in Apache Samza is useful for immediate anomaly detection. Each type serves different purposes, with real-time data alerting developers to issues quickly and daily data offering comprehensive performance assessments.
What challenges does LinkedIn face in site speed monitoring?
LinkedIn faces challenges such as scalability due to high traffic volumes and the need for timely anomaly detection. The system is designed to handle large datasets and provide alerts on performance degradation, ensuring that developers can respond quickly to potential issues.

Key Statistics & Figures

Performance improvement
20% faster
The page load time of the experiment group was 20% faster than that of the control group during A/B testing.
Traffic trends
10% A/B ramping stage
Engineers monitored ads click rate and site speed metrics at the 10% ramping stage to assess the impact of optimizations.
Alerting time
3.5 hours
The anomaly detection engine waits for 3.5 hours before starting A/B comparisons after an experiment begins.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Monitoring
Real User Monitoring (rum)
Used to collect site speed metrics from web pages and mobile applications.
Data Processing
Hadoop
Used for processing daily site speed data.
Data Processing
Apache Samza
Used for processing real-time site speed data.
Messaging
Apache Kafka
Used to send performance metrics and experiment information.
Data Processing
Spark
Used to enhance the performance of the offline data processing pipeline.

Key Actionable Insights

1
Implement a dual data processing strategy using both daily and real-time metrics to monitor site speed effectively.
This approach allows teams to benefit from the reliability of daily data while also having the agility to respond to real-time performance issues, ensuring a balanced view of site performance.
2
Utilize visualization tools to present site speed data clearly to stakeholders.
Effective visualization helps in communicating performance impacts and justifying decisions regarding feature rollouts, making it easier for teams to align on priorities.
3
Incorporate anomaly detection systems in your monitoring strategy to catch performance issues early.
By detecting anomalies in real-time, developers can address performance degradations before they affect a larger audience, thereby maintaining a positive user experience.

Common Pitfalls

1
Failing to differentiate between daily and real-time data can lead to misinterpretation of site performance.
Without understanding the strengths and weaknesses of each data type, teams may either react too quickly to noise in real-time data or miss critical insights from daily data.

Related Concepts

A/B Testing
Real User Monitoring
Performance Optimization