Demand and ETR Forecasting at Airports

Michael (Yichuan) Cai, Martina Balestra
11 min readadvanced
--
View Original

Overview

The article discusses the implementation of demand and estimated time to request (ETR) forecasting models at airports to optimize driver dispatching for Uber. It highlights the challenges of balancing supply and demand, the architecture of the forecasting models, and future improvements.

What You'll Learn

1

How to forecast supply and demand at airports using machine learning models

2

Why understanding queue dynamics is crucial for optimizing driver dispatch

3

When to implement real-time data ingestion for predictive modeling

Prerequisites & Requirements

  • Understanding of machine learning concepts and queue dynamics
  • Familiarity with Uber's Michelangelo platform for machine learning(optional)

Key Questions Answered

How does Uber forecast estimated time to request (ETR) at airports?
Uber forecasts ETR by developing three models: a supply model to estimate the true position of drivers in the FIFO queue, a demand model to predict queue consumption based on arriving flights, and a queue consumption logic that simulates wait times. This approach helps drivers understand when to reposition based on demand.
What challenges are faced in scaling queue-related signals for driver dispatch?
Scaling challenges include onboarding various real-time signals from different platforms, managing data emission rates in Kafka, and addressing database scaling issues due to unbalanced partition schemes. These challenges require careful engineering trade-offs to ensure efficient data processing.
What is the purpose of the ETR Venue Marker in the Uber Driver app?
The ETR Venue Marker provides drivers with real-time information about airport wait times, the number of drivers in the queue, and incoming flights. This helps drivers make informed decisions about when to enter the queue based on expected wait times.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Machine Learning Platform
Michelangelo
Used for training, managing, and deploying machine learning models at scale.
Data Streaming
Kafka
Employed for managing real-time data emissions related to driver queue signals.
Database
Nosql
Utilized for storing entity queue data and managing state changes.

Key Actionable Insights

1
Implementing a modular approach to machine learning models can enhance maintainability and scalability.
By designing separate models for supply and demand, Uber can easily update or replace components without affecting the entire system, facilitating future enhancements.
2
Utilizing near-real-time data ingestion can significantly improve the accuracy of demand forecasting.
Incorporating real-time signals allows for more responsive adjustments to driver dispatch strategies, especially during peak travel times.
3
Partitioning models by airport can reduce bias in predictions, but comes with increased operational costs.
Starting with a single pipeline allows for easier management, and as performance metrics are evaluated, specific airports can be targeted for more tailored solutions.

Common Pitfalls

1
Failing to account for the dynamic nature of driver queues can lead to inaccurate ETR predictions.
Drivers may abandon the queue or gain priority access, affecting the actual wait times. It's crucial to continuously update models with real-time data to reflect these changes.
2
Over-reliance on batch data without integrating real-time signals can degrade prediction accuracy.
Batch data may not capture sudden changes in demand due to external factors like weather or flight delays, leading to suboptimal dispatch decisions.

Related Concepts

Machine Learning For Demand Forecasting
Real-time Data Processing Techniques
Queue Management Strategies In Transportation