DataCentral: Uber’s Big Data Observability and Chargeback Platform

Arnav Balyan, Atul Mantri, Krishna Karri, Amruth Sampath
10 min readintermediate
--
View Original

Overview

DataCentral is Uber's proprietary platform designed for Big Data observability, chargeback, and governance. It provides insights into performance trends, resource consumption, and cost efficiency, addressing the complexities of managing a vast data ecosystem that handles petabyte-scale operations daily.

What You'll Learn

1

How to utilize DataCentral for debugging failed queries and applications

2

Why observability is critical in managing big data ecosystems

3

How to implement cost reduction initiatives using DataCentral insights

Prerequisites & Requirements

  • Understanding of big data concepts and tools
  • Familiarity with data processing frameworks like Apache Spark and Presto(optional)

Key Questions Answered

How does DataCentral improve observability in Uber's data ecosystem?
DataCentral enhances observability by providing real-time insights into compute queries and applications across various engines like Presto, Spark, and Hive. It aggregates metrics to help users detect and debug applications faster, thereby minimizing downtime and improving performance.
What are the key features of DataCentral?
DataCentral offers observability, chargeback metrics, and consumption reduction programs. It provides granular insights into performance trends, costs, and resource usage for big data tools, enabling stakeholders to make informed decisions.
What challenges does Uber face in managing its big data landscape?
Uber's big data landscape is complex, with around a million applications and queries running daily. The main challenges include providing stakeholders with a holistic view of performance and resource consumption insights amidst this complexity.
How does DataCentral facilitate cost efficiency?
DataCentral drives cost efficiency through chargeback transparency and consumption reduction initiatives. It tracks resource usage at granular levels, helping identify expensive pipelines and unnecessary compute, leading to actionable cost-saving measures.

Key Statistics & Figures

Presto queries handled per day
500K
This statistic highlights the scale at which DataCentral operates, processing a significant volume of queries daily.
Spark applications handled per day
400K
This demonstrates the extensive use of Spark within Uber's data ecosystem, showcasing the need for effective observability.
Hive queries handled per day
2M
This indicates the high demand for Hive queries, underscoring the importance of DataCentral in managing such a large volume.
HDFS calls handled per day
10B
This figure illustrates the vast scale of HDFS interactions, necessitating robust monitoring and observability solutions.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Key Actionable Insights

1
Leverage DataCentral's observability features to enhance debugging processes.
By utilizing the insights provided by DataCentral, teams can quickly identify and resolve issues in their big data applications, reducing downtime and improving overall system reliability.
2
Implement chargeback metrics to foster accountability among teams.
Providing teams with detailed visibility into their resource usage encourages responsible consumption and helps identify areas for cost reduction.
3
Utilize historical trend data to inform future resource allocation.
By analyzing past performance metrics, teams can make data-driven decisions regarding resource allocation and infrastructure investments.

Common Pitfalls

1
Neglecting to monitor resource usage can lead to unexpected costs.
Without proper observability, teams may overlook inefficient resource consumption, resulting in inflated operational costs that could have been avoided with proactive monitoring.
2
Failing to utilize historical data for decision-making.
Ignoring past performance metrics can lead to repeated mistakes in resource allocation and infrastructure investments, ultimately hindering operational efficiency.

Related Concepts

Big Data Observability
Cost Efficiency In Data Management
Real-time Data Processing