Uber logo

How Uber Uses MySQL

78 engineering articles about MySQL from Uber's engineering team

Articles

Uber
Advanced
This article details Uber's multi-year evolution from static, quota-based rate limiting to an intelligent, priority-aware load management system for their distributed databases (Docstore and Schema...
Dhyanam Vaidya, Prathamesh Deshpande, Mike Ma, Chaitanya Yalamanchili
15 min read
--
Uber
Intermediate
The article discusses Uber's collaboration with Oracle Cloud Infrastructure (OCI) and Ampere Computing to optimize the OCI AmpereOne M A4 Compute instances.
Vikrant Soman, Dan Royal, Nav Kankani, Kamran Zargahi
7 min read
--
Uber
Intermediate
This article discusses Uber's implementation of Apache Pinot to manage and analyze its extensive inventory and catalog data efficiently.
Suraj Modi, Ankit Sultana, Tarun Mavani
11 min read
--
Uber
Advanced
This article discusses how Uber improved MySQL cluster uptime by adopting MySQL Group Replication (MGR) at scale.
Siddharth Singh, Raja Sriram Ganesan, Amit Jain, Debadarsini Nayak
13 min read
--
Uber
Advanced
This article discusses the improvements made to MySQL cluster uptime at Uber through the implementation of MySQL Group Replication (MGR).
Siddharth Singh, Raja Sriram Ganesan, Amit Jain, Debadarsini Nayak
10 min read
--
Uber
Advanced
The article discusses Uber's implementation of I/O observability for its massive petabyte-scale data lake, focusing on the challenges and solutions in monitoring data access patterns across its hyb...
Arnav Balyan, Kartik Bommepally, Amruth Sampath, Jing Zhao, Akshayaprakash Sharma
10 min read
--
Uber
Advanced
Uber's migration from Spark 2.4 to Spark 3.3 involved upgrading over 2 million Spark applications, utilizing innovative automation tools like Iron Dome.
Amruth Sampath, Arnav Balyan, Nimesh Khandelwal, Sumit Singh, Parth Halani, Suprit Acharya
8 min read
--
Uber
Advanced
This article discusses the architecture and implementation of Uber's HiveSync, a critical service for data replication across its massive data lake.
Radhika Patwari, Trivedhi Talakola, Rajan Jaiswal, Chayanika Bhandary, Mukesh Verma, Sanjay Sundaresan
14 min read
--
Uber
Intermediate
This article discusses how Uber's CacheFront serves over 150 million reads per second while ensuring stronger consistency guarantees.
Eli Pozniansky, Preetham Narayanareddy
13 min read
--
Uber
Advanced
The article discusses Uber's implementation of encryption at rest and disk isolation at scale using their Stateful Platform, Odin.
Ivan Shibitov, Johan Abildskov
14 min read
--
Uber
Advanced
The article discusses Finch, Uber's conversational AI data agent designed to streamline financial data retrieval within the Slack environment.
Austin Harrison, Eddie Huang, Spencer Garth, Tim Ross, Taya Yusuf
13 min read
--
Uber
Intermediate
This article details Uber's migration from Apache Hive to Apache Spark SQL for ETL workloads, highlighting the motivations behind the transition, the architecture involved, and the challenges faced...
Kumudini Kakwani, Akshayaprakash Sharma, Nimesh Khandelwal, Aayush Chaturvedi, Chintan Betrabet, Suprit Acharya
14 min read
--
Uber
Intermediate
The article discusses Uber's implementation of a configuration-driven archival and retrieval framework designed to manage vast amounts of regulatory data efficiently.
Abhishek Dobliyal, Aakash Bhardwaj
12 min read
--
Uber
Advanced
The article discusses Uber's robust database backup recovery system, highlighting its importance for business continuity and disaster recovery.
Arjav Jain, Shivam Vijay, Debadarsini Nayak, Mohammed Khatib, Ramnik Jain
11 min read
--
Uber
Advanced
The article discusses FixrLeak, a generative AI-based framework developed by Uber to automate the detection and repair of resource leaks in Java applications.
Chris Zhang, Akshay Utture, Manu Sridharan
7 min read
--
Uber
Advanced
The article discusses Profile-Guided Optimization (PGO) in Go programming, detailing its implementation at Uber to enhance performance through runtime profiling data.
Chris Zhang, Yufan Xu, Milind Chabbi, Shauvik Roy Choudhary
11 min read
--
Uber
Advanced
The article discusses Uber's transition to a multi-architecture environment by adopting Arm-based hosts at scale.
Andreas Lykke, Jesper Borlum
10 min read
--
Uber
Advanced
The article discusses the MySQL fleet at Uber, which consists of over 2,300 independent clusters that support critical operations for the platform.
Banty Kumar, Debadarsini Nayak, Raja Sriram Ganesan, Amit Jain
15 min read
--
Uber
Advanced
The article discusses how Uber leverages Neutrino, an internal fork of Presto, to efficiently serve millions of queries to Apache Pinot, a real-time OLAP database.
Ankit Sultana, Pratik Tibrewal, Christina Li, Shreyaa Sharma, Ujwala Tulshigiri
12 min read
--
Uber
Advanced
The article discusses Uber's implementation of Presto Express, an enhancement to the Presto SQL query engine aimed at reducing the end-to-end Service Level Agreement (SLA) for short-running queries.
Mingjia Hang, Gurmeet Singh
10 min read
--
Uber
Intermediate
The article discusses Uber's migration of its batch data platform to the cloud, focusing on the implementation of DataMesh principles.
Arun Mahadeva Iyer, Abhi Khune, Sahana Bhat
11 min read
--
Uber
Intermediate
The article discusses how Uber utilizes Apache Pinot for low-latency offline table analytics, highlighting its capabilities in handling various use cases, including real-time and offline data inges...
Ankit Sultana, Caner Balci
15 min read
--
Uber
Intermediate
The article discusses the Sparkle framework developed by Uber to standardize modular ETL processes, enhancing developer productivity and data quality.
Dinesh Jagannathan, Sharath Bhat, Suman Voleti, Praveen Raj
8 min read
--
Uber
Advanced
The article discusses Uber's upgrade of its MySQL fleet from version 5.7 to 8.0, detailing the motivations, challenges, and solutions implemented during the process.
Siddharth Singh, Sriram Rao Udupi, Raja Sriram Ganesan, Debadarsini Nayak
12 min read
--
Uber
Advanced
The article discusses Uber's implementation of differential backups for MyRocks-based distributed databases, addressing the challenges and solutions related to backup efficiency and cost.
Adithya Reddy, Shriniket Kale
15 min read
--
Uber
Advanced
Odin is Uber's stateful platform designed to manage various technologies for data storage efficiently.
Jesper Borlum, Gianluca Mezzetti
14 min read
--
Uber
Advanced
This article discusses how Uber has implemented single-zone failure tolerance (SZFT) for its Apache Cassandra® database, ensuring high availability even in the event of a zone failure.
Long Pan, Gopal Mor, Jaydeepkumar Chovatia, Shriniket Kale, Gabriele Di Bernardo
12 min read
--
Uber
Advanced
The article discusses Uber's overhaul of flaky tests through the introduction of Testopedia, a centralized system designed to enhance visibility and control over test reliability.
Xiaoyang Tan, Yushan Lin, Sergey Balabanov
17 min read
--
Uber
Advanced
The article discusses how Uber's LedgerStore manages trillions of indexes to support its vast transactional data, emphasizing the architecture and indexing strategies that ensure data integrity and...
Kaushik Devarajaiah
13 min read
--
Uber
Advanced
The article discusses how Uber serves over 40 million reads per second from its online storage using an integrated caching solution called CacheFront.
Preetham Narayanareddy, Eli Pozniansky, Zurab Kutsia, Afshin Salek, Piyush Patel
19 min read
--
Uber
Intermediate
DataCentral is Uber's proprietary platform designed for Big Data observability, chargeback, and governance.
Arnav Balyan, Atul Mantri, Krishna Karri, Amruth Sampath
10 min read
--
Uber
Advanced
This article discusses Uber's experience with garbage collection (GC) tuning to enhance the reliability of Presto, an open-source distributed SQL query engine.
Cristian Velazquez, Vineeth Karayil Sekharan
11 min read
--
Uber
Intermediate
The article discusses Uber's experience in adopting SPIFFE/SPIRE at scale, detailing the challenges and solutions encountered in implementing a Zero Trust security model across a complex microservi...
Andrew Moore, Ryan Turner, Kirutthika Raja, Prasad Borole, Kurtis Nusbaum, Zachary Train, Hasibul Haque
16 min read
--
Uber
Intermediate
This article discusses Uber's experience migrating a large-scale invoice generation service from a legacy system to a new service called Invoicer.
Georgi Zhuhov, Irina Kurteva, Iskren Dimov, Nikolay Lazarov, Plamena Todorova, Yordan Petrov
11 min read
--
Uber
Intermediate
The article discusses the evolution of Data Lifecycle Management (DLM) at Uber, detailing the journey from initial implementations to the development of a unified system.
Sumanth Srinivasa Krishnaswamy, Matt Mathew, Sonali Goyal
13 min read
--
Uber
Intermediate
The article discusses Uber's implementation of Attribute-Based Access Control (ABAC) to manage access across its microservices architecture.
--
Uber
Advanced
Cadence 1.0 is a powerful open-source workflow orchestration platform designed for building and managing stateful services at scale.
Ender Demirkaya
10 min read
--
Uber
Intermediate
The article discusses the Unified Action Platform (uAct) developed by Uber, aimed at consolidating various internal communication systems into a single interface for managing requests and notificat...
Chankit Bansal, Manmeet Kalirawana, Aasav Badera
14 min read
--
Uber
Intermediate
The article discusses how Uber optimizes the timing of push notifications using machine learning and linear programming.
Vinay Sharma, Rémi Torracinta, Giacomo Lamberti, Britton Overall
9 min read
--
Uber
Intermediate
This article discusses Uber's migration from MySQL to MyRocks, a storage engine that integrates with RocksDB, to address disk space bottlenecks and improve operational efficiency.
Shriniket Kale, Hao Xu, Shenglin Du
9 min read
--
Uber
Intermediate
The article discusses Uber's implementation of an automated vertical CPU scaling system that optimizes resource allocation for storage workloads, leading to significant cost savings and improved re...
Lasse Vilhelmsen
10 min read
--
Uber
Intermediate
The article discusses Uber's implementation of a highly scalable, distributed Remote Shuffle Service (RSS) designed to make data processing in Apache Spark more efficient.
Mayank Bansal, Bo Yang, Mayur Bhosale, Kai Jiang
20 min read
--
Uber
Advanced
The article discusses how Uber integrates Presto® and Apache Kafka® to enhance its big data analytics capabilities.
Yang Yang, Yupeng Fu, Hitarth Trivedi
10 min read
--
Uber
Advanced
The article discusses strategies to avoid CPU throttling in a containerized environment, particularly at Uber, where stateful workloads run on a large fleet of hosts.
Joakim Recht, Yury Vostrikov
7 min read
--
Uber
Advanced
This article discusses the development of Uber's Fulfillment Platform using Google Cloud Spanner, focusing on its architecture, scalability, and operational efficiency.
Ankit Srivastava, Fabin Jose, Jean He, Nandakumar Gopalakrishnan, [email protected], Ramachandran Iyer, Uday Kiran Medisetty
20 min read
--
Uber
Advanced
The article discusses Uber's initiatives to enhance the efficiency of its Big Data platform, focusing on cost reduction through optimizations in file formats, HDFS erasure coding, YARN scheduling i...
Zheng Shao, Mohammad Islam
18 min read
--
Uber
Advanced
The article discusses how Uber maintains operational excellence in data quality through a consolidated data quality platform (UDQ).
Ying Zou, Wei Yan, Maggie Ying, Sanjay Sundaresan, Sriharsha Chintalapani, Isabel Geracioti
21 min read
--
Uber
Intermediate
The article discusses Uber's comprehensive re-architecture of its Fulfillment Platform, aimed at enhancing its Go/Get strategy.
Ashwin Neerabail, Ankit Srivastava, Kamran Massoudi, Madan Thangavelu, Uday Kiran Medisetty
19 min read
--
Uber
Intermediate
The article discusses Uber's 'Orders Near You' feature, which utilizes real-time geospatial data analytics to enhance user experience in the Uber Eats app.
Yupeng Fu, Cassandra Tomazic, Dharak Kharod
10 min read
--
Uber
Advanced
The article discusses Uber's Customer Support Automation Platform, focusing on the development and implementation of the Policy Engine (PE) to enhance customer service efficiency.
Chia Yen Hung, Monis Khan, Norm Usenkanov
16 min read
--