Uber logo

How Uber Uses Cassandra

58 engineering articles about Cassandra from Uber's engineering team

Articles

Uber
Advanced
This article discusses the development and implementation of forecasting models aimed at improving driver availability at airports, which are critical to Uber's ridesharing ecosystem.
Bob Zheng, Dhruv Ghulati, Manoj Panikkar, Michael (Yichuan) Cai
15 min read
--
Uber
Advanced
The article discusses Uber's implementation of encryption at rest and disk isolation at scale using their Stateful Platform, Odin.
Ivan Shibitov, Johan Abildskov
14 min read
--
Uber
Advanced
The article discusses Uber's robust database backup recovery system, highlighting its importance for business continuity and disaster recovery.
Arjav Jain, Shivam Vijay, Debadarsini Nayak, Mohammed Khatib, Ramnik Jain
11 min read
--
Uber
Intermediate
The article discusses Uber's migration of large-scale interactive compute workloads from Peloton to Kubernetes, focusing on minimizing disruption while enhancing resource management and cloud readi...
Sayan Pal, Rishabh Mishra
12 min read
--
Uber
Advanced
The article discusses Uber's transition to a multi-architecture environment by adopting Arm-based hosts at scale.
Andreas Lykke, Jesper Borlum
10 min read
--
Uber
Advanced
The article discusses The Accounter, a global coordination system developed by Uber to enhance operational throughput and safety on its stateful platform, Odin.
Jesper Borlum, Gianluca Mezzetti, Alexander Blazhenskikh
14 min read
--
Uber
Advanced
The article discusses Uber's advanced settlement accounting system, which is crucial for managing financial transactions involving payment service providers (PSPs).
Onkar Singh, Sai Sameera Grandhi, Nagesh Kumar Mankala, Abhinav Agarwal
12 min read
--
Uber
Intermediate
The article discusses the Sparkle framework developed by Uber to standardize modular ETL processes, enhancing developer productivity and data quality.
Dinesh Jagannathan, Sharath Bhat, Suman Voleti, Praveen Raj
8 min read
--
Uber
Advanced
Odin is Uber's stateful platform designed to manage various technologies for data storage efficiently.
Jesper Borlum, Gianluca Mezzetti
14 min read
--
Uber
Advanced
This article discusses how Uber has implemented single-zone failure tolerance (SZFT) for its Apache Cassandra® database, ensuring high availability even in the event of a zone failure.
Long Pan, Gopal Mor, Jaydeepkumar Chovatia, Shriniket Kale, Gabriele Di Bernardo
12 min read
--
Uber
Advanced
The article discusses Uber's journey in enhancing its Palette Meta Store, focusing on the challenges faced, the solutions implemented, and the resulting improvements in machine learning feature man...
Paarth Chothani, Nicholas Marcott, Dehua Lai, Xiyuan Feng, Chunhao Zhang, Victoria Wu
10 min read
--
Uber
Intermediate
The article discusses the evolution of Data Lifecycle Management (DLM) at Uber, detailing the journey from initial implementations to the development of a unified system.
Sumanth Srinivasa Krishnaswamy, Matt Mathew, Sonali Goyal
13 min read
--
Uber
Intermediate
The article discusses how Uber optimized its operations of the open-source Apache Cassandra database at scale, addressing various challenges and improvements made over time.
Jaydeepkumar Chovatia, Gopal Mor, Runtian Liu
11 min read
--
Uber
Advanced
Cadence 1.0 is a powerful open-source workflow orchestration platform designed for building and managing stateful services at scale.
Ender Demirkaya
10 min read
--
Uber
Advanced
The article discusses Uber's transition from a Server-Sent Events (SSE) architecture to a gRPC-based push platform, detailing the motivations, implementation challenges, and outcomes of this migrat...
Anirudh Raja, Shahbaz Kaladiya, Shivani Bhatia, Xinlin Peng
19 min read
--
Uber
Intermediate
The article discusses Uber's implementation of an automated vertical CPU scaling system that optimizes resource allocation for storage workloads, leading to significant cost savings and improved re...
Lasse Vilhelmsen
10 min read
--
Uber
Advanced
The article discusses strategies to avoid CPU throttling in a containerized environment, particularly at Uber, where stateful workloads run on a large fleet of hosts.
Joakim Recht, Yury Vostrikov
7 min read
--
Uber
Advanced
The article discusses the challenges and solutions in building scalable streaming pipelines for generating near real-time features at Uber.
Feng Xu, Gang Zhao
19 min read
--
Uber
Intermediate
The article discusses Uber's comprehensive re-architecture of its Fulfillment Platform, aimed at enhancing its Go/Get strategy.
Ashwin Neerabail, Ankit Srivastava, Kamran Massoudi, Madan Thangavelu, Uday Kiran Medisetty
19 min read
--
Uber
Intermediate
The article discusses Uber's 'Orders Near You' feature, which utilizes real-time geospatial data analytics to enhance user experience in the Uber Eats app.
Yupeng Fu, Cassandra Tomazic, Dharak Kharod
10 min read
--
Uber
Advanced
The article discusses the challenges posed by flaky unit tests in Java, particularly in the context of Continuous Integration (CI) systems.
Ravi Agarwal, Lazaro Clapp, Gautam Korlam, Murali Krishna Ramanathan, Vijay Subramanian
19 min read
--
Uber
Advanced
The article discusses the evolution of Uber's Schemaless datastore into a distributed SQL database called Docstore, highlighting its features, architecture, and motivation behind the transition.
Ovais Tariq, Deba Chatterjee, Himank Chaudhary
9 min read
--
Uber
Advanced
The article discusses Uber's journey towards metric standardization through the development of uMetric, a unified internal metric platform.
Xiaodong Wang, Wenrui Meng, Will Yu, Yun Wu
13 min read
--
Uber
Advanced
Uber's Real-Time Push Platform focuses on enhancing user experiences by transitioning from polling to a gRPC-based bi-directional streaming protocol.
Uday Kiran Medisetty, Nilesh Mahajan, Anirudh Raja, Madan Thangavelu
19 min read
--
Uber
Intermediate
The article discusses Uber's development of uWorc, a no-code workflow orchestrator designed to simplify the creation of batch and streaming data pipelines.
Sandeep Karmakar, Sriharsha Chintalapani
11 min read
--
Uber
Advanced
The article discusses Uber's Databook, an in-house platform designed to manage and surface metadata related to various data entities.
Sunheng Taing, Atul Gupte
25 min read
--
Uber
Advanced
The article discusses Fiber, a distributed computing library developed by Uber that simplifies large-scale computation for AI and machine learning applications.
Jiale Zhi, Rui Wang, Jeff Clune, Kenneth O. Stanley
17 min read
--
Uber
Intermediate
The article discusses Uber's implementation of multi-tenancy within its microservice architecture, highlighting its benefits for stability, modularity, and developer velocity.
--
Uber
Intermediate
The article highlights the contributions of women engineers at Uber in celebration of International Women’s Day.
Molly Vorwerck, Wayne Cunningham, Stan Yee, Lucy
23 min read
--
Uber
Advanced
The article discusses how Uber engineered SQL support on Apache Pinot, enhancing real-time analytics capabilities for their Big Data stack.
Haibo Wang
16 min read
--
Uber
Advanced
The article discusses Uber's advancements in data infrastructure during 2019, focusing on how data science was leveraged to optimize performance and manage vast amounts of data.
Nikhil Joshi, Viv Keswani
6 min read
--
Uber
Intermediate
The article discusses Uber's open-source orchestration tool, Cadence, which facilitates seamless communication and coordination among microservices in a scalable and fault-tolerant manner.
Lucy
4 min read
--
Uber
Intermediate
The article discusses how Uber has implemented the Uber Spark Compute Service (uSCS) to simplify the use of Apache Spark across its extensive infrastructure.
Abhishek Modi, Adam Hudson
15 min read
--
Uber
Intermediate
The article discusses Uber's approach to consistent data partitioning using a Global Index for managing large Apache Hadoop tables.
Nishith Agarwal, Kaushik Devarajaiah
12 min read
--
Uber
Advanced
DBEvents is a change data capture system developed by Uber to efficiently ingest data into their Apache Hadoop data lake.
Nishith Agarwal, Ovais Tariq
18 min read
--
Uber
Advanced
The article discusses how Uber employs machine learning to ensure the capacity safety of individual microservices, addressing challenges related to predicting service-level capacity requirements.
Ranjib Dey, Shrey Desai, Ruogu Du
13 min read
--
Uber
Advanced
The article discusses Uber's approach to building a scalable observability and alerting ecosystem, highlighting the development of two key systems, uMonitor and Neris, which handle metrics-based an...
Shreyas Srivatsan
12 min read
--
Uber
Advanced
The article discusses the evolution and scaling of Uber's machine learning platform, Michelangelo, highlighting its development, deployment, and operational strategies.
--
Uber
Advanced
The article discusses Peloton, Uber's unified resource scheduler designed to manage diverse cluster workloads efficiently.
--
Uber
Advanced
The article highlights seven open source projects showcased at the Uber Open Summit, emphasizing Uber's contributions to the open source community.
--
Uber
Advanced
The article highlights the Uber Amsterdam Tech Team's focus on transforming payment systems and empowering developers.
Morgan Sebree
19 min read
--
Uber
Intermediate
The article features an interview with Yuri Shkuro, the Jaeger Project Lead at Uber, discussing the development and open sourcing of Jaeger, a distributed tracing system.
Molly Vorwerck
12 min read
--
Uber
Intermediate
The article discusses Uber's implementation of a one-click chat feature (OCC) designed to enhance communication between drivers and riders.
Yue Weng, Huaixiu Zheng, Anwaya Aras, Franziska Bell
9 min read
--
Uber
Advanced
Marmaray is an open-source data ingestion and dispersal framework designed for Apache Hadoop, enabling Uber to manage large volumes of data efficiently.
Danny Chen, Omkar Joshi
18 min read
--
Uber
Advanced
The article discusses Uber's Databook platform, which transforms big data into actionable knowledge by managing metadata.
Luyao Li, Kaan Onuk, Lauren Tindal
12 min read
--
Uber
Intermediate
The article discusses how Uber enhances the quality of its maps through metrics computation, emphasizing the importance of accurate and up-to-date map data for improving user experience.
Ines Viskic
12 min read
--
Uber
Advanced
This article discusses how Uber leverages data science and machine learning to enhance financial forecasting and planning.
Chunyan Song
19 min read
--
Uber
Advanced
The article introduces Michelangelo, Uber's internal machine learning platform designed to democratize machine learning and streamline the process of building, deploying, and operating ML solutions...
--
Uber
Beginner
The article discusses Uber Engineering's XP Background Push, a tool designed to enhance the reliability of mobile applications by enabling real-time bug fixes and feature management.
AJ Ribeiro
7 min read
--
Uber
Intermediate
The article discusses the redesign of Uber Engineering's mobile content delivery ecosystem, focusing on the technical challenges and solutions developed for a new content feed and backend system.
Alex Forsythe, Denis Haenikel, Minje Zha
17 min read
--