How Spotify Uses Cassandra
17 engineering articles about Cassandra from Spotify's engineering team
Other Spotify Technologies
Other Companies Using Cassandra
Articles
Filter:
This article discusses Spotify's transition to a declarative infrastructure model using Kubernetes, enabling efficient management of cloud resources across numerous services.
AnsibleApacheApache KafkaCassandraDockerElasticsearchGoogle CloudJSONKubernetesMemcachedPostgreSQLPuppetTerraformTypeScriptYAML
David Flemström
11 min read
Includes Code
Has Summary
--
The article discusses Spotify's approach to user privacy through a centralized encryption system called Padlock, which manages user data encryption keys.
Bram Leenders
12 min read
Has Summary
--
The article introduces cstar, an open-source Cassandra orchestration tool developed by Spotify to simplify the management of large Cassandra clusters.
This article delves into Scio, a Scala API for Apache Beam and Google Cloud Dataflow, highlighting its unique features, basic concepts, and practical use cases at Spotify.
Neville Li
7 min read
Includes Code
Has Summary
--
This article discusses Spotify's transition to Google Cloud and the development of Scio, a Scala API for Apache Beam, which facilitates big data processing.
Neville Li
9 min read
Has Summary
--
The article discusses the evolution of music personalization at Spotify, highlighting the transition from a small team to multiple teams working on machine learning services.
Spotify Engineering
5 min read
Has Summary
--
This article discusses the metrics and methodologies for measuring and optimizing latency in load balancing systems, specifically focusing on the ELS (Elastic Load Balancer) at Spotify.
Lukáš Poláček
7 min read
Has Summary
--
The article discusses the Expected Latency Selector (ELS), a probabilistic load balancer developed by Spotify to optimize server response times by weighing machines based on their performance metri...
Lukáš Poláček
10 min read
Has Summary
--
The article introduces Heroic, Spotify's in-house scalable time series database designed to handle near real-time data collection and presentation at scale.
John-John Tedro
3 min read
Has Summary
--
The article discusses Spotify's use of Cassandra for data-driven configuration, emphasizing the importance of load testing and capacity planning for performance optimization.
The article discusses common Java linking problems, particularly focusing on runtime errors like NoSuchMethodError and NoClassDefFoundError that arise from dependency management issues.
The article discusses Spotify's transition from a PostgreSQL database to a Cassandra database for user data management.
Marcus Vesterlund
10 min read
Has Summary
--
The article discusses how Spotify utilizes Apache Cassandra to enhance user personalization by analyzing real-time and historical data.
The article discusses how Spotify scales its real-time data processing pipelines using Apache Storm, focusing on architecture, maintainability, and performance optimization.
The article discusses the Date-Tiered Compaction Strategy (DTCS) developed for Apache Cassandra, particularly for optimizing time series data storage and retrieval.
The article discusses Spotify's evolving backend infrastructure, emphasizing the importance of autonomous squads, a transparent code model, and self-service infrastructure to support rapid growth a...
Spotify Engineering
9 min read
Has Summary
--
The article discusses Spotify's reliance on mature technologies for its backend architecture, emphasizing the benefits of using proven tools like PostgreSQL and DNS for service discovery.
You've reached the end! All 17 articles loaded.