How Spotify Uses Scala
11 engineering articles about Scala from Spotify's engineering team
Articles
This article continues the exploration of Spotify's data platform, detailing its building blocks, scalability, and the community-driven approach to managing a complex data ecosystem.
Anastasia Khlebnikova (Senior Engineer) and Carol Cunha (Product Manager)
6 min read
Has Summary
--
Spotify has introduced Voyager, a new nearest-neighbor search library that significantly improves upon its predecessor, Annoy, by offering increased speed and accuracy.
Peter Sobot
4 min read
Includes Code
Has Summary
--
This article discusses Spotify's migration of its Event Delivery Infrastructure (EDI) to Google Cloud Platform (GCP), detailing the challenges faced, solutions implemented, and the resulting improv...
Flavio Santos (Data Infrastructure Engineer) and Robert Stephenson (Senior Product Manager)
14 min read
Has Summary
--
This article discusses how Spotify optimized its largest Dataflow job for Wrapped 2020 by implementing Sort Merge Bucket (SMB) joins, significantly reducing costs and improving performance.
Neville Li
11 min read
Has Summary
--
The article discusses Spotify's 'Listening Together' campaign, which visualizes real-time musical connections among users worldwide.
Gandalf Hernandez
4 min read
Has Summary
--
The article discusses Spotify's journey in improving its Machine Learning infrastructure using TensorFlow Extended (TFX) and Kubeflow.
Apache, Caching, Docker, Google Cloud, HTML, Kubernetes, Machine Learning, MySQL, Scala, SQL, TensorFlow, Terraform, Transformer, XGBoost
Josh Baer
13 min read
Has Summary
--
This article covers Scio 0.7, a release of Scio, the Scala API for Apache Beam and Google Cloud Dataflow designed to simplify large-scale data processing for Spotify engineers.
Claire McGinty
12 min read
Includes Code
Has Summary
--
This article delves into Scio, a Scala API for Apache Beam and Google Cloud Dataflow, highlighting its unique features, basic concepts, and practical use cases at Spotify.
Neville Li
7 min read
Includes Code
Has Summary
--
This article discusses Spotify's transition to Google Cloud and the development of Scio, a Scala API for Apache Beam, which facilitates big data processing.
Neville Li
9 min read
Has Summary
--
The article discusses Spotify's approach to reliably exporting Cloud Pub/Sub streams to Cloud Storage, detailing the architecture and processes involved in handling over 100 billion events generate...
--
The article discusses how Spotify processes vast amounts of user-generated data using Apache Crunch on Hadoop.
davidawhiting
7 min read
Includes Code
Has Summary
--