Netflix logo

How Netflix Uses Apache Spark

14 engineering articles about Apache Spark from Netflix's engineering team

Articles

Filter:
Netflix logo
Netflix
Advanced
The article discusses how Netflix scales its Muse application to provide data-driven creative insights at a massive scale, focusing on the architectural evolution and optimizations made to handle t...
Netflix Technology Blog
10 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses how Netflix supports a diverse range of machine learning (ML) systems through its Machine Learning Platform (MLP) and the Metaflow framework.
Netflix logo
Netflix
Intermediate
Netflix recently hosted its first Data Engineering Summit, bringing together engineers to share insights on data processing patterns and building reliable data pipelines.
Netflix Technology Blog
3 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the implementation of sample data pipelines using Dataflow at Netflix, focusing on bootstrapping, standardization, and automation of batch data pipelines.
Netflix Technology Blog
17 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses Netflix's auto-diagnosis and remediation system, Pensive, which addresses failures in their complex data platform.
Netflix Technology Blog
7 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The Netflix Cosmos Platform is a computing framework that integrates microservices, asynchronous workflows, and serverless functions to handle resource-intensive algorithms and complex workflows.
Netflix Technology Blog
13 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article announces the open-source launch of Polynote, a polyglot notebook designed for data scientists and machine learning researchers.
Netflix Technology Blog
12 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the infrastructure for Contextual Bandits and Reinforcement Learning, highlighting insights from a meetup hosted at Netflix.
Netflix Technology Blog
11 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Netflix's reimagined experimentation analysis infrastructure, focusing on how data scientists can now contribute more effectively to A/B testing through a modular architecture.
Netflix Technology Blog
9 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Netflix's use of Apache Spark for enhancing its recommendation systems, detailing three key projects presented at the Spark+AI Summit 2018.
Netflix Technology Blog
5 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses Archer, a platform developed by Netflix to simplify media processing and innovation.
Netflix Technology Blog
9 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses Netflix's innovative approach to feature generation using a system called DeLorean, which allows for distributed time travel to generate features from historical data.
Netflix Technology Blog
18 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses Netflix's implementation of automated outlier detection to identify unhealthy servers within its extensive infrastructure.
Netflix Technology Blog
8 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article explores the resiliency of Spark Streaming in the context of Netflix's use of Chaos Monkey to simulate failures in their AWS cloud environment.
Netflix Technology Blog
5 min read
Has Summary
--

You've reached the end! All 14 articles loaded.