Netflix logo

How Netflix Uses SQL

51 engineering articles about SQL from Netflix's engineering team

Articles

Filter:
Netflix logo
Netflix
Advanced
This article discusses how Netflix built a resilient data platform using a Write-Ahead Log (WAL) to address data consistency, reliability, and operational efficiency challenges at scale.
Netflix Technology Blog
15 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the evolution of Netflix's Tudum architecture, transitioning from a CQRS model utilizing Kafka to a more efficient system based on RAW Hollow.
Netflix Technology Blog
8 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses Netflix's Unified Data Architecture (UDA), which aims to streamline data modeling across its various platforms by allowing teams to define business concepts once and use them ...
Netflix Technology Blog
18 min read
Has Summary
--
Netflix logo
Netflix
Beginner
The article discusses Netflix's approach to enhancing cloud efficiency through the use of Amazon Web Services (AWS) and a structured data framework.
Netflix Technology Blog
6 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
This article is the first in a multi-part series that explores the Analytics Engineering work at Netflix, highlighting how the company empowers its teams to produce and deliver actionable analytic ...
Netflix Technology Blog
8 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Maestro, a horizontally scalable workflow orchestrator developed by Netflix for managing large-scale Data/ML workflows.
Netflix Technology Blog
22 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
Netflix recently hosted its first Data Engineering Summit, bringing together engineers to share insights on data processing patterns and building reliable data pipelines.
Netflix Technology Blog
3 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the implementation of incremental processing at Netflix using Netflix Maestro and Apache Iceberg.
Netflix Technology Blog
19 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the implementation of Streaming SQL within Netflix's Data Mesh framework, highlighting how it democratizes stream processing by allowing users to express complex data transfor...
Netflix Technology Blog
8 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses the development of a data reprocessing pipeline within Netflix's Asset Management Platform (AMP), designed to efficiently manage and update digital media assets' metadata.
Netflix Technology Blog
9 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the implementation of sample data pipelines using Dataflow at Netflix, focusing on bootstrapping, standardization, and automation of batch data pipelines.
Netflix Technology Blog
17 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Netflix Maestro, a next-generation workflow orchestrator designed to manage data and machine learning workflows at scale.
Netflix Technology Blog
15 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
This article discusses how Netflix's Content Engineering team has implemented a federated graph search system, focusing on the querying capabilities of Studio Search.
Netflix Technology Blog
8 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the management of data pipeline assets at Netflix using a tool called Dataflow.
Netflix Technology Blog
15 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses the development of a reliable Device Management Platform at Netflix, focusing on the integration of a customized embedded computer called the Reference Automation Environment ...
Netflix Technology Blog
18 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses the evolution of data movement at Netflix Studio through the implementation of a Data Mesh architecture.
Netflix Technology Blog
14 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article features an interview with Kevin Wylie, a Data Engineer at Netflix, discussing his journey in data engineering, his experiences at Netflix, and the evolution of his role over the past d...
Netflix Technology Blog
6 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses the challenges and solutions related to achieving observability in asynchronous workflows, particularly within Netflix's Prodicle Distribution service.
Netflix Technology Blog
6 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article provides an in-depth look at the roles and responsibilities of experimentation and causal inference data scientists at Netflix.
Netflix Technology Blog
12 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Netflix's transition from traditional REST APIs to GraphQL microservices (GQLMS) for rapid application development, particularly in response to COVID-19 challenges.
Netflix Technology Blog
9 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
The article 'Mythbusting the Analytics Journey' by Alex Diamond explores common misconceptions about succeeding in analytics within a large tech company like Netflix.
Netflix Technology Blog
6 min read
Has Summary
--
Netflix logo
Netflix
Beginner
The article provides an insightful look into the daily life of a Content Analytics Engineer at Netflix, highlighting the blend of technical skills and collaborative efforts required in the role.
Netflix Technology Blog
7 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Bulldozer, a self-serve data platform developed by Netflix for efficiently moving batch data from data warehouse tables to online key-value stores.
Netflix Technology Blog
9 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article features a Q&A with Netflix data scientists Julie Beckley and Chris Pham, discussing their career paths, exciting projects, and the impact of their work on Netflix's streaming quality.
Netflix Technology Blog
7 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the implementation of Hexagonal Architecture at Netflix to enhance application flexibility and efficiency in managing data sources.
Netflix Technology Blog
10 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses how Netflix utilizes Apache Druid for real-time analytics to enhance user experience.
Netflix Technology Blog
11 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
DBLog is a generic Change-Data-Capture (CDC) framework developed to capture committed changes from databases in real-time and propagate them to downstream consumers.
Netflix Technology Blog
17 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article announces the open-source launch of Polynote, a polyglot notebook designed for data scientists and machine learning researchers.
Netflix Technology Blog
12 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Netflix's reimagined experimentation analysis infrastructure, focusing on how data scientists can now contribute more effectively to A/B testing through a modular architecture.
Netflix Technology Blog
9 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses how Python is utilized across various domains at Netflix, from content delivery to machine learning and data analysis.
Netflix logo
Netflix
Intermediate
The article discusses Netflix's journey in building and scaling a comprehensive data lineage system to enhance data infrastructure reliability and efficiency.
Netflix Technology Blog
9 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the implementation of the Netflix Media Database (NMDB), focusing on its architecture, system requirements, and key components that enable scalability, reliability, and effici...
Netflix Technology Blog
24 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The Keystone Stream Processing Platform is Netflix's infrastructure for real-time data processing, enabling data-driven decision-making across the organization.
Netflix Technology Blog
15 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
This article discusses Netflix's strategic implementation of Jupyter notebooks as a unified development platform for scheduling data workflows.
Netflix Technology Blog
13 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses Netflix's innovative approach to Jupyter notebooks, emphasizing their evolution from niche tools to integral components of the Netflix Data Platform.
Netflix Technology Blog
16 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Intermediate
Netflix's Security Intelligence and Response Team (SIRT) introduces Diffy, a triage tool designed for digital forensics and incident response (DFIR) in cloud environments.
Netflix Technology Blog
6 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Metacat, a metadata service developed by Netflix to enhance the discoverability and management of big data across various data sources.
Netflix Technology Blog
8 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the lessons learned from building observability tools at Netflix, emphasizing the importance of log management, distributed request tracing, metrics analysis, and user interfa...
Netflix Technology Blog
11 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
This article discusses the implementation of Event Sourcing at Netflix for managing downloads, highlighting the architectural pattern's flexibility and robustness.
Netflix Technology Blog
10 min read
Has Summary
--
Netflix logo
Netflix
Beginner
The article discusses Netflix's implementation of the downloads feature on Android, focusing on how it enhances user experience by allowing offline viewing, reducing mobile data usage, and improvin...
Netflix Technology Blog
9 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
This article details Netflix's technical journey in migrating its Billing applications and datastores from a Data Center to AWS Cloud, emphasizing the challenges and strategies employed throughout ...
Netflix Technology Blog
14 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses Mantis, a stream-processing service platform developed by Netflix that processes event streams at a rate of up to 8 million events per second.
Netflix Technology Blog
10 min read
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses Netflix's innovative approach to feature generation using a system called DeLorean, which allows for distributed time travel to generate features from historical data.
Netflix Technology Blog
18 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses the retirement of Astyanax, a Java client library for Apache Cassandra, which was created in 2011.
Netflix Technology Blog
3 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article introduces FIDO (Fully Integrated Defense Operation), an open-source system developed by Netflix for automating the analysis and response to security incidents.
Netflix Technology Blog
6 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Netflix's implementation of Presto within their Big Data Platform on AWS, detailing its architecture, performance, and integration with S3.
Netflix Technology Blog
11 min read
Has Summary
--
Netflix logo
Netflix
Advanced
PigPen is a new map-reduce language designed for Clojure that simplifies the process of writing map-reduce queries.
Netflix Technology Blog
20 min read
Includes Code
Has Summary
--
Netflix logo
Netflix
Advanced
The article discusses the system architectures used by Netflix for personalization and recommendation, emphasizing the integration of online, nearline, and offline computation methods.
Netflix Technology Blog
13 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses Netflix's implementation of Reactive Programming, highlighting the use of Microsoft's Reactive Extensions (Rx) to simplify asynchronous programming.
Netflix Technology Blog
3 min read
Has Summary
--
Netflix logo
Netflix
Intermediate
The article discusses the redesign of the Netflix API, which was initially launched in 2008 to support public developers.
Netflix Technology Blog
5 min read
Has Summary
--