Pinterest logo

How Pinterest Uses PySpark

10 engineering articles about PySpark from Pinterest's engineering team

Articles

Filter:
Pinterest logo
Pinterest
Advanced
The article reflects on a decade of AI platform development at Pinterest, detailing the evolution from fragmented machine learning stacks to a unified AI platform that supports various models.
Pinterest logo
Pinterest
Advanced
The article discusses Pinterest's transition to Moka, a next-generation data processing platform built on AWS Elastic Kubernetes Service (EKS).
Pinterest logo
Pinterest
Advanced
This article discusses Pinterest's transition from a Hadoop-based platform to a Kubernetes-based data processing solution named Moka.
Pinterest Engineering
19 min read
Includes Code
Has Summary
--
Pinterest logo
Pinterest
Intermediate
The article discusses the integration of Ray infrastructure at Pinterest, detailing the journey, challenges, and solutions implemented to optimize machine learning workflows.
Pinterest logo
Pinterest
Intermediate
The article discusses how Pinterest improved its machine learning (ML) dataset iteration speed using Ray, an open-source framework for scaling AI and ML workloads.
Pinterest Engineering
9 min read
Has Summary
--
Pinterest logo
Pinterest
Intermediate
This article discusses Pinterest's implementation of a finer-grained access control (FGAC) framework to manage data access securely and efficiently within their data engineering platform.
Pinterest Engineering
18 min read
Has Summary
--
Pinterest logo
Pinterest
Advanced
This article discusses Pinterest's Batch Processing Platform, Monarch, focusing on efficient resource management to ensure quality of service (QoS) while maintaining cost efficiency.
Pinterest Engineering
16 min read
Has Summary
--
Pinterest logo
Pinterest
Intermediate
The article discusses how Pinterest employs machine learning to combat spam and harmful content on its platform.
Pinterest Engineering
5 min read
Has Summary
--
Pinterest logo
Pinterest
Intermediate
The article discusses how Pinterest employs machine learning to combat misinformation, hate speech, and self-harm content on its platform.
Pinterest Engineering
7 min read
Has Summary
--
Pinterest logo
Pinterest
Advanced
The article discusses how Pinterest empowered its data scientists and machine learning engineers by building a PySpark infrastructure that addresses challenges faced with existing tools like Hive a...

You've reached the end! All 10 articles loaded.