How Pinterest Uses Python
56 engineering articles about Python from Pinterest's engineering team
Other Pinterest Technologies
Other Companies Using Python
Articles
Filter:
This article discusses how Pinterest successfully reduced Android testing build times by over 36% through the implementation of a runtime-aware sharding mechanism.
The article reflects on a decade of AI platform development at Pinterest, detailing the evolution from fragmented machine learning stacks to a unified AI platform that supports various models.
AutoMLDockerEmbeddingGenerative AIJavaKubernetesLightGBMPySparkPythonPyTorchSeedSQLTensorFlowThriftTransformer
Pinterest Engineering
22 min read
Has Summary
--
The article discusses Pinterest's transition to Moka, a next-generation data processing platform built on AWS Elastic Kubernetes Service (EKS).
Pinterest Engineering
16 min read
Has Summary
--
The article discusses how Pinterest scaled its machine learning infrastructure using Ray, extending its capabilities beyond training to include feature development, sampling, and labeling.
This article discusses how implementing the Lightning Memory-Mapped Database (LMDB) improved memory management and performance for Pinterest's API service.
The article discusses the implementation and operational benefits of the Unified PubSub Client (PSC) at Pinterest, highlighting improvements in developer velocity, stability, and scalability.
Pinterest Engineering
11 min read
Includes Code
Has Summary
--
This article discusses the integration of Python-based anomaly detection algorithms into Pinterest's Warden platform, originally built in Java.
The article discusses how Pinterest improved its machine learning (ML) dataset iteration speed using Ray, an open-source framework for scaling AI and ML workloads.
Pinterest Engineering
9 min read
Has Summary
--
The article discusses the development of a unified PubSub client library at Pinterest, aimed at improving the scalability, stability, and developer velocity of the Logging Platform.
Pinterest Engineering
12 min read
Includes Code
Has Summary
--
The article discusses the critical issue of Python dependency confusion at Pinterest, highlighting the security risks associated with software supply chains.
The article outlines the engineering interview process at Pinterest, emphasizing a positive candidate experience and the importance of showcasing real-world problem-solving skills.
Pinterest Engineering
14 min read
Has Summary
--
The article discusses how Pinterest leverages machine learning to foster a healthy comment ecosystem on its platform.
The article discusses SearchSage, a search query representation model developed by Pinterest to enhance the relevance of search results and user engagement.
Pinterest Engineering
14 min read
Has Summary
--
The article discusses how Pinterest utilizes machine learning to maintain a positive comment ecosystem amidst a growing creator community.
Pinterest Engineering
7 min read
Has Summary
--
The article discusses SearchSage, a search query representation developed at Pinterest to enhance search retrieval and ranking.
Computer VisionEmbeddingModalNatural Language ProcessingPythonPyTorchTensorFlowTransformerTransformers
Pinterest Engineering
11 min read
Has Summary
--
The article discusses how Pinterest utilizes Apache Spark SQL for interactive querying, detailing the architecture, challenges faced, and solutions implemented to enhance user experience.
Pinterest Engineering
18 min read
Has Summary
--
The article discusses the importance of quality and testing of advertising metrics on mobile devices at Pinterest.
The article discusses Pinterest's efforts to develop fast and reliable iOS builds by optimizing their build process and adopting Bazel as their primary build tool.
Pinterest Engineering
10 min read
Has Summary
--
Pinterest has open sourced Querybook, a collaborative big data hub designed to improve data access and analysis for teams, especially in a remote working environment.
Pinterest Engineering
7 min read
Has Summary
--
The article discusses Guardian, a real-time analytics and rules engine developed by Pinterest's Trust & Safety team to combat spam.
The article discusses how Pinterest empowered its data scientists and machine learning engineers by building a PySpark infrastructure that addresses challenges faced with existing tools like Hive a...
Pinterest Engineering
7 min read
Has Summary
--
The article discusses the concept of 'Pinterest Paths', which describes the exploration behavior of users on Pinterest as they navigate through related ideas.
The article discusses the importance of ad metrics quality and testing on mobile platforms at Pinterest.
The article discusses the open-sourcing of Singer, Pinterest's logging agent designed for efficient and reliable data collection from numerous hosts.
Pinterest Engineering
3 min read
Has Summary
--
The article discusses how Pinterest improved the speed and reliability of its iOS builds by adopting Bazel as their build system.
Pinterest Engineering
8 min read
Has Summary
--
The article discusses the interview process at Pinterest, emphasizing a positive candidate experience that reflects the company's values and culture.
Pinterest Engineering
13 min read
Has Summary
--
The article discusses the importance of API profiling at Pinterest, focusing on optimizing the API Gateway service through memory profiling, operational cost calculations, and dead code detection.
Pinterest Engineering
9 min read
Has Summary
--
The article discusses the technological framework that supports Pinterest's growth, detailing the company's mission to help users discover and do things they love.
Pinterest Engineering
8 min read
Has Summary
--
The article discusses the challenges Pinterest faced with their Apache Thrift schemas, which had become tightly coupled and complex, leading to inefficiencies in development cycles.
Pinterest has announced the open-sourcing of ptracer, a syscall-tracing library for Python that enhances the efficiency and reliability of their large Python codebase.
Pinterest Engineering
6 min read
Has Summary
--
The article discusses the challenges of managing Python code at scale across multiple repositories at Pinterest and introduces the solution of a Python monorepo called Python Commons.
The article discusses the implementation and functionality of the Pintrace Trace Analyzer, a tool developed by Pinterest for analyzing distributed trace data.
Pinterest Engineering
4 min read
Has Summary
--
The article discusses how Pinterest improved its continuous integration (CI) process for iOS development by adopting Nix and Buildkite, moving away from Jenkins.
The article discusses the applications of Pintrace data at Pinterest, highlighting its role in improving backend service latency and debugging.
The article discusses Pinterest's implementation of distributed tracing using their open-source tool, Pintrace, which tracks requests across microservices in their backend.
Pinterest Engineering
9 min read
Has Summary
--
The article discusses Pinterest's migration from a legacy template rendering engine to React, highlighting the performance improvements and developer efficiency gained through this transition.
The article discusses how Pinterest engineers leveraged existing tools from VOTE. org to encourage voter participation by embedding interactive iframes in Pins.
Pinterest Engineering
4 min read
Includes Code
Has Summary
--
The article discusses the evolution of MySQL compression techniques at Pinterest, particularly focusing on the challenges faced with InnoDB page compression and the development of a new column comp...
The article discusses the challenges of replicability in data science, highlighting a case study at Pinterest where nine data scientists provided differing results to the same question.
Pinterest Engineering
7 min read
Has Summary
--
The article discusses Pinterest's migration of its web framework from Backbone to React, focusing on the challenges and techniques used during the transition, particularly for Pinner profiles.
Pinterest Engineering
3 min read
Includes Code
Has Summary
--
This article discusses the implementation details of Tracker, a system designed for ingesting MySQL data at scale, focusing on the improvements made to the data upload process to S3.
The article discusses Pinterest's approach to ingesting large volumes of MySQL data at scale through a system called Tracker.
The article discusses the implementation of a real-time metrics dashboard for monitoring A/B experiments at Pinterest.
Pinterest Engineering
5 min read
Has Summary
--
The article discusses the development of Pinterest's A/B testing platform, highlighting the need for a reliable and user-friendly system to manage a large volume of experiments.
The article discusses Pinterest's internal web service designed to visualize unstructured data, helping to identify trends and user engagement patterns.
The article discusses the open-sourcing of KingPin, a toolset developed by Pinterest to enhance scalability and reliability in their infrastructure.
The article discusses Pinterest's approach to automating human evaluation for content relevance and safety through crowdsourcing.
Pinterest Engineering
6 min read
Has Summary
--
The article introduces new open-source tools for the Elixir community, highlighting the advantages of Elixir and Erlang in handling concurrency and errors.
Skyline is an ETL-as-a-Service platform developed by Pinterest to streamline data processing and reporting for its users.
Pinterest has open-sourced Pinrepo, an artifact repository designed to efficiently store and serve build artifacts while addressing scalability and reliability challenges.