Modernizing Home Feed Pre-Ranking Stage

Pinterest Engineering
8 min readintermediate
--
View Original

Overview

The article discusses the modernization of Pinterest's home feed pre-ranking stage, focusing on the introduction of a sophisticated pre-ranking layer known as Lightweight Scoring. It highlights foundational improvements in system design and model architecture that enhance user engagement and streamline the recommendation process.

What You'll Learn

1

How to implement a joint training approach for request and item level models

2

Why early funnel logging is critical for effective model training

3

How to optimize model architecture for better user-item interaction

4

When to apply auto-retraining frameworks for improved recommendation timeliness

Prerequisites & Requirements

  • Understanding of recommendation systems and machine learning concepts
  • Experience with model training and deployment in production environments(optional)

Key Questions Answered

What are the limitations of the initial pre-ranking design?
The initial design faced challenges such as the need for separate services for model iterations, difficulties in model auto-retraining synchronization, and limitations in learning user-item interactions effectively due to a two-tower architecture. These issues hindered the model's ability to leverage powerful features like user action sequences.
How does the new model architecture improve user engagement?
The new model architecture allows for joint training of request and item level components, enhancing user-item interaction. This architecture, combined with early funnel logging, enables the model to utilize more representative negative samples, leading to improved performance and user engagement metrics.
What is the purpose of the logging pipeline in the pre-ranking model?
The logging pipeline is designed to collect unbiased training data by combining early funnel candidate data with final impression data. This approach addresses sample selection bias and ensures that the training data aligns closely with the serving data, enhancing the model's effectiveness.
What are the benefits of the root-leaf architecture in serving?
The root-leaf architecture optimizes the serving process by sharding items and assigning specific leaf hosts to handle certain item shards. This design improves cache hit rates and reduces infrastructure costs, enabling efficient real-time feature utilization while managing larger raw item feature sizes.

Key Statistics & Figures

Improvement in user engagement
Significant
This improvement was observed following the launch of the new pre-ranking model.

Key Actionable Insights

1
Implementing a joint training approach can significantly enhance the performance of recommendation models.
By training request and item level components together, you can leverage more comprehensive user-item interactions, which is crucial for improving engagement metrics.
2
Early funnel logging is essential for training effective models in recommendation systems.
This practice allows for the collection of more representative negative samples, which can lead to better model performance and more accurate recommendations.
3
Adopting a root-leaf architecture can optimize your serving infrastructure.
This architecture helps manage large datasets efficiently, improving cache hit rates and reducing costs associated with serving real-time features.

Common Pitfalls

1
Relying solely on a two-tower architecture can limit the model's ability to learn user-item interactions effectively.
This architecture typically performs late fusion, which reduces the model's capability to leverage powerful features such as user action sequences. Transitioning to a more integrated model design can alleviate these limitations.

Related Concepts

Recommendation Systems
Machine Learning Model Training
Data Logging Techniques