We’ve improved personalized video recommendations on Facebook Reels by moving beyond metrics such as likes and watch time and directly leveraging user feedback. Our new User True Interest Survey (…
Overview
Meta describes their User True Interest Survey (UTIS) model for Facebook Reels, which moves beyond traditional engagement metrics like likes and watch time to directly leverage user feedback via in-app surveys. The approach uses a lightweight alignment model trained on survey responses to predict user satisfaction, resulting in significant improvements in recommendation accuracy, engagement, and retention across large-scale A/B tests with over 10 million users.
What You'll Learn
Why traditional engagement signals like likes and watch time fail to capture true user interests in recommendation systems
How to design and deploy large-scale in-product user surveys to collect direct interest feedback
How to build a lightweight alignment model layer on top of an existing ranking system using survey responses
How to integrate a user interest model at both early-stage retrieval and late-stage ranking in a recommendation funnel
How to evaluate recommendation quality improvements using both offline metrics and large-scale A/B testing
Prerequisites & Requirements
- Understanding of recommendation systems and ranking models (multi-task, multi-label architectures)
- Familiarity with machine learning concepts including knowledge distillation, binary classification, and A/B testing
- Understanding of information retrieval metrics such as precision, recall, and accuracy
- Experience with large-scale ML systems and ranking pipelines(optional)
Key Questions Answered
Why are engagement signals like likes and watch time insufficient for recommendation systems?
How does Meta's UTIS model work to improve Facebook Reels recommendations?
How accurate were previous heuristic methods at identifying user interests compared to UTIS?
How is the UTIS model integrated into Facebook Reels' ranking pipeline?
What were the real-world results of deploying the UTIS model in Facebook Reels?
How does Meta collect user perception data for training the UTIS model?
What challenges remain for survey-driven recommendation models?
Key Statistics & Figures
Technologies & Tools
Key Actionable Insights
1Supplement implicit engagement signals with direct user feedback surveys to capture true interest. Engagement metrics like watch time and likes are noisy proxies that only achieved 48.3% precision at identifying true user interests, meaning over half of inferred interests were wrong.This is particularly impactful for recommendation systems where short-term engagement optimization diverges from long-term user satisfaction and retention.
2Design a lightweight alignment model that sits on top of your existing ranking system rather than rebuilding from scratch. The UTIS model uses existing main model predictions as input features, making it efficient to train on sparse survey data while leveraging the full power of the existing ranking infrastructure.This 'perception layer' architecture enables rapid iteration on user feedback signals without disrupting the core ranking model that handles engagement prediction.
3Binarize survey responses to simplify modeling and reduce noise variance. Rather than predicting the exact 1-5 scale rating, converting to binary interest/no-interest labels makes the model more robust and easier to train on sparse survey data.Survey data is inherently sparse since only a small proportion of viewing sessions include surveys, so denoising strategies are critical for model quality.
4Apply user interest models at multiple stages of the recommendation funnel—both retrieval and final ranking—for compounding improvements. Using UTIS in early-stage retrieval to source better candidates and in late-stage ranking to fine-tune final scores provides benefits at each stage.In retrieval, UTIS reconstructs true interest profiles and uses knowledge distillation to align retrieval models. In ranking, it provides an additional input feature to the value formula.
5Weight survey responses to correct for sampling and nonresponse bias before using them as training data. Raw survey responses can be systematically biased by who chooses to respond, so statistical correction is essential for building a dataset that accurately reflects real user preferences.This is especially important when surveys are deployed at scale across diverse user populations with varying response propensities.
6Consider interest matching dimensions beyond simple topic alignment, including audio, production style, mood, and motivation. True user interest is multidimensional, and narrow topic-matching heuristics miss important factors that drive genuine satisfaction.This broader view of interest matching helps surface high-quality niche content rather than defaulting to generic popularity-based recommendations.