How Pinterest fights misinformation, hate speech, and self-harm content with machine learning

Pinterest Engineering

•

Pinterest Engineering

•7 min read•intermediate•

--

•View Original

JavaMachine LearningPandasPySparkSQLTensorFlow

Overview

The article discusses how Pinterest employs machine learning to combat misinformation, hate speech, and self-harm content on its platform. It highlights the effectiveness of their models in reducing policy-violating content and outlines the various machine learning techniques and systems used to ensure a safe and inspiring environment for users.

What You'll Learn

1

How to implement machine learning models to detect harmful content on social platforms

2

Why proactive content moderation is essential for user safety

3

How to utilize image-signatures for content enforcement

Prerequisites & Requirements

Understanding of machine learning concepts and models
Familiarity with Spark and TensorFlow(optional)

Key Questions Answered

How does Pinterest use machine learning to combat misinformation?

Pinterest utilizes machine learning models to automatically detect and filter harmful content, resulting in a 52% decline in policy-violating content reports per impression since the technology was introduced. This proactive approach allows them to identify and remove harmful content before it is reported by users.

What are the key machine learning models used by Pinterest?

Pinterest employs several machine learning models, including a Pin batch model and an online model, which generate scores for content safety based on PinSage embeddings and image text extracted via Optical Character Recognition (OCR). These models help in identifying policy violations across various categories.

What impact has machine learning had on self-harm content reports?

Since the introduction of machine learning technology in April 2019, reports for self-harm content on Pinterest have decreased by 80%. This significant reduction highlights the effectiveness of their proactive content moderation strategies.

How does Pinterest enforce policies on similar images?

Pinterest groups similar images together using a unique hash called image-signature. Machine learning models generate scores for each image-signature, allowing Pinterest to apply consistent enforcement decisions across all Pins with the same signature based on these scores.

Key Statistics & Figures

Decline in policy-violating content reports per impression

52%

This decline has occurred since the introduction of machine learning technology in the fall of 2019.

Decrease in self-harm content reports

80%

This reduction has been observed since April 2019 with the implementation of machine learning models.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Backend

Spark

Used for inference and scoring of the entire corpus of billions of Pins.

Backend

Tensorflow

Utilized in the online model for real-time scoring of new Pins.

Backend

Pinsage

Provides embeddings for Pins, which are crucial for the scoring models.

Key Actionable Insights

1
Implementing machine learning models for content moderation can significantly reduce harmful content on platforms.
By proactively detecting violations, platforms like Pinterest can create a safer user experience, as evidenced by their 52% reduction in policy-violating content reports.

2
Utilizing image-signatures can enhance the efficiency of content enforcement.
This method allows for consistent policy enforcement across similar images, ensuring that harmful content is swiftly identified and removed.

3
Regularly updating machine learning models is crucial to adapt to evolving types of harmful content.
As misinformation and harmful content trends change, continuous improvement of detection models helps maintain user safety and trust.

Common Pitfalls

1

Failing to continuously update machine learning models can lead to outdated detection capabilities.

As harmful content evolves, models that are not regularly updated may struggle to identify new types of violations, resulting in increased risks for users.

Related Concepts

Machine Learning In Content Moderation

Proactive Content Enforcement Strategies

Image Recognition And Analysis Techniques