Combining LinkedIn’s Content Filtering and Microsoft Cognitive Services to Keep Inappropriate Content Off Our Sites

Rushi Bhatt
6 min readadvanced
--
View Original

Overview

The article discusses the integration of LinkedIn's Universal Content Filtering (UCF) platform with Microsoft's Content Moderator service to enhance the detection of inappropriate content on LinkedIn. It highlights the benefits of combining human and machine intelligence to improve content classification accuracy and the collaborative efforts between LinkedIn and Microsoft teams.

What You'll Learn

1

How to integrate content moderation systems using APIs

2

Why combining classifiers improves content filtering accuracy

3

How to ensure model behavior consistency across different frameworks

Prerequisites & Requirements

  • Understanding of content moderation concepts and machine learning classifiers
  • Familiarity with TensorFlow and Microsoft Cognitive Toolkit (CNTK)(optional)

Key Questions Answered

How does LinkedIn integrate Content Moderator with its UCF platform?
LinkedIn has created a bi-directional bridge between its Universal Content Filtering (UCF) platform and Microsoft's Content Moderator. This integration allows for real-time classification of images and text posted on LinkedIn, enhancing the detection of inappropriate content while preserving member privacy.
What are the benefits of combining LinkedIn's classifiers with Content Moderator?
Combining LinkedIn's classifiers with Content Moderator enhances the volume of inappropriate content that can be classified accurately. This integration improves both recall and precision, ensuring that more inappropriate content is caught while minimizing false positives.
What challenges were faced during the integration of TensorFlow and CNTK?
One major challenge was ensuring that the model behavior remained consistent after converting models from TensorFlow to CNTK. This required thorough testing to confirm that the same inputs produced identical outputs across both frameworks, particularly in preprocessing steps.

Technologies & Tools

Some links below are affiliate links. We may earn a commission if you make a purchase.

Backend
Content Moderator
Used for detecting inappropriate user-generated content through machine-assisted scanning.
Backend
Microsoft Cognitive Toolkit (cntk)
Serves as the backbone for classification in the Cognitive Services.
Backend
Tensorflow
Used by LinkedIn to create deep learning models for content classification.

Key Actionable Insights

1
Implement a bi-directional API integration between content moderation systems to enhance content filtering capabilities.
This integration allows for real-time classification of user-generated content, improving the overall quality and safety of the platform.
2
Utilize a combination of human and machine intelligence for effective content moderation.
By leveraging both classifiers, platforms can achieve higher accuracy in detecting inappropriate content, which is crucial for maintaining a professional environment.
3
Ensure thorough testing when converting machine learning models between different frameworks.
Testing is essential to maintain model performance and avoid discrepancies in classification results, which can impact user experience.

Common Pitfalls

1
Failing to ensure consistency in model behavior across different machine learning frameworks can lead to inaccurate classification results.
This issue often arises from differences in preprocessing steps and model specifications, which can significantly affect the output scores.

Related Concepts

Content Moderation
Machine Learning Classifiers
API Integration
Real-time Content Filtering