Finding AI-generated (deepfake) faces in the wild

Gonzalo Aniano Porcile, PhD
10 min readintermediate
--
View Original

Overview

The article discusses LinkedIn's efforts to detect AI-generated deepfake faces to ensure authentic interactions on its platform. It highlights the collaboration with academic experts to develop a robust detection model capable of distinguishing between real and synthetic images across various generative algorithms.

What You'll Learn

1

How to differentiate between real and AI-generated images using advanced detection models

2

Why maintaining a low false positive rate is crucial in AI-generated image detection

3

How to leverage diverse datasets for training AI models effectively

Prerequisites & Requirements

  • Understanding of AI and machine learning concepts
  • Experience with deep learning frameworks(optional)

Key Questions Answered

What techniques does LinkedIn use to detect AI-generated images?
LinkedIn employs cutting-edge models developed in collaboration with academic experts to detect AI-generated images. This includes a large-scale detector that can identify images generated by various algorithms, such as StyleGAN and Stable Diffusion, while maintaining a low false positive rate.
How effective is the model in detecting AI-generated faces?
The model achieves a true positive rate (TPR) of 98% for in-engine images and 84% for out-of-engine images, demonstrating its effectiveness in identifying AI-generated faces across different synthesis engines.
What datasets were used for training the AI detection model?
The training dataset comprises 120,000 real LinkedIn profile photos and 105,900 AI-generated faces from various GANs and diffusion models, ensuring a diverse range for robust model training and evaluation.
What are the challenges in detecting AI-generated images?
Detecting AI-generated images poses challenges due to the increasing realism of these images and the need for models to maintain low false positive rates while being robust against evolving image generation technologies.

Key Statistics & Figures

True Positive Rate (TPR) for in-engine images
98%
This rate indicates the model's effectiveness in correctly identifying AI-generated faces from images used in training.
True Positive Rate (TPR) for out-of-engine images
84%
This rate shows the model's performance when evaluating images generated by algorithms not included in the training dataset.
Total number of real LinkedIn profile photos used in training
120,000
This dataset size is crucial for training the model to ensure it can accurately differentiate between real and AI-generated images.
Total number of AI-generated faces used in training
105,900
This diverse dataset includes images from various generative algorithms, enhancing the model's robustness.

Technologies & Tools

Backend
Efficientnet-b1
Used as a convolutional neural network for image processing in the detection model.
AI/ML
Stylegan
One of the generative algorithms from which AI-generated images were sourced for training.
AI/ML
Stable Diffusion
Another generative model used to produce AI-generated images for the dataset.

Key Actionable Insights

1
Implementing a robust AI detection model requires diverse datasets to avoid overfitting to specific artifacts.
By training on a wide variety of images from different synthesis engines, developers can enhance the model's generalization capabilities, making it more effective in real-world applications.
2
Maintaining a low false positive rate is essential for user trust in AI detection systems.
A high false positive rate can lead to legitimate accounts being flagged as fake, which undermines user confidence and could result in user attrition.
3
Collaboration with academic experts can significantly enhance the development of detection technologies.
Engaging with researchers allows companies to leverage cutting-edge techniques and insights that can lead to more effective solutions in combating AI-generated content.

Common Pitfalls

1
Overfitting the model to specific artifacts from training datasets can lead to poor generalization.
This issue arises when a model learns to identify features that are not representative of the broader population of images, resulting in decreased performance on unseen data.
2
Neglecting the impact of image quality and compression on model performance can skew results.
If a model is not tested under realistic conditions, such as varying resolutions and compression levels, its effectiveness in real-world applications may be compromised.

Related Concepts

Ai-generated Content Detection
Deepfake Technology
Generative Adversarial Networks (gans)
Image Processing Techniques