Scaling Laws for Deepfake Detection
Wenhao Wang, Longqi Cai, Taihong Xiao, Yuxiao Wang, Ming-Hsuan Yang
Published on arXiv: 2510.16320
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
Detection error follows a predictable power-law decay in the number of real domains or deepfake generation methods, with no saturation observed across 51 domains and 102 methods. This enables forecasting how much additional data is required to reach a target detection performance.
ScaleDF
Novel technique introduced
This paper presents a systematic study of scaling laws for deepfake detection. Specifically, the authors analyze model performance as a function of the number of real image domains, deepfake generation methods, and training images. Since no existing dataset meets the scale requirements for this research, they construct ScaleDF, the largest dataset to date in this field, containing over 5.8 million real images from 51 datasets (domains) and more than 8.8 million fake images generated by 102 deepfake methods. Using ScaleDF, they observe power-law scaling similar to that seen in large language models (LLMs): the average detection error follows a predictable power-law decay as either the number of real domains or the number of deepfake methods increases. This observation not only allows forecasting the number of additional real domains or deepfake methods required to reach a target performance, but also motivates countering evolving deepfake technology in a data-centric manner. Beyond this, the paper examines the role of pre-training and data augmentation in deepfake detection under scaling, as well as the limitations of scaling itself.
Key Contributions
- ScaleDF: the largest deepfake detection dataset with 5.8M+ real images from 51 domains and 8.8M+ fake images from 102 deepfake generation methods across 5 forgery categories
- Discovery of power-law scaling laws for deepfake detection: detection error follows 1 − AUC = A·N^(−α) as the number of real domains or deepfake methods increases, with no observed saturation
- Empirical analysis of how pre-training and data augmentation interact with data scale, and identification of limitations where performance saturates beyond ~10M images at fixed domain/method counts
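The forecasting use of the scaling law in the contributions above can be sketched in a few lines: fit 1 − AUC = A·N^(−α) to observed error rates by linear regression in log-log space, then invert the fitted curve to estimate how many domains or methods are needed for a target error. The data points below are illustrative placeholders, not figures from the paper.

```python
import math

# Hypothetical (illustrative) measurements: average detection error
# (1 - AUC) at increasing numbers of deepfake methods N.
# These numbers are NOT from the paper; they merely follow a power law.
observations = [(8, 0.20), (16, 0.14), (32, 0.10), (64, 0.07)]

# Fit 1 - AUC = A * N^(-alpha) via least squares in log-log space:
#   log(err) = log(A) - alpha * log(N)
xs = [math.log(n) for n, _ in observations]
ys = [math.log(e) for _, e in observations]
k = len(xs)
mean_x, mean_y = sum(xs) / k, sum(ys) / k
slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / \
        sum((x - mean_x) ** 2 for x in xs)
alpha = -slope                          # power-law exponent
A = math.exp(mean_y + alpha * mean_x)   # power-law coefficient

# Forecast: methods needed to push the error down to a target level.
target = 0.05
n_needed = (A / target) ** (1 / alpha)
print(f"alpha={alpha:.3f}, A={A:.3f}, "
      f"N for err<={target}: {math.ceil(n_needed)}")
```

With these placeholder values the fit gives an exponent near 0.5, so halving the error requires roughly quadrupling the number of generation methods, which is the kind of data-requirement forecast the paper's scaling-law analysis enables.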
🛡️ Threat Analysis
Deepfake detection (AI-generated face content detection) is explicitly listed under ML09. The paper's primary contributions — ScaleDF dataset and scaling law analysis — directly advance the capability to detect AI-generated/manipulated imagery, which is the core output integrity concern of ML09.