benchmark 2025

Revisiting Deepfake Detection: Chronological Continual Learning and the Limits of Generalization

Federico Fontana 1, Anxhelo Diko 1, Romeo Lanzino 1, Marco Raoul Marini 1, Bachir Kaddar 2, Gian Luca Foresti 3, Luigi Cinque 1

0 citations

α

Published on arXiv

2509.07993

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

Continual learning enables 155x faster adaptation than full retraining with strong historical retention, but generalization to future deepfake generators remains near-random (FWT-AUC ≈ 0.5) regardless of method.

C-AUC / FWT-AUC

Novel technique introduced


The rapid evolution of deepfake generation technologies poses critical challenges for detection systems, as non-continual learning methods demand frequent and expensive retraining. We reframe deepfake detection (DFD) as a Continual Learning (CL) problem, proposing an efficient framework that incrementally adapts to emerging visual manipulation techniques while retaining knowledge of past generators. Our framework, unlike prior approaches that rely on unreal simulation sequences, simulates the real-world chronological evolution of deepfake technologies in extended periods across 7 years. Simultaneously, our framework builds upon lightweight visual backbones to allow for the real-time performance of DFD systems. Additionally, we contribute two novel metrics: Continual AUC (C-AUC) for historical performance and Forward Transfer AUC (FWT-AUC) for future generalization. Through extensive experimentation (over 600 simulations), we empirically demonstrate that while efficient adaptation (+155 times faster than full retraining) and robust retention of historical knowledge is possible, the generalization of current approaches to future generators without additional training remains near-random (FWT-AUC $\approx$ 0.5) due to the unique imprint characterizing each existing generator. Such observations are the foundation of our newly proposed Non-Universal Deepfake Distribution Hypothesis. \textbf{Code will be released upon acceptance.}


Key Contributions

  • Continual learning framework for deepfake detection that simulates the real-world chronological evolution of 7 years of deepfake generators, enabling efficient incremental adaptation without full retraining
  • Two novel evaluation metrics: Continual AUC (C-AUC) for historical retention and Forward Transfer AUC (FWT-AUC) for future generalization, evaluated across 600+ simulations
  • Non-Universal Deepfake Distribution Hypothesis: empirical finding that current approaches fail to generalize to unseen future generators (FWT-AUC ≈ 0.5) due to unique generator-specific imprints

🛡️ Threat Analysis

Output Integrity Attack

Deepfake detection is AI-generated content detection — a canonical ML09 output integrity problem. The paper proposes novel detection methodology (continual learning framework), evaluation metrics (C-AUC, FWT-AUC), and an empirical hypothesis about the limits of deepfake detector generalization.


Details

Domains
vision
Model Types
cnntransformer
Threat Tags
inference_time
Datasets
FaceForensics++
Applications
deepfake detectionvideo forensicsface manipulation detection