defense 2025

FreqDebias: Towards Generalizable Deepfake Detection via Consistency-Driven Frequency Debiasing

Hossein Kashiani , Niloufar Alipour Talemi , Fatemeh Afghah

10 citations · 77 references · CVPR

α

Published on arXiv

2509.22412

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

FreqDebias outperforms state-of-the-art methods in both cross-domain and in-domain deepfake detection settings by mitigating spectral bias.

FreqDebias

Novel technique introduced


Deepfake detectors often struggle to generalize to novel forgery types due to biases learned from limited training data. In this paper, we identify a new type of model bias in the frequency domain, termed spectral bias, where detectors overly rely on specific frequency bands, restricting their ability to generalize across unseen forgeries. To address this, we propose FreqDebias, a frequency debiasing framework that mitigates spectral bias through two complementary strategies. First, we introduce a novel Forgery Mixup (Fo-Mixup) augmentation, which dynamically diversifies frequency characteristics of training samples. Second, we incorporate a dual consistency regularization (CR), which enforces both local consistency using class activation maps (CAMs) and global consistency through a von Mises-Fisher (vMF) distribution on a hyperspherical embedding space. This dual CR mitigates over-reliance on certain frequency components by promoting consistent representation learning under both local and global supervision. Extensive experiments show that FreqDebias significantly enhances cross-domain generalization and outperforms state-of-the-art methods in both cross-domain and in-domain settings.


Key Contributions

  • Identifies a new type of model bias called spectral bias, where deepfake detectors over-rely on specific frequency bands, limiting cross-domain generalization.
  • Introduces Forgery Mixup (Fo-Mixup), a data augmentation strategy that dynamically modulates amplitude spectra in dominant frequency bands to diversify training samples.
  • Proposes dual consistency regularization combining CAM-based local consistency and von Mises-Fisher distribution global consistency on a hyperspherical embedding space to mitigate spectral bias.

🛡️ Threat Analysis

Output Integrity Attack

Directly addresses detection of AI-generated facial content (deepfakes); the primary contribution is a novel detection architecture that improves cross-domain generalization of deepfake detectors, which is output integrity / AI-generated content detection.


Details

Domains
vision
Model Types
cnn
Threat Tags
inference_time
Datasets
FaceForensics++Celeb-DFDFDCDeepFakeDetection
Applications
deepfake detectionface forgery detection