FreqDebias: Towards Generalizable Deepfake Detection via Consistency-Driven Frequency Debiasing

Deepfake detectors often struggle to generalize to novel forgery types due to biases learned from limited training data. In this paper, we identify a new type of model bias in the frequency domain, termed spectral bias, where detectors overly rely on specific frequency bands, restricting their ability to generalize across unseen forgeries. To address this, we propose FreqDebias, a frequency debiasing framework that mitigates spectral bias through two complementary strategies. First, we introduce a novel Forgery Mixup (Fo-Mixup) augmentation, which dynamically diversifies frequency characteristics of training samples. Second, we incorporate a dual consistency regularization (CR), which enforces both local consistency using class activation maps (CAMs) and global consistency through a von Mises-Fisher (vMF) distribution on a hyperspherical embedding space. This dual CR mitigates over-reliance on certain frequency components by promoting consistent representation learning under both local and global supervision. Extensive experiments show that FreqDebias significantly enhances cross-domain generalization and outperforms state-of-the-art methods in both cross-domain and in-domain settings.
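The amplitude-spectrum idea behind Fo-Mixup can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names `band_mask` and `amplitude_band_mixup`, the radial band definition, and the fixed mixing weight `lam` are assumptions made for illustration. The summary does not say how the dominant frequency bands are chosen, so the band is simply passed in here.

```python
import numpy as np

def band_mask(shape, r_lo, r_hi):
    """Boolean mask over a centred 2D spectrum selecting radii in [r_lo, r_hi).

    Radii are normalised so that 1.0 is the Nyquist frequency along one axis.
    (Hypothetical helper; the band definition is an assumption.)
    """
    h, w = shape
    fy = np.fft.fftshift(np.fft.fftfreq(h))  # cycles per pixel, in [-0.5, 0.5)
    fx = np.fft.fftshift(np.fft.fftfreq(w))
    radius = np.sqrt(fy[:, None] ** 2 + fx[None, :] ** 2) / 0.5
    return (radius >= r_lo) & (radius < r_hi)

def amplitude_band_mixup(x, y, mask, lam):
    """Blend the amplitude spectra of x and y inside `mask`, keeping x's phase.

    x, y : 2D float arrays of the same shape (single-channel images)
    mask : boolean array from band_mask
    lam  : weight on x's own amplitude, in [0, 1]
    """
    spec_x = np.fft.fftshift(np.fft.fft2(x))
    spec_y = np.fft.fftshift(np.fft.fft2(y))
    amp_x, phase_x = np.abs(spec_x), np.angle(spec_x)
    amp_y = np.abs(spec_y)
    # Only the selected band is modified; all other frequencies keep x's amplitude.
    mixed_amp = np.where(mask, lam * amp_x + (1.0 - lam) * amp_y, amp_x)
    mixed_spec = mixed_amp * np.exp(1j * phase_x)
    # The radial mask is symmetric, so the result is real up to rounding error.
    return np.real(np.fft.ifft2(np.fft.ifftshift(mixed_spec)))

rng = np.random.default_rng(0)
img_a = rng.random((32, 32))
img_b = rng.random((32, 32))
mid_band = band_mask(img_a.shape, 0.25, 0.5)
augmented = amplitude_band_mixup(img_a, img_b, mid_band, lam=0.5)
```

Keeping the phase of the first image preserves its spatial layout, while the blended amplitudes change how much energy sits in the chosen band. In a training loop, the band and `lam` would presumably vary per sample rather than stay fixed as they do here.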

Key Contributions

Identifies a new type of model bias called spectral bias, where deepfake detectors over-rely on specific frequency bands, limiting cross-domain generalization.
Introduces Forgery Mixup (Fo-Mixup), a data augmentation strategy that dynamically modulates amplitude spectra in dominant frequency bands to diversify training samples.
Proposes a dual consistency regularization that combines CAM-based local consistency with global consistency modeled by a von Mises-Fisher (vMF) distribution on a hyperspherical embedding space, mitigating spectral bias.
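The global consistency term can be sketched in plain Python under stated assumptions. The summary does not give the loss, so the choices below are illustrative rather than the paper's formulation: one vMF component per class, a shared concentration `kappa`, equal class priors, and a symmetric KL divergence between the class posteriors of a clean and an augmented view. All function names are hypothetical. The vMF density on the unit sphere is proportional to exp(kappa * mu·z). With a shared `kappa` the normalizing constant cancels, so the posterior reduces to a softmax over scaled cosine similarities.

```python
import math

def l2_normalize(v):
    """Project a vector onto the unit hypersphere."""
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]

def vmf_posterior(z, means, kappa):
    """Class posterior under vMF components with shared kappa and equal priors.

    Reduces to softmax(kappa * <mu_k, z>) because the vMF normaliser cancels.
    """
    z = l2_normalize(z)
    logits = [kappa * sum(m * c for m, c in zip(l2_normalize(mu), z))
              for mu in means]
    top = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(l - top) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def symmetric_kl(p, q, eps=1e-12):
    """Average of KL(p || q) and KL(q || p) for two discrete distributions."""
    kl_pq = sum(a * math.log((a + eps) / (b + eps)) for a, b in zip(p, q))
    kl_qp = sum(b * math.log((b + eps) / (a + eps)) for a, b in zip(p, q))
    return 0.5 * (kl_pq + kl_qp)

def global_consistency(z_clean, z_aug, means, kappa=10.0):
    """Penalty that is zero when both views yield the same vMF class posterior."""
    return symmetric_kl(vmf_posterior(z_clean, means, kappa),
                        vmf_posterior(z_aug, means, kappa))

# Two hypothetical class directions (real, fake) in a 3-D embedding space.
class_means = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
penalty = global_consistency([0.9, 0.1, 0.2], [0.6, 0.5, 0.1], class_means)
```

Because embeddings are normalized before comparison, the penalty depends only on their direction, which matches the hyperspherical embedding space described above.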

🛡️ Threat Analysis

Output Integrity Attack

Directly addresses detection of AI-generated facial content (deepfakes). The primary contribution is a detection framework that improves the cross-domain generalization of deepfake detectors, which falls under output integrity (AI-generated content detection).

Details

Domains

vision

Model Types

cnn

Threat Tags

inference_time

Datasets

FaceForensics++, Celeb-DF, DFDC, DeepFakeDetection

Applications

Similar Papers

SpectraNet: FFT-assisted Deep Learning Classifier for Deepfake Face Detection

Efficient and Verifiable Privacy-Preserving Convolutional Computation for CNN Inference with Untrusted Clouds

Towards Generalizable AI-Generated Image Detection via Image-Adaptive Prompt Learning

LAKAN: Landmark-assisted Adaptive Kolmogorov-Arnold Network for Face Forgery Detection

Fusion-SSAT: Unleashing the Potential of Self-supervised Auxiliary Task by Feature Fusion for Generalized Deepfake Detection

A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World

Towards Generalizable Deepfake Detection via Real Distribution Bias Correction

Fourier-Based GAN Fingerprint Detection using ResNet50