Towards Sustainable Universal Deepfake Detection with Frequency-Domain Masking

Universal deepfake detection aims to identify AI-generated images across a broad range of generative models, including unseen ones. This requires robust generalization to new and unseen deepfakes, which emerge frequently, while minimizing computational overhead to enable large-scale deepfake screening, a critical objective in the era of Green AI. In this work, we explore frequency-domain masking as a training strategy for deepfake detectors. Unlike traditional methods that rely heavily on spatial features or large-scale pretrained models, our approach introduces random masking and geometric transformations, with a focus on frequency masking due to its superior generalization properties. We demonstrate that frequency masking not only enhances detection accuracy across diverse generators but also maintains performance under significant model pruning, offering a scalable and resource-conscious solution. Our method achieves state-of-the-art generalization on GAN- and diffusion-generated image datasets and exhibits consistent robustness under structured pruning. These results highlight the potential of frequency-based masking as a practical step toward sustainable and generalizable deepfake detection. Code and models are available at https://github.com/chandlerbing65nm/FakeImageDetection.

Key Contributions

Frequency-domain masking as a training strategy that improves deepfake detector generalization to unseen GAN and diffusion model outputs
Demonstration that frequency masking maintains detection performance under significant structured model pruning, enabling resource-efficient large-scale screening
State-of-the-art universal deepfake detection across both GAN- and diffusion-generated image benchmarks

🛡️ Threat Analysis

Output Integrity Attack

Proposes a novel deepfake detection technique (frequency-domain masking) that authenticates image provenance and identifies AI-generated content from unseen generative models — directly targeting output integrity and content authenticity.

Details

Domains

visiongenerative

Model Types

transformergandiffusion

Threat Tags

inference_timedigital

Datasets

ForenSynthsGenImage

Applications

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

TrueMoE: Dual-Routing Mixture of Discriminative Experts for Synthetic Image Detection

Rethinking Cross-Generator Image Forgery Detection through DINOv3

SimLBR: Learning to Detect Fake Images by Learning to Detect Real Images

Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution

Enhancing Robustness in Post-Processing Watermarking: An Ensemble Attack Network Using CNNs and Transformers

Detecting AI-Generated Forgeries via Iterative Manifold Deviation Amplification

Detecting AI-Generated Images via Distributional Deviations from Real Images

Patch-Discontinuity Mining for Generalized Deepfake Detection