defense 2025

Towards Sustainable Universal Deepfake Detection with Frequency-Domain Masking

Chandler Timm C. Doloriel 1,2, Habib Ullah 2, Kristian Hovde Liland 2, Fadi Al Machot 2, Ngai-Man Cheung 1

0 citations · 49 references · arXiv

α

Published on arXiv

2512.08042

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

Frequency-domain masking achieves state-of-the-art generalization on GAN- and diffusion-generated image datasets while retaining detection accuracy under structured pruning

Frequency-Domain Masking

Novel technique introduced


Universal deepfake detection aims to identify AI-generated images across a broad range of generative models, including unseen ones. This requires robust generalization to new and unseen deepfakes, which emerge frequently, while minimizing computational overhead to enable large-scale deepfake screening, a critical objective in the era of Green AI. In this work, we explore frequency-domain masking as a training strategy for deepfake detectors. Unlike traditional methods that rely heavily on spatial features or large-scale pretrained models, our approach introduces random masking and geometric transformations, with a focus on frequency masking due to its superior generalization properties. We demonstrate that frequency masking not only enhances detection accuracy across diverse generators but also maintains performance under significant model pruning, offering a scalable and resource-conscious solution. Our method achieves state-of-the-art generalization on GAN- and diffusion-generated image datasets and exhibits consistent robustness under structured pruning. These results highlight the potential of frequency-based masking as a practical step toward sustainable and generalizable deepfake detection. Code and models are available at https://github.com/chandlerbing65nm/FakeImageDetection.


Key Contributions

  • Frequency-domain masking as a training strategy that improves deepfake detector generalization to unseen GAN and diffusion model outputs
  • Demonstration that frequency masking maintains detection performance under significant structured model pruning, enabling resource-efficient large-scale screening
  • State-of-the-art universal deepfake detection across both GAN- and diffusion-generated image benchmarks

🛡️ Threat Analysis

Output Integrity Attack

Proposes a novel deepfake detection technique (frequency-domain masking) that authenticates image provenance and identifies AI-generated content from unseen generative models — directly targeting output integrity and content authenticity.


Details

Domains
visiongenerative
Model Types
transformergandiffusion
Threat Tags
inference_timedigital
Datasets
ForenSynthsGenImage
Applications
deepfake detectionai-generated image detectionimage authenticity verification