Frequency Bias Matters: Diving into Robust and Generalized Deep Image Forgery Detection
Chi Liu 1, Tianqing Zhu 1, Wanlei Zhou 1, Wei Zhao 2
Published on arXiv
2511.19886
Output Integrity Attack
OWASP ML Top 10 — ML09
Input Manipulation Attack
OWASP ML Top 10 — ML01
Key Finding
A single frequency alignment method simultaneously serves as a transferable black-box attack that evades multiple GAN forgery detectors and as a universal defense that improves detector generalizability and robustness, demonstrated across 12 detectors and 8 GAN-based forgery models.
Frequency Alignment
Novel technique introduced
As deep image forgery powered by AI generative models, such as GANs, continues to challenge today's digital world, detecting AI-generated forgeries has become a vital security topic. Generalizability and robustness are two critical concerns of a forgery detector, determining its reliability when facing unknown GANs and noisy samples in an open world. Although many studies focus on improving these two properties, the root causes of these problems have not been fully explored, and it is unclear if there is a connection between them. Moreover, despite recent achievements in addressing these issues from image forensic or anti-forensic aspects, a universal method that can contribute to both sides simultaneously remains practically significant yet unavailable. In this paper, we provide a fundamental explanation of these problems from a frequency perspective. Our analysis reveals that the frequency bias of a DNN forgery detector is a possible cause of generalization and robustness issues. Based on this finding, we propose a two-step frequency alignment method to remove the frequency discrepancy between real and fake images, offering double-sided benefits: it can serve as a strong black-box attack against forgery detectors in the anti-forensic context or, conversely, as a universal defense to improve detector reliability in the forensic context. We also develop corresponding attack and defense implementations and demonstrate their effectiveness, as well as the effect of the frequency alignment method, in various experimental settings involving twelve detectors, eight forgery models, and five metrics.
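The paper does not spell out the two alignment steps in this summary, but the core idea, removing the frequency discrepancy between real and fake images, can be sketched with a simple spectral-matching procedure: estimate a reference magnitude spectrum from real images, then swap a fake image's high-frequency magnitudes for that reference while keeping its phase. This is a minimal illustrative sketch, not the authors' implementation; the `cutoff` parameter and the radial high-frequency mask are assumptions.

```python
import numpy as np

def avg_real_spectrum(real_images):
    """Step 1 (sketch): estimate a reference magnitude spectrum
    by averaging the FFT magnitudes of real images."""
    mags = [np.abs(np.fft.fft2(img)) for img in real_images]
    return np.mean(mags, axis=0)

def frequency_align(fake, ref_mag, cutoff=0.25):
    """Step 2 (sketch): replace the fake image's high-frequency
    magnitudes with the real reference, keeping the fake's phase,
    so the tell-tale spectral artifacts are suppressed."""
    F = np.fft.fft2(fake)
    mag, phase = np.abs(F), np.angle(F)
    h, w = fake.shape
    # Radial mask selecting high frequencies in the centered spectrum.
    yy, xx = np.ogrid[:h, :w]
    radius = np.sqrt((yy - h // 2) ** 2 + (xx - w // 2) ** 2)
    hi = np.fft.ifftshift(radius > cutoff * min(h, w))
    aligned_mag = np.where(hi, ref_mag, mag)
    aligned = np.fft.ifft2(aligned_mag * np.exp(1j * phase)).real
    return np.clip(aligned, 0.0, 1.0)
```

In the anti-forensic role this alignment is applied to fake images before submitting them to a detector; in the forensic role the same operation can normalize training data so a detector cannot over-fit to GAN-specific spectral artifacts.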
Key Contributions
- Identifies DNN frequency bias — over-reliance on high-frequency spectral artifacts — as the shared root cause of both generalization failure (unknown GANs) and robustness failure (noisy/adversarial samples) in image forgery detectors
- Proposes a two-step frequency alignment method that removes frequency discrepancy between real and fake images, functioning as a dual-use tool: a strong black-box anti-forensic attack or a universal forensic defense
- Demonstrates effectiveness across 12 detectors, 8 forgery models, and 5 metrics, covering both attack and defense scenarios
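The frequency-bias claim in the first contribution can be probed with a simple diagnostic: if a detector's accuracy collapses once high-frequency content is removed, it is relying on high-frequency spectral artifacts rather than semantic cues. The sketch below, with an assumed radial `cutoff` and a hypothetical `detector` callable, is one way to measure that reliance; it is not from the paper.

```python
import numpy as np

def low_pass(img, cutoff=0.1):
    """Zero out frequency components beyond a radial cutoff."""
    F = np.fft.fftshift(np.fft.fft2(img))
    h, w = img.shape
    yy, xx = np.ogrid[:h, :w]
    radius = np.sqrt((yy - h // 2) ** 2 + (xx - w // 2) ** 2)
    F[radius > cutoff * min(h, w)] = 0
    return np.fft.ifft2(np.fft.ifftshift(F)).real

def frequency_bias_score(detector, images, labels, cutoff=0.1):
    """Accuracy drop after low-pass filtering. A large positive score
    suggests the detector depends on high-frequency artifacts."""
    acc_full = np.mean([detector(x) == y for x, y in zip(images, labels)])
    acc_lp = np.mean([detector(low_pass(x, cutoff)) == y
                      for x, y in zip(images, labels)])
    return acc_full - acc_lp
```

A detector with a high bias score is exactly the kind the paper identifies as fragile: it generalizes poorly to unseen GANs (whose high-frequency fingerprints differ) and breaks under noise or compression (which perturbs the same band).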
🛡️ Threat Analysis
The anti-forensic component is a black-box adversarial evasion attack: it crafts frequency-aligned fake images that cause forgery detectors to misclassify them as real at inference time, satisfying the definition of an adversarial input manipulation attack.
The core contribution sits within AI-generated content detection: the paper analyzes the root causes of failure in GAN forgery detectors and proposes defenses that improve their generalizability and robustness in the forensic direction.