Detecting AI-Generated Forgeries via Iterative Manifold Deviation Amplification
Jiangling Zhang , Shuxuan Gao , Bofan Liu , Siqiang Feng , Jirui Huang , Yaxiong Chen , Ziyu Chen
Published on arXiv
2602.18842
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
IFA-Net achieves an average 6.5% IoU and 8.1% F1-score improvement over the second-best method across four diffusion-based inpainting benchmarks while generalizing to traditional manipulation types.
IFA-Net (Iterative Forgery Amplifier Network)
Novel technique introduced
The proliferation of highly realistic AI-generated images poses critical challenges for digital forensics, demanding precise pixel-level localization of manipulated regions. Existing methods predominantly learn discriminative patterns of specific forgeries and often struggle with novel manipulations as editing techniques continue to evolve. We propose the Iterative Forgery Amplifier Network (IFA-Net), which shifts from learning "what is fake" to modeling "what is real". Grounded in the principle that all manipulations deviate from the natural image manifold, IFA-Net leverages a frozen Masked Autoencoder (MAE) pretrained on real images as a universal realness prior. Our framework operates through a two-stage closed-loop process: an initial Dual-Stream Segmentation Network (DSSN) fuses the original image with MAE reconstruction residuals for coarse localization, followed by a Task-Adaptive Prior Injection (TAPI) module that converts this coarse prediction into guiding prompts to steer the MAE decoder and amplify reconstruction failures in suspicious regions for precise refinement. Extensive experiments on four diffusion-based inpainting benchmarks show that IFA-Net achieves an average improvement of 6.5% in IoU and 8.1% in F1-score over the second-best method, while demonstrating strong generalization to traditional manipulation types.
Key Contributions
- IFA-Net framework that shifts from learning forgery-specific patterns to modeling a universal 'realness prior' via a frozen MAE pretrained on real images
- Two-stage closed-loop pipeline: coarse localization via Dual-Stream Segmentation Network (DSSN) fusing image and MAE residuals, followed by Task-Adaptive Prior Injection (TAPI) for iterative refinement
- Achieves 6.5% IoU and 8.1% F1-score improvement over the second-best method on four diffusion-based inpainting benchmarks with generalization to traditional manipulation types
🛡️ Threat Analysis
IFA-Net is an AI-generated content detection system specifically targeting pixel-level localization of manipulated/synthesized image regions — classic output integrity and content authenticity work. Detecting diffusion-based inpainting forgeries falls squarely in the AI-generated content detection subcategory of ML09.