Detecting AI-Generated Forgeries via Iterative Manifold Deviation Amplification

The proliferation of highly realistic AI-generated images poses critical challenges for digital forensics, demanding precise pixel-level localization of manipulated regions. Existing methods predominantly learn discriminative patterns of specific forgeries and often struggle with novel manipulations as editing techniques continue to evolve. We propose the Iterative Forgery Amplifier Network (IFA-Net), which shifts from learning "what is fake" to modeling "what is real". Grounded in the principle that all manipulations deviate from the natural image manifold, IFA-Net leverages a frozen Masked Autoencoder (MAE) pretrained on real images as a universal realness prior. Our framework operates through a two-stage closed-loop process: an initial Dual-Stream Segmentation Network (DSSN) fuses the original image with MAE reconstruction residuals for coarse localization, followed by a Task-Adaptive Prior Injection (TAPI) module that converts this coarse prediction into guiding prompts to steer the MAE decoder and amplify reconstruction failures in suspicious regions for precise refinement. Extensive experiments on four diffusion-based inpainting benchmarks show that IFA-Net achieves an average improvement of 6.5% in IoU and 8.1% in F1-score over the second-best method, while demonstrating strong generalization to traditional manipulation types.

Key Contributions

IFA-Net framework that shifts from learning forgery-specific patterns to modeling a universal 'realness prior' via a frozen MAE pretrained on real images
Two-stage closed-loop pipeline: coarse localization via Dual-Stream Segmentation Network (DSSN) fusing image and MAE residuals, followed by Task-Adaptive Prior Injection (TAPI) for iterative refinement
Achieves 6.5% IoU and 8.1% F1-score improvement over the second-best method on four diffusion-based inpainting benchmarks with generalization to traditional manipulation types

🛡️ Threat Analysis

Output Integrity Attack

IFA-Net is an AI-generated content detection system specifically targeting pixel-level localization of manipulated/synthesized image regions — classic output integrity and content authenticity work. Detecting diffusion-based inpainting forgeries falls squarely in the AI-generated content detection subcategory of ML09.

Details

Domains

visiongenerative

Model Types

transformerdiffusion

Threat Tags

inference_timedigital

Datasets

diffusion-based inpainting benchmarks (4 benchmarks, names not specified in abstract/body excerpt)

Applications

2025 0 cit.

Output Integrity Attack

92%