defense 2026

PromptForge-350k: A Large-Scale Dataset and Contrastive Framework for Prompt-Based AI Image Forgery Localization

Jianpeng Wang , Haoyu Wang , Baoying Chen , Jishen Zeng , Yiming Qin , Yiqi Yang , Zhongjie Ba

0 citations

α

Published on arXiv

2603.29386

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

Achieves 62.5% IoU on PromptForge-350k dataset, outperforming SOTA methods by 5.1%, with less than 1% IoU drop under common degradations

ICL-Net

Novel technique introduced


The rapid democratization of prompt-based AI image editing has recently exacerbated the risks associated with malicious content fabrication and misinformation. However, forgery localization methods targeting these emerging editing techniques remain significantly under-explored. To bridge this gap, we first introduce a fully automated mask annotating framework that leverages keypoint alignment and semantic space similarity to generate precise ground-truth masks for edited regions. Based on this framework, we construct PromptForge-350k, a large-scale forgery localization dataset covering four state-of-the-art prompt-based AI image editing models, thereby mitigating the data scarcity in this domain. Furthermore, we propose ICL-Net, an effective forgery localization network featuring a triple-stream backbone and intra-image contrastive learning. This design enables the model to capture highly robust and generalizable forensic features. Extensive experiments demonstrate that our method achieves an IoU of 62.5% on PromptForge-350k, outperforming SOTA methods by 5.1%. Additionally, it exhibits strong robustness against common degradations with an IoU drop of less than 1%, and shows promising generalization capabilities on unseen editing models, achieving an average IoU of 41.5%.


Key Contributions

  • PromptForge-350k: Large-scale dataset with 350k images covering four SOTA prompt-based AI editing models with automated mask annotation
  • ICL-Net: Triple-stream forgery localization network with intra-image contrastive learning for robust forensic feature extraction
  • Achieves 62.5% IoU outperforming SOTA by 5.1% with strong robustness to degradations and generalization to unseen editing models

🛡️ Threat Analysis

Output Integrity Attack

Paper focuses on detecting and localizing AI-generated image forgeries/manipulations - this is output integrity and content authenticity, specifically detecting regions of images that have been edited by AI image editing models.


Details

Domains
visionmultimodal
Model Types
diffusioncnn
Threat Tags
inference_timedigital
Datasets
PromptForge-350k
Applications
image forgery detectionmedia forensicscontent authenticity verification