defense 2026

Adaptive Forensic Feature Refinement via Intrinsic Importance Perception

Jiazhen Yang 1, Junjun Zheng 1, Kejia Chen 1, Xiangheng Kong 2, Jie Lei 3, Zunlei Feng 1, Bingde Hu 1, Yang Gao 1

0 citations

α

Published on arXiv

2604.16879

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

Intermediate CLIP layers naturally exhibit clearer separable structure between real and synthetic images compared to early or deep layers

I2P

Novel technique introduced


With the rapid development of generative models and multimodal content editing technologies, the key challenge faced by synthetic image detection (SID) lies in cross-distribution generalization to unknown generation sources. In recent years, visual foundation models (VFM), which acquire rich visual priors through large scale image-text alignment pretraining, have become a promising technical route for improving the generalization ability of SID. However, existing VFM-based methods remain relatively coarse-grained in their adaptation strategies. They typically either directly use the final layer representations of VFM or simply fuse multi layer features, lacking explicit modeling of the optimal representational hierarchy for transferable forgery cues. Meanwhile, although directly fine-tuning VFM can enhance task adaptation, it may also damage the cross-modal pretrained structure that supports open-set generalization. To address this task specific tension, we reformulate VFM adaptation for SID as a joint optimization problem: it is necessary both to identify the critical representational layer that is more suitable for carrying forgery discriminative information and to constrain the disturbance caused by task knowledge injection to the pretrained structure. Based on this, we propose I2P, an SID framework centered on intrinsic importance perception. I2P first adaptively identifies the critical layer representations that are most discriminative for SID, and then constrains task-driven parameter updates within a low sensitivity parameter subspace, thereby improving task specificity while preserving the transferable structure of pretrained representations as much as possible.


Key Contributions

  • Adaptive layer selection mechanism that identifies critical VFM representational hierarchies most discriminative for synthetic image detection
  • Low-sensitivity parameter subspace optimization that injects task knowledge while preserving pretrained structure for cross-domain generalization
  • I2P framework achieving competitive generalization on cross-distribution synthetic image detection benchmarks

🛡️ Threat Analysis

Output Integrity Attack

Paper focuses on detecting synthetic/AI-generated images to verify content authenticity and distinguish real from fake content. This is output integrity - specifically AI-generated content detection, which is a core ML09 use case.


Details

Domains
visionmultimodal
Model Types
transformerdiffusiongan
Threat Tags
inference_time
Datasets
GenImage
Applications
synthetic image detectiondeepfake detectionai-generated content detection