Adaptive Forensic Feature Refinement via Intrinsic Importance Perception

With the rapid development of generative models and multimodal content editing technologies, the key challenge faced by synthetic image detection (SID) lies in cross-distribution generalization to unknown generation sources. In recent years, visual foundation models (VFM), which acquire rich visual priors through large scale image-text alignment pretraining, have become a promising technical route for improving the generalization ability of SID. However, existing VFM-based methods remain relatively coarse-grained in their adaptation strategies. They typically either directly use the final layer representations of VFM or simply fuse multi layer features, lacking explicit modeling of the optimal representational hierarchy for transferable forgery cues. Meanwhile, although directly fine-tuning VFM can enhance task adaptation, it may also damage the cross-modal pretrained structure that supports open-set generalization. To address this task specific tension, we reformulate VFM adaptation for SID as a joint optimization problem: it is necessary both to identify the critical representational layer that is more suitable for carrying forgery discriminative information and to constrain the disturbance caused by task knowledge injection to the pretrained structure. Based on this, we propose I2P, an SID framework centered on intrinsic importance perception. I2P first adaptively identifies the critical layer representations that are most discriminative for SID, and then constrains task-driven parameter updates within a low sensitivity parameter subspace, thereby improving task specificity while preserving the transferable structure of pretrained representations as much as possible.

Key Contributions

Adaptive layer selection mechanism that identifies critical VFM representational hierarchies most discriminative for synthetic image detection
Low-sensitivity parameter subspace optimization that injects task knowledge while preserving pretrained structure for cross-domain generalization
I2P framework achieving competitive generalization on cross-distribution synthetic image detection benchmarks

🛡️ Threat Analysis

Output Integrity Attack

Paper focuses on detecting synthetic/AI-generated images to verify content authenticity and distinguish real from fake content. This is output integrity - specifically AI-generated content detection, which is a core ML09 use case.

Details

Domains

visionmultimodal

Model Types

transformerdiffusiongan

Threat Tags

inference_time

Datasets

GenImage

Applications

2025 2 cit.

Output Integrity Attack

83%