SAIDO: Generalizable Detection of AI-Generated Images via Scene-Aware and Importance-Guided Dynamic Optimization in Continual Learning

The widespread misuse of image generation technologies has raised security concerns, driving the development of AI-generated image detection methods. However, generalization has become a key challenge and open problem: existing approaches struggle to adapt to emerging generative methods and content types in real-world scenarios. To address this issue, we propose a Scene-Aware and Importance-Guided Dynamic Optimization detection framework with continual learning (SAIDO). Specifically, we design Scene-Awareness-Based Expert Module (SAEM) that dynamically identifies and incorporates new scenes using VLLMs. For each scene, independent expert modules are dynamically allocated, enabling the framework to capture scene-specific forgery features better and enhance cross-scene generalization. To mitigate catastrophic forgetting when learning from multiple image generative methods, we introduce Importance-Guided Dynamic Optimization Mechanism (IDOM), which optimizes each neuron through an importance-guided gradient projection strategy, thereby achieving an effective balance between model plasticity and stability. Extensive experiments on continual learning tasks demonstrate that our method outperforms the current SOTA method in both stability and plasticity, achieving 44.22\% and 40.57\% relative reductions in average detection error rate and forgetting rate, respectively. On open-world datasets, it improves the average detection accuracy by 9.47\% compared to the current SOTA method.

Key Contributions

Scene-Awareness-Based Expert Module (SAEM) that uses VLLMs to dynamically identify new content scenes and allocate independent expert modules per scene for better cross-scene generalization
Importance-Guided Dynamic Optimization Mechanism (IDOM) that applies gradient projection per-neuron to balance plasticity and stability, mitigating catastrophic forgetting across sequential generative methods
Continual learning evaluation showing 44.22% and 40.57% relative reductions in detection error rate and forgetting rate over SOTA, plus 9.47% accuracy gain on open-world datasets

🛡️ Threat Analysis

Output Integrity Attack

Primary contribution is a novel architecture for AI-generated image detection — detecting synthetic/AI-produced content is a canonical ML09 (Output Integrity) task. The paper proposes new forensic detection methodology (SAEM + IDOM) rather than merely applying existing detectors to a new domain.

Details

Domains

vision

Model Types

vlmtransformer

Threat Tags

inference_time

Applications

2025 0 cit.

Output Integrity Attack

89%

SAIDO: Generalizable Detection of AI-Generated Images via Scene-Aware and Importance-Guided Dynamic Optimization in Continual Learning

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Simplicity Prevails: The Emergence of Generalizable AIGI Detection in Visual Foundation Models

Semantic Discrepancy-aware Detector for Image Forgery Identification

Deepfake Forensics Adapter: A Dual-Stream Network for Generalizable Deepfake Detection

OmniFD: A Unified Model for Versatile Face Forgery Detection

DeiTFake: Deepfake Detection Model using DeiT Multi-Stage Training

Deepfake Detection that Generalizes Across Benchmarks

Vulnerability-Aware Spatio-Temporal Learning for Generalizable Deepfake Video Detection

DevFD: Developmental Face Forgery Detection by Learning Shared and Orthogonal LoRA Subspaces