RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry

Recent image generators produce photo-realistic content that undermines the reliability of downstream recognition systems. As visual appearance cues become less pronounced, appearance-driven detectors that rely on forensic cues or high-level representations lose stability. This motivates a shift from appearance to behavior, focusing on how images respond to controlled perturbations rather than how they look. In this work, we identify a simple and universal behavioral signal. Natural images preserve stable semantic representations under small, structured perturbations, whereas generated images exhibit markedly larger feature drift. We refer to this phenomenon as robustness asymmetry and provide a theoretical analysis that establishes a lower bound connecting this asymmetry to memorization tendencies in generative models, explaining its prevalence across architectures. Building on this insight, we introduce Robustness Asymmetry Detection (RA-Det), a behavior-driven detection framework that converts robustness asymmetry into a reliable decision signal. Evaluated across 14 diverse generative models and against more than 10 strong detectors, RA-Det achieves superior performance, improving the average performance by 7.81 percent. The method is data- and model-agnostic, requires no generator fingerprints, and transfers across unseen generators. Together, these results indicate that robustness asymmetry is a stable, general cue for synthetic-image detection and that carefully designed probing can turn this cue into a practical, universal detector. The source code is publicly available at Github.

Key Contributions

Identifies 'robustness asymmetry' as a universal behavioral signal: natural images maintain stable semantic embeddings under small structured perturbations while AI-generated images exhibit markedly larger feature drift
Provides theoretical analysis establishing a lower bound connecting robustness asymmetry to memorization tendencies in generative models, explaining its generality across architectures
Introduces RA-Det, a data- and model-agnostic detection framework requiring no generator fingerprints that achieves +7.81% average improvement over 10+ detectors across 14 generative models

🛡️ Threat Analysis

Output Integrity Attack

Proposes a novel detection framework for AI-generated (synthetic) images — a direct contribution to output integrity and content authenticity. The paper introduces a new behavioral forensic signal (robustness asymmetry) and a detector that outperforms 10+ existing methods across 14 generative models.

Details

Domains

vision

Model Types

transformerdiffusiongan

Threat Tags

inference_timedigital

Applications

2025 0 cit.

Output Integrity Attack

85%

RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

When Semantics Regulate: Rethinking Patch Shuffle and Internal Bias for Generated Image Detection with CLIP

A Hybrid Deep Learning and Forensic Approach for Robust Deepfake Detection

Toward Medical Deepfake Detection: A Comprehensive Dataset and Novel Method

DNA: Uncovering Universal Latent Forgery Knowledge

CausalCLIP: Causally-Informed Feature Disentanglement and Filtering for Generalizable Detection of Generated Images

TrueMoE: Dual-Routing Mixture of Discriminative Experts for Synthetic Image Detection

Towards Sustainable Universal Deepfake Detection with Frequency-Domain Masking

Leveraging Hierarchical Image-Text Misalignment for Universal Fake Image Detection