defense 2026

Efficient Zero-Shot AI-Generated Image Detection

Ryosuke Sonoda 1, Ramya Srinivasan 2

0 citations

α

Published on arXiv

2603.21619

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

Improves AUC by nearly 10% over state-of-the-art on OpenFake benchmark while achieving 1-2 orders of magnitude faster inference than most training-free detectors


The rapid progress of text-to-image models has made AI-generated images increasingly realistic, posing significant challenges for accurate detection of generated content. While training-based detectors often suffer from limited generalization to unseen images, training-free approaches offer better robustness, yet struggle to capture subtle discrepancies between real and synthetic images. In this work, we propose a training-free AI-generated image detection method that measures representation sensitivity to structured frequency perturbations, enabling detection of minute manipulations. The proposed method is computationally lightweight, as perturbation generation requires only a single Fourier transform for an input image. As a result, it achieves one to two orders of magnitude faster inference than most training-free detectors.Extensive experiments on challenging benchmarks demonstrate the efficacy of our method over state-of-the-art (SoTA). In particular, on OpenFake benchmark, our method improves AUC by nearly $10\%$ compared to SoTA, while maintaining substantially lower computational cost.


Key Contributions

  • Training-free detection method using structured frequency perturbations to measure representation sensitivity
  • Achieves 1-2 orders of magnitude faster inference than most training-free detectors via single Fourier transform
  • 10% AUC improvement over state-of-the-art on OpenFake benchmark with substantially lower computational cost

🛡️ Threat Analysis

Output Integrity Attack

Detecting AI-generated images (distinguishing synthetic from real content) is an output integrity and content authenticity problem — the paper proposes a method to verify whether images are AI-generated, which falls under ML09's scope of AI-generated content detection.


Details

Domains
visiongenerative
Model Types
diffusiongan
Threat Tags
inference_timedigital
Datasets
OpenFake
Applications
ai-generated image detectiondeepfake detectionsynthetic content detection