FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection
Jie Li , Yingying Feng , Chi Xie , Jie Hu , Lei Tan , Jiayi Ji
Published on arXiv
2603.14220
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
Achieves 11.7% performance improvement on GenImage benchmark while running 126x faster than reconstruction-based methods
FIND
Novel technique introduced
The remarkable realism of images generated by diffusion models poses critical detection challenges. Current methods utilize reconstruction error as a discriminative feature, exploiting the observation that real images exhibit higher reconstruction errors when processed through diffusion models. However, these approaches require costly reconstruction computations and depend on specific diffusion models, making their performance highly model-dependent. We identify a fundamental difference: real images are more difficult to fit with Gaussian distributions compared to synthetic ones. In this paper, we propose Forgery Identification via Noise Disturbance (FIND), a novel method that requires only a simple binary classifier. It eliminates reconstruction by directly targeting the core distributional difference between real and synthetic images. Our key operation is to add Gaussian noise to real images during training and label these noisy versions as synthetic. This step allows the classifier to focus on the statistical patterns that distinguish real from synthetic images. We theoretically prove that the noise-augmented real images resemble diffusion-generated images in their ease of Gaussian fitting. Furthermore, simply by adding noise, they still retain visual similarity to the original images, highlighting the most discriminative distribution-related features. The proposed FIND improves performance by 11.7% on the GenImage benchmark while running 126x faster than existing methods. By removing the need for auxiliary diffusion models and reconstruction, it offers a practical, efficient, and generalizable way to detect diffusion-generated content.
Key Contributions
- Novel detection method based on distributional differences between real and synthetic images without requiring reconstruction
- Noise augmentation training strategy that makes real images resemble synthetic ones in Gaussian fitting properties
- 126x faster than existing methods with 11.7% performance improvement on GenImage benchmark
🛡️ Threat Analysis
Core contribution is detecting AI-generated images (diffusion model outputs) to verify content authenticity and distinguish synthetic from real images — this is output integrity and AI-generated content detection.