FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies
Shuqiao Liang , Jian Liu , Renzhang Chen , Quanlong Guan
Published on arXiv
2509.20890
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
FerretNet trained solely on a 4-class ProGAN dataset achieves 97.1% average accuracy across 22 generative models on an open-world benchmark, outperforming larger models while using only 1.1M parameters.
FerretNet (Local Pixel Dependencies)
Novel technique introduced
The increasing realism of synthetic images generated by advanced models such as VAEs, GANs, and LDMs poses significant challenges for synthetic image detection. To address this issue, we explore two artifact types introduced during the generation process: (1) latent distribution deviations and (2) decoding-induced smoothing effects, which manifest as inconsistencies in local textures, edges, and color transitions. Leveraging local pixel dependencies (LPD) properties rooted in Markov Random Fields, we reconstruct synthetic images using neighboring pixel information to expose disruptions in texture continuity and edge coherence. Building upon LPD, we propose FerretNet, a lightweight neural network with only 1.1M parameters that delivers efficient and robust synthetic image detection. Extensive experiments demonstrate that FerretNet, trained exclusively on the 4-class ProGAN dataset, achieves an average accuracy of 97.1% on an open-world benchmark comprising 22 generative models. Our code and datasets are publicly available at https://github.com/xigua7105/FerretNet.
Key Contributions
- Novel Local Pixel Dependencies (LPD) representation using Markov Random Fields and median-based reconstruction to expose generation artifacts (latent distribution deviations and decoding-induced smoothing) across GAN, VAE, and LDM architectures
- FerretNet: a lightweight 1.1M-parameter detector using depthwise separable and dilated convolutions that achieves 97.1% accuracy on 22 generative models when trained only on a 4-class ProGAN dataset
- Synthetic-Pop: a new 60K-image open-world benchmark covering six high-fidelity generators for evaluating synthetic image detectors
🛡️ Threat Analysis
Core contribution is a novel AI-generated image detection method (FerretNet) that identifies synthetic images from GANs, VAEs, and LDMs by exposing artifacts — directly in scope for ML09's 'AI-generated content detection / deepfake detection' category. The paper also introduces a new benchmark (Synthetic-Pop) for evaluating detectors.