defense 2025

FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies

Shuqiao Liang , Jian Liu , Renzhang Chen , Quanlong Guan

4 citations · 1 influential · 55 references · arXiv

α

Published on arXiv

2509.20890

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

FerretNet trained solely on a 4-class ProGAN dataset achieves 97.1% average accuracy across 22 generative models on an open-world benchmark, outperforming larger models while using only 1.1M parameters.

FerretNet (Local Pixel Dependencies)

Novel technique introduced


The increasing realism of synthetic images generated by advanced models such as VAEs, GANs, and LDMs poses significant challenges for synthetic image detection. To address this issue, we explore two artifact types introduced during the generation process: (1) latent distribution deviations and (2) decoding-induced smoothing effects, which manifest as inconsistencies in local textures, edges, and color transitions. Leveraging local pixel dependencies (LPD) properties rooted in Markov Random Fields, we reconstruct synthetic images using neighboring pixel information to expose disruptions in texture continuity and edge coherence. Building upon LPD, we propose FerretNet, a lightweight neural network with only 1.1M parameters that delivers efficient and robust synthetic image detection. Extensive experiments demonstrate that FerretNet, trained exclusively on the 4-class ProGAN dataset, achieves an average accuracy of 97.1% on an open-world benchmark comprising 22 generative models. Our code and datasets are publicly available at https://github.com/xigua7105/FerretNet.


Key Contributions

  • Novel Local Pixel Dependencies (LPD) representation using Markov Random Fields and median-based reconstruction to expose generation artifacts (latent distribution deviations and decoding-induced smoothing) across GAN, VAE, and LDM architectures
  • FerretNet: a lightweight 1.1M-parameter detector using depthwise separable and dilated convolutions that achieves 97.1% accuracy on 22 generative models when trained only on a 4-class ProGAN dataset
  • Synthetic-Pop: a new 60K-image open-world benchmark covering six high-fidelity generators for evaluating synthetic image detectors

🛡️ Threat Analysis

Output Integrity Attack

Core contribution is a novel AI-generated image detection method (FerretNet) that identifies synthetic images from GANs, VAEs, and LDMs by exposing artifacts — directly in scope for ML09's 'AI-generated content detection / deepfake detection' category. The paper also introduces a new benchmark (Synthetic-Pop) for evaluating detectors.


Details

Domains
visiongenerative
Model Types
cnngandiffusion
Threat Tags
inference_time
Datasets
ProGANSynthetic-PopCOCOLAION-Aesthetics V2
Applications
synthetic image detectiondeepfake detectionai-generated content forensics