defense 2025

Beyond Semantic Features: Pixel-level Mapping for Generalized AI-Generated Image Detection

Chenming Zhou 1,2, Jiaan Wang 1,2, Yu Li 1,2, Lei Li 1,2, Juan Cao 1,2, Sheng Tang 1,2

0 citations · 63 references · arXiv

α

Published on arXiv

2512.17350

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

Pixel-level mapping pre-processing significantly boosts cross-generator detection performance of state-of-the-art detectors on both GAN and diffusion-generated images by reducing reliance on source-specific semantic cues.

Pixel-level mapping

Novel technique introduced


The rapid evolution of generative technologies necessitates reliable methods for detecting AI-generated images. A critical limitation of current detectors is their failure to generalize to images from unseen generative models, as they often overfit to source-specific semantic cues rather than learning universal generative artifacts. To overcome this, we introduce a simple yet remarkably effective pixel-level mapping pre-processing step to disrupt the pixel value distribution of images and break the fragile, non-essential semantic patterns that detectors commonly exploit as shortcuts. This forces the detector to focus on more fundamental and generalizable high-frequency traces inherent to the image generation process. Through comprehensive experiments on GAN and diffusion-based generators, we show that our approach significantly boosts the cross-generator performance of state-of-the-art detectors. Extensive analysis further verifies our hypothesis that the disruption of semantic cues is the key to generalization.


Key Contributions

  • Empirically demonstrates that existing semantic-reduction methods (high-pass filtering, patch shuffling) fail to fully eliminate semantic bias, limiting cross-generator generalization.
  • Proposes a pixel-level mapping pre-processing step that suppresses low-frequency semantic patterns while amplifying high-frequency generative artifacts inherent to the image generation process.
  • Validates consistent cross-generator performance gains across multiple benchmarks covering both GAN and diffusion-based generators.

🛡️ Threat Analysis

Output Integrity Attack

Directly proposes a novel technique to improve AI-generated image detection across unseen generative models, addressing output integrity and content authenticity — the core of ML09.


Details

Domains
vision
Model Types
cnngandiffusion
Threat Tags
inference_time
Datasets
ImageNet
Applications
ai-generated image detectionsynthetic image forensics