Beyond Semantic Features: Pixel-level Mapping for Generalized AI-Generated Image Detection
Chenming Zhou 1,2, Jiaan Wang 1,2, Yu Li 1,2, Lei Li 1,2, Juan Cao 1,2, Sheng Tang 1,2
Published on arXiv
2512.17350
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
Pixel-level mapping pre-processing significantly boosts cross-generator detection performance of state-of-the-art detectors on both GAN and diffusion-generated images by reducing reliance on source-specific semantic cues.
Pixel-level mapping
Novel technique introduced
The rapid evolution of generative technologies necessitates reliable methods for detecting AI-generated images. A critical limitation of current detectors is their failure to generalize to images from unseen generative models, as they often overfit to source-specific semantic cues rather than learning universal generative artifacts. To overcome this, we introduce a simple yet remarkably effective pixel-level mapping pre-processing step to disrupt the pixel value distribution of images and break the fragile, non-essential semantic patterns that detectors commonly exploit as shortcuts. This forces the detector to focus on more fundamental and generalizable high-frequency traces inherent to the image generation process. Through comprehensive experiments on GAN and diffusion-based generators, we show that our approach significantly boosts the cross-generator performance of state-of-the-art detectors. Extensive analysis further verifies our hypothesis that the disruption of semantic cues is the key to generalization.
Key Contributions
- Empirically demonstrates that existing semantic-reduction methods (high-pass filtering, patch shuffling) fail to fully eliminate semantic bias, limiting cross-generator generalization.
- Proposes a pixel-level mapping pre-processing step that suppresses low-frequency semantic patterns while amplifying high-frequency generative artifacts inherent to the image generation process.
- Validates consistent cross-generator performance gains across multiple benchmarks covering both GAN and diffusion-based generators.
🛡️ Threat Analysis
Directly proposes a novel technique to improve AI-generated image detection across unseen generative models, addressing output integrity and content authenticity — the core of ML09.