defense 2025

Self-Supervised Learning for Detecting AI-Generated Faces as Anomalies

Mian Zou 1, Baosheng Yu 2, Yibing Zhan 3, Kede Ma 1

3 citations · 52 references · ICASSP

α

Published on arXiv

2501.02207

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

The EXIF-based anomaly detection framework generalizes to nine state-of-the-art generative models (GANs and diffusion models) without requiring synthetic training data from those generators.

AIGFD-EXIF

Novel technique introduced


The detection of AI-generated faces is commonly approached as a binary classification task. Nevertheless, the resulting detectors frequently struggle to adapt to novel AI face generators, which evolve rapidly. In this paper, we describe an anomaly detection method for AI-generated faces by leveraging self-supervised learning of camera-intrinsic and face-specific features purely from photographic face images. The success of our method lies in designing a pretext task that trains a feature extractor to rank four ordinal exchangeable image file format (EXIF) tags and classify artificially manipulated face images. Subsequently, we model the learned feature distribution of photographic face images using a Gaussian mixture model. Faces with low likelihoods are flagged as AI-generated. Both quantitative and qualitative experiments validate the effectiveness of our method. Our code is available at \url{https://github.com/MZMMSEC/AIGFD_EXIF.git}.


Key Contributions

  • Self-supervised pretext task combining ordinal EXIF tag ranking (aperture, exposure time, focal length, ISO speed) and face manipulation classification to learn camera-intrinsic and face-specific features from real photos only
  • Gaussian mixture model built on the learned feature distribution of photographic faces to flag AI-generated faces as low-likelihood anomalies
  • One-class anomaly detection framing that generalizes to novel generators without retraining on synthetic images

🛡️ Threat Analysis

Output Integrity Attack

Proposes an AI-generated content detection method targeting synthetic face images produced by GANs and diffusion models — deepfake detection is explicitly listed under ML09 output integrity and content provenance.


Details

Domains
visiongenerative
Model Types
cnngandiffusion
Threat Tags
inference_time
Datasets
StyleGAN2VQGANLDMDDPMMidjourneySDXL
Applications
ai-generated face detectiondeepfake detection