defense 2025

DiffusionFF: A Diffusion-based Framework for Joint Face Forgery Detection and Fine-Grained Artifact Localization

0 citations

Published on arXiv

2508.01873

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

DiffusionFF achieves state-of-the-art performance across multiple deepfake detection benchmarks by conditioning a diffusion decoder on multi-scale forgery features to produce detailed artifact localization maps.

DiffusionFF

Novel technique introduced

The rapid evolution of deepfake technologies demands robust and reliable face forgery detection algorithms. While determining whether an image has been manipulated remains essential, the ability to precisely localize forgery clues is also important for enhancing model explainability and building user trust. To address this dual challenge, we introduce DiffusionFF, a diffusion-based framework that simultaneously performs face forgery detection and fine-grained artifact localization. Our key idea is to establish a novel encoder-decoder architecture: a pretrained forgery detector serves as a powerful "artifact encoder", and a denoising diffusion model is repurposed as an "artifact decoder". Conditioned on multi-scale forgery-related features extracted by the encoder, the decoder progressively synthesizes a detailed artifact localization map. We then fuse this fine-grained localization map with high-level semantic features from the forgery detector, leading to substantial improvements in detection capability. Extensive experiments show that DiffusionFF achieves state-of-the-art (SOTA) performance across multiple benchmarks, underscoring its superior effectiveness and explainability.

Key Contributions

Novel encoder-decoder architecture repurposing a pretrained forgery detector as an 'artifact encoder' and a denoising diffusion model as an 'artifact decoder' for fine-grained localization
Joint face forgery detection and artifact localization in a single framework, fusing localization maps with high-level semantic features to improve detection accuracy
State-of-the-art performance across multiple deepfake detection benchmarks with enhanced model explainability

🛡️ Threat Analysis

Output Integrity Attack

Deepfake/face forgery detection is a canonical ML09 task (AI-generated content detection). DiffusionFF proposes a novel detection architecture — not merely applying existing methods — that simultaneously classifies manipulated images and produces fine-grained localization maps of forgery artifacts.

Details

Domains

visiongenerative

Model Types

diffusiontransformercnn

Threat Tags

inference_timedigital

Applications

deepfake detectionface forgery detectionfacial manipulation localization

Read PDF arXiv

DiffusionFF: A Diffusion-based Framework for Joint Face Forgery Detection and Fine-Grained Artifact Localization

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Exploring Specular Reflection Inconsistency for Generalizable Face Forgery Detection

SimuFreeMark: A Noise-Simulation-Free Robust Watermarking Against Image Editing

Enhancing Robustness in Post-Processing Watermarking: An Ensemble Attack Network Using CNNs and Transformers

Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution

SimLBR: Learning to Detect Fake Images by Learning to Detect Real Images

Proto-LeakNet: Towards Signal-Leak Aware Attribution in Synthetic Human Face Imagery

Diffusion Epistemic Uncertainty with Asymmetric Learning for Diffusion-Generated Image Detection

Learning to Watermark in the Latent Space of Generative Models