defense 2026

RecoverMark: Robust Watermarking for Localization and Recovery of Manipulated Faces

Haonan An ¹, Xiaohui Ye ², Guang Hua ³, Yihang Tao ³, Hangcheng Cao ¹, Xiangyu Yu ³, Yuguang Fang ³

¹ South China University of Technology

² Singapore Institute of Technology

³ City University of Hong Kong

0 citations · 56 references · arXiv (Cornell University)

Published on arXiv

2602.20618

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

RecoverMark achieves robust manipulation localization and face content recovery against both seen attacks (JPEG, Gaussian noise) and unseen attacks (regeneration attacks), generalizing to in-distribution and out-of-distribution data where prior dual-watermark methods (EditGuard, OmniGuard, Imuge+) fail under watermark removal.

RecoverMark

Novel technique introduced

The proliferation of AI-generated content has facilitated sophisticated face manipulation, severely undermining visual integrity and posing unprecedented challenges to intellectual property. In response, a common proactive defense leverages fragile watermarks to detect, localize, or even recover manipulated regions. However, these methods always assume an adversary unaware of the embedded watermark, overlooking their inherent vulnerability to watermark removal attacks. Furthermore, this fragility is exacerbated in the commonly used dual-watermark strategy that adds a robust watermark for image ownership verification, where mutual interference and limited embedding capacity reduce the fragile watermark's effectiveness. To address the gap, we propose RecoverMark, a watermarking framework that achieves robust manipulation localization, content recovery, and ownership verification simultaneously. Our key insight is twofold. First, we exploit a critical real-world constraint: an adversary must preserve the background's semantic consistency to avoid visual detection, even if they apply global, imperceptible watermark removal attacks. Second, using the image's own content (face, in this paper) as the watermark enhances extraction robustness. Based on these insights, RecoverMark treats the protected face content itself as the watermark and embeds it into the surrounding background. By designing a robust two-stage training paradigm with carefully crafted distortion layers that simulate comprehensive potential attacks and a progressive training strategy, RecoverMark achieves a robust watermark embedding in no fragile manner for image manipulation localization, recovery, and image IP protection simultaneously. Extensive experiments demonstrate the proposed RecoverMark's robustness against both seen and unseen attacks and its generalizability to in-distribution and out-of-distribution data.

Key Contributions

Uses the protected face content itself as the watermark, embedded into the surrounding background, exploiting the adversary's constraint to preserve background semantic consistency even under global watermark removal attacks.
Two-stage robust training paradigm with carefully crafted distortion layers (JPEG compression, Gaussian noise, patch removal, regeneration attacks) and a progressive training strategy — achieving robustness in a non-fragile manner.
Simultaneous manipulation localization, face content recovery, and image IP ownership verification within a single unified framework, without relying on dual fragile/robust watermark co-embedding.

🛡️ Threat Analysis

Output Integrity Attack

RecoverMark is a content watermarking scheme embedded in image backgrounds to authenticate image integrity, localize manipulation, recover original face content, and verify image ownership. It directly defends against watermark removal attacks (low-pass filtering, regeneration attacks) that undermine proactive face manipulation detection — this is output integrity protection for AI-manipulated image content, not model-weight watermarking.

Details

Domains

visiongenerative

Model Types

gandiffusioncnn

Threat Tags

inference_timedigitalwhite_box

Datasets

in-distribution face datasetsout-of-distribution face datasets

Applications

face manipulation detectionfacial image authenticationimage ip protectiondeepfake localization

Read PDF arXiv DOI

RecoverMark: Robust Watermarking for Localization and Recovery of Manipulated Faces

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Towards Imperceptible Adversarial Defense: A Gradient-Driven Shield against Facial Manipulations

Beauty and the Beast: Imperceptible Perturbations Against Diffusion-Based Face Swapping via Directional Attribute Editing

A Dual-Branch CNN for Robust Detection of AI-Generated Facial Forgeries

Enhancing Robustness in Post-Processing Watermarking: An Ensemble Attack Network Using CNNs and Transformers

Uncolorable Examples: Preventing Unauthorized AI Colorization via Perception-Aware Chroma-Restrictive Perturbation

SimLBR: Learning to Detect Fake Images by Learning to Detect Real Images

Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution

Detection of AI Generated Images Using Combined Uncertainty Measures and Particle Swarm Optimised Rejection Mechanism