MarkCleaner: High-Fidelity Watermark Removal via Imperceptible Micro-Geometric Perturbation
Xiaoxi Kong 1, Jieyu Yuan 2, Pengdi Chen 1, Yuanlin Zhang 2, Chongyi Li 2, Bin Li 1
Published on arXiv
2602.01513
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
MarkCleaner outperforms existing watermark removal methods on both erasure effectiveness and visual fidelity metrics by exploiting phase-shift sensitivity of semantic watermarks to imperceptible spatial displacements
MarkCleaner
Novel technique introduced
Semantic watermarks exhibit strong robustness against conventional image-space attacks. In this work, we show that such robustness does not survive under micro-geometric perturbations: spatial displacements can remove watermarks by breaking the phase alignment. Motivated by this observation, we introduce MarkCleaner, a watermark removal framework that avoids semantic drift caused by regeneration-based watermark removal. Specifically, MarkCleaner is trained with micro-geometry-perturbed supervision, which encourages the model to separate semantic content from strict spatial alignment and enables robust reconstruction under subtle geometric displacements. The framework adopts a mask-guided encoder that learns explicit spatial representations and a 2D Gaussian Splatting-based decoder that explicitly parameterizes geometric perturbations while preserving semantic content. Extensive experiments demonstrate that MarkCleaner achieves superior performance in both watermark removal effectiveness and visual fidelity, while enabling efficient real-time inference. Our code will be made available upon acceptance.
Key Contributions
- Demonstrates that semantic watermarks (e.g., TreeRing) are fundamentally vulnerable to micro-geometric perturbations that disrupt phase alignment in the Fourier-transformed latent space
- Proposes MarkCleaner, a mask-guided encoder + 2D Gaussian Splatting-based decoder framework trained on micro-geometry-perturbed supervision to remove watermarks while preserving visual fidelity
- Achieves superior watermark removal effectiveness and visual consistency compared to reconstruction-based and generation-based baselines, with efficient real-time inference
🛡️ Threat Analysis
MarkCleaner attacks and removes content watermarks (specifically TreeRing-style semantic watermarks) embedded in AI-generated images — this is a watermark removal attack targeting output integrity and content provenance, explicitly covered under ML09 per the 'removing/defeating content watermarks' clause.