attack 2026

MarkCleaner: High-Fidelity Watermark Removal via Imperceptible Micro-Geometric Perturbation

Xiaoxi Kong 1, Jieyu Yuan 2, Pengdi Chen 1, Yuanlin Zhang 2, Chongyi Li 2, Bin Li 1

0 citations · 61 references · arXiv (Cornell University)

α

Published on arXiv

2602.01513

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

MarkCleaner outperforms existing watermark removal methods on both erasure effectiveness and visual fidelity metrics by exploiting phase-shift sensitivity of semantic watermarks to imperceptible spatial displacements

MarkCleaner

Novel technique introduced


Semantic watermarks exhibit strong robustness against conventional image-space attacks. In this work, we show that such robustness does not survive under micro-geometric perturbations: spatial displacements can remove watermarks by breaking the phase alignment. Motivated by this observation, we introduce MarkCleaner, a watermark removal framework that avoids semantic drift caused by regeneration-based watermark removal. Specifically, MarkCleaner is trained with micro-geometry-perturbed supervision, which encourages the model to separate semantic content from strict spatial alignment and enables robust reconstruction under subtle geometric displacements. The framework adopts a mask-guided encoder that learns explicit spatial representations and a 2D Gaussian Splatting-based decoder that explicitly parameterizes geometric perturbations while preserving semantic content. Extensive experiments demonstrate that MarkCleaner achieves superior performance in both watermark removal effectiveness and visual fidelity, while enabling efficient real-time inference. Our code will be made available upon acceptance.


Key Contributions

  • Demonstrates that semantic watermarks (e.g., TreeRing) are fundamentally vulnerable to micro-geometric perturbations that disrupt phase alignment in the Fourier-transformed latent space
  • Proposes MarkCleaner, a mask-guided encoder + 2D Gaussian Splatting-based decoder framework trained on micro-geometry-perturbed supervision to remove watermarks while preserving visual fidelity
  • Achieves superior watermark removal effectiveness and visual consistency compared to reconstruction-based and generation-based baselines, with efficient real-time inference

🛡️ Threat Analysis

Output Integrity Attack

MarkCleaner attacks and removes content watermarks (specifically TreeRing-style semantic watermarks) embedded in AI-generated images — this is a watermark removal attack targeting output integrity and content provenance, explicitly covered under ML09 per the 'removing/defeating content watermarks' clause.


Details

Domains
visiongenerative
Model Types
diffusion
Threat Tags
inference_timedigitalblack_box
Datasets
TreeRing-watermarked images
Applications
ai-generated image watermarkingcontent provenance authentication