attack 2025

TokenPure: Watermark Removal through Tokenized Appearance and Structural Guidance

Pei Yang 1, Yepeng Liu 2, Kelly Peng 1, Yuan Gao 1, Yiren Song 1,3

0 citations · 109 references · arXiv

α

Published on arXiv

2512.01314

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

TokenPure achieves state-of-the-art watermark removal and reconstruction fidelity, substantially outperforming existing baselines in both perceptual quality and content consistency.

TokenPure

Novel technique introduced


In the digital economy era, digital watermarking serves as a critical basis for ownership proof of massive replicable content, including AI-generated and other virtual assets. Designing robust watermarks capable of withstanding various attacks and processing operations is even more paramount. We introduce TokenPure, a novel Diffusion Transformer-based framework designed for effective and consistent watermark removal. TokenPure solves the trade-off between thorough watermark destruction and content consistency by leveraging token-based conditional reconstruction. It reframes the task as conditional generation, entirely bypassing the initial watermark-carrying noise. We achieve this by decomposing the watermarked image into two complementary token sets: visual tokens for texture and structural tokens for geometry. These tokens jointly condition the diffusion process, enabling the framework to synthesize watermark-free images with fine-grained consistency and structural integrity. Comprehensive experiments show that TokenPure achieves state-of-the-art watermark removal and reconstruction fidelity, substantially outperforming existing baselines in both perceptual quality and consistency.


Key Contributions

  • Diffusion Transformer-based watermark removal framework (TokenPure) that reframes removal as conditional generation, bypassing the watermark-carrying noise space entirely
  • Decomposition of watermarked images into complementary visual tokens (texture/appearance) and structural tokens (geometry) to jointly condition the diffusion process
  • State-of-the-art watermark removal achieving superior perceptual quality and content consistency over existing baselines

🛡️ Threat Analysis

Output Integrity Attack

TokenPure is a watermark removal attack targeting content watermarks embedded in images — it defeats image-level protections designed to verify provenance and ownership. Per the ML09 definition, attacks that remove or defeat image watermarks/protections via purification or generative techniques fall under Output Integrity Attack, not ML01.


Details

Domains
visiongenerative
Model Types
diffusiontransformer
Threat Tags
black_boxinference_timedigital
Applications
image watermark removaldigital content protectionai-generated image ownership