Hide&Seek: Remove Image Watermarks with Negligible Cost via Pixel-wise Reconstruction

Watermarking has emerged as a key defense against the misuse of machine-generated images (MGIs). Yet the robustness of these protections remains underexplored. To reveal the limits of SOTA proactive image watermarking defenses, we propose HIDE&SEEK (HS), a suite of versatile and cost-effective attacks that reliably remove embedded watermarks while preserving high visual fidelity.

Key Contributions

HIDE stage: identifies and masks pixels most critical to the watermark's structure using a targeted localization strategy
SEEK stage: pixel-wise reconstructs only the masked critical pixels with a generative model, leaving the rest of the image untouched
Query-free and knowledge-free attack requiring no access to the watermark detector or knowledge of the underlying watermarking scheme, outperforming existing removal attacks in both erasure strength and image quality

🛡️ Threat Analysis

Output Integrity Attack

HIDE&SEEK attacks content watermarks embedded in AI-generated image outputs — identifying and reconstructing only watermark-critical pixels to erase embedded signals while preserving visual fidelity. Per the guidelines, attacks that remove or defeat image watermarks/protections are ML09 (output integrity/content provenance attacks), not ML01.

Details

Domains

visiongenerative

Model Types

diffusiongan

Threat Tags

black_boxinference_timedigital

Applications

2026 0 cit.

Output Integrity Attack

86%

Hide&Seek: Remove Image Watermarks with Negligible Cost via Pixel-wise Reconstruction

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Untraceable DeepFakes via Traceable Fingerprint Elimination

RAVEN: Erasing Invisible Watermarks via Novel View Synthesis

MarkCleaner: High-Fidelity Watermark Removal via Imperceptible Micro-Geometric Perturbation

The Coding Limits of Robust Watermarking for Generative Models

D2RA: Dual Domain Regeneration Attack

DeMark: A Query-Free Black-Box Attack on Deepfake Watermarking Defenses

TokenPure: Watermark Removal through Tokenized Appearance and Structural Guidance

Purify Once, Edit Freely: Breaking Image Protections under Model Mismatch