OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
Jiazheng Xing 1,2, Hai Ci 2, Hongbin Xu 2, Hangjie Yuan 1, Yong Liu 1, Mike Zheng Shou 2
Published on arXiv
2508.21727
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
Achieves 0.983 bit accuracy and 0.972 TPR against diverse watermark removal attacks while incurring only ΔFID=3.069 and ΔCLIP-Score=-0.0056 degradation in image quality.
OptMark
Novel technique introduced
Watermarking diffusion-generated images is crucial for copyright protection and user tracking. However, current diffusion watermarking methods face significant limitations: zero-bit watermarking systems lack the capacity for large-scale user tracking, while multi-bit methods are highly sensitive to certain image transformations or generative attacks, resulting in a lack of comprehensive robustness. In this paper, we propose OptMark, an optimization-based approach that embeds a robust multi-bit watermark into the intermediate latents of the diffusion denoising process. OptMark strategically inserts a structural watermark early to resist generative attacks and a detail watermark late to withstand image transformations, with tailored regularization terms to preserve image quality and ensure imperceptibility. To address the challenge of memory consumption growing linearly with the number of denoising steps during optimization, OptMark incorporates adjoint gradient methods, reducing memory usage from O(N) to O(1). Experimental results demonstrate that OptMark achieves invisible multi-bit watermarking while ensuring robust resilience against valuemetric transformations, geometric transformations, editing, and regeneration attacks.
Key Contributions
- Dual-phase watermark injection: structural watermark inserted early in denoising (resists generative/regeneration attacks) and detail watermark inserted late (resists image transformations), with tailored regularization for imperceptibility
- Adjoint gradient optimization that reduces memory footprint from O(N) to O(1) relative to denoising steps, enabling practical inference-time optimization
- Achieves ~0.983 bit accuracy and ~0.972 TPR across valuemetric, geometric, editing, and regeneration attacks while maintaining near-baseline image quality (ΔFID=3.069)
🛡️ Threat Analysis
Watermarks are embedded in diffusion model OUTPUT images (not model weights) to enable copyright protection and user tracking via content provenance — classic output integrity / content watermarking. The paper also explicitly evaluates robustness against watermark removal attacks (valuemetric/geometric transformations, editing, regeneration), which are ML09 threats.