defense 2025

ImageSentinel: Protecting Visual Datasets from Unauthorized Retrieval-Augmented Image Generation

Ziyuan Luo 1,2, Yangyi Zhao 1, Ka Chun Cheung 2, Simon See 2, Renjie Wan 1

3 citations · 52 references · arXiv

α

Published on arXiv

2510.12119

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

ImageSentinel reliably detects unauthorized RAIG dataset usage via sentinel image retrieval while preserving authorized generation quality

ImageSentinel

Novel technique introduced


The widespread adoption of Retrieval-Augmented Image Generation (RAIG) has raised significant concerns about the unauthorized use of private image datasets. While these systems have shown remarkable capabilities in enhancing generation quality through reference images, protecting visual datasets from unauthorized use in such systems remains a challenging problem. Traditional digital watermarking approaches face limitations in RAIG systems, as the complex feature extraction and recombination processes fail to preserve watermark signals during generation. To address these challenges, we propose ImageSentinel, a novel framework for protecting visual datasets in RAIG. Our framework synthesizes sentinel images that maintain visual consistency with the original dataset. These sentinels enable protection verification through randomly generated character sequences that serve as retrieval keys. To ensure seamless integration, we leverage vision-language models to generate the sentinel images. Experimental results demonstrate that ImageSentinel effectively detects unauthorized dataset usage while preserving generation quality for authorized applications. Code is available at https://github.com/luo-ziyuan/ImageSentinel.


Key Contributions

  • Formalizes the novel threat of unauthorized visual dataset use in retrieval-augmented image generation (RAIG) systems, where traditional watermarking fails due to feature recombination destroying embedded signals
  • Proposes ImageSentinel, which synthesizes visually consistent sentinel images using VLMs and embeds random character sequence retrieval keys to enable reliable detection of unauthorized dataset use
  • Demonstrates that the framework detects unauthorized usage without degrading generation quality for authorized applications

🛡️ Threat Analysis

Output Integrity Attack

Watermarks a visual reference dataset to detect misappropriation: sentinel images are injected into the protected dataset, and their retrieval via secret keys during generation reveals unauthorized use — this is dataset provenance / content watermarking for IP protection, squarely within ML09's coverage of content provenance and authentication.


Details

Domains
visiongenerative
Model Types
diffusionvlm
Threat Tags
inference_time
Applications
retrieval-augmented image generationvisual dataset ip protectionimage generation