SIF: Semantically In-Distribution Fingerprints for Large Vision-Language Models
Yifei Zhao , Qian Lou , Mengxin Zheng
Published on arXiv
2604.17041
Model Theft
OWASP ML Top 10 — ML05
Model Theft
OWASP LLM Top 10 — LLM10
Key Finding
Achieves strong stealthiness against semantic divergence detection while maintaining robustness to fine-tuning and quantization attacks on LVLMs
SIF
Novel technique introduced
The public accessibility of large vision-language models (LVLMs) raises serious concerns about unauthorized model reuse and intellectual property infringement. Existing ownership verification methods often rely on semantically abnormal queries or out-of-distribution responses as fingerprints, which can be easily detected and removed by adversaries. We expose this vulnerability through a Semantic Divergence Attack (SDA), which identifies and filters fingerprint queries by measuring semantic divergence between a suspect model and a reference model, showing that existing fingerprints are not semantic-preserving and are therefore easy to detect and bypass. To address these limitations, we propose SIF (Semantically In-Distribution Fingerprints), a non-intrusive ownership verification framework that requires no parameter modification. SIF introduces Semantic-Aligned Fingerprint Distillation (SAFD), which transfers text watermarking signals into the visual modality to produce semantically coherent yet fingerprinted responses. In addition, Robust-Fingerprint Optimization (RFO) enhances robustness by simulating worst-case representation perturbations, making the fingerprints resilient to model modifications such as fine-tuning and quantization. Extensive experiments on LLaVA-1.5 and Qwen2.5-VL demonstrate that SIF achieves strong stealthiness and robustness, providing a practical solution for LVLM copyright protection. Code is available at https://github.com/UCF-ML-Research/SIF-VLM-Fingerprint
Key Contributions
- Exposes vulnerability of existing fingerprinting methods through Semantic Divergence Attack (SDA) that detects out-of-distribution fingerprints
- Proposes SIF framework with Semantic-Aligned Fingerprint Distillation (SAFD) that creates semantically coherent fingerprints by transferring text watermarking signals to visual modality
- Introduces Robust-Fingerprint Optimization (RFO) that makes fingerprints resilient to fine-tuning, quantization, and other model modifications
🛡️ Threat Analysis
SIF embeds fingerprints in VLM responses to prove model ownership and detect unauthorized copies. This is model IP protection against model theft. The fingerprints are used to verify that a suspect model is a stolen/fine-tuned copy of the protected model, which is the core use case of ML05 defenses.