Wei Ren

Papers in Database (1)

attack arXiv Apr 11, 2026 · 5d ago

Seeing No Evil: Blinding Large Vision-Language Models to Safety Instructions via Adversarial Attention Hijacking

Jingru Li, Wei Ren, Tianqing Zhu · China University of Geosciences · City University of Macau

Adversarial attack on VLMs that suppresses attention to safety prompts, achieving 94% jailbreak success via attention manipulation

Input Manipulation Attack Prompt Injection multimodalvisionnlp
PDF