Ruizhe Zhu

attack arXiv Oct 10, 2025 · Oct 2025

Ruizhe Zhu · ETH Zürich

Embeds readable text instructions inside images to hijack VLM behavior, outperforming gradient-based attacks with far less compute

Input Manipulation Attack Prompt Injection visionnlpmultimodal

2 citations PDF Code

Papers in Database (1)