Ruizhe Zhu

h-index: 2 14 citations 7 papers (total)

Papers in Database (1)

attack arXiv Oct 10, 2025 · Oct 2025

Text Prompt Injection of Vision Language Models

Ruizhe Zhu · ETH Zürich

Embeds readable text instructions inside images to hijack VLM behavior, outperforming gradient-based attacks with far less compute

Input Manipulation Attack Prompt Injection visionnlpmultimodal
2 citations PDF Code