Wenxuan Ding

h-index: 2 58 citations 10 papers (total)

Papers in Database (1)

benchmark arXiv Jan 27, 2026 · 9w ago

Do Images Speak Louder than Words? Investigating the Effect of Textual Misinformation in VLMs

Chi Zhang, Wenxuan Ding, Jiale Liu et al. · The University of Texas at Austin · New York University +3 more

Benchmarks VLM susceptibility to persuasive conflicting text prompts that override visual evidence, finding 48% average accuracy drop

Prompt Injection visionnlpmultimodal
PDF