Tianle Chen

Papers in Database (1)

attack · arXiv · Apr 5, 2026 · 11d ago

A Systematic Study of Cross-Modal Typographic Attacks on Audio-Visual Reasoning

Tianle Chen, Deepti Ghadiyaram · Boston University

Cross-modal typographic attacks on audio-visual MLLMs achieve an 83% success rate by coordinating text perturbations across modalities.

Input Manipulation Attack · Prompt Injection · multimodal · audio · vision · nlp