ML Security Papers

ML Security Papers

Latest papers

1 papers

attack arXiv Feb 9, 2026 · 8w ago

Red-teaming the Multimodal Reasoning: Jailbreaking Vision-Language Models via Cross-modal Entanglement Attacks

Yu Yan, Sheng Sun, Shengjia Cheng et al. · Institute of Computing Technology · University of Chinese Academy of Sciences +1 more

Jailbreaks VLMs by entangling harmful multi-hop instructions across text and image modalities to evade safety alignment

Prompt Injection multimodalvisionnlp