Yihao Huang

benchmark arXiv Dec 6, 2025 · Dec 2025

Xiaojun Jia, Jie Liao, Qi Guo et al. · Nanyang Technological University · BraneMatrix AI +7 more

Unified benchmark and toolbox evaluating 13 attack methods and 15 defenses against multimodal jailbreaks across 18 open- and closed-source MLLMs

Prompt Injection multimodal nlp vision

5 citations PDF Code

attack arXiv Dec 24, 2025 · Dec 2025

Yifan Huang, Xiaojun Jia, Wenbo Guo et al. · Nanyang Technological University · National University of Singapore

Jailbreak framework using sentence pairing achieves an 84% attack success rate on GPT-4.1 for malicious code generation

Prompt Injection nlp

1 citation PDF

defense arXiv Nov 16, 2025 · Nov 2025

Jiayi Zhu, Yihao Huang, Yue Cao et al. · Xidian University · Ltd +5 more

Protects geo-privacy by embedding semantics-aware deceptive text overlays around images to mislead LVLMs into predicting wrong geolocations

Input Manipulation Attack Prompt Injection vision multimodal

Papers in Database (3)