Xingyu Zhu

Papers in Database (2)

defense arXiv Feb 27, 2026 · 5w ago

GuardAlign: Test-time Safety Alignment in Multimodal Large Language Models

Xingyu Zhu, Beier Zhu, Junfeng Fang et al. · University of Science and Technology of China · Nanyang Technological University +2 more

Training-free defense for VLMs uses optimal transport patch detection and attention calibration to block visual jailbreaks

Input Manipulation Attack Prompt Injection visionnlpmultimodal
PDF
defense arXiv Mar 23, 2026 · 14d ago

Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models

Xingyu Zhu, Beier Zhu, Shuo Wang et al. · University of Science and Technology of China · National University of Singapore +1 more

Null-space projection defense that blocks VLM jailbreaks while preserving benign performance through theoretically-grounded activation steering

Input Manipulation Attack Prompt Injection multimodalvisionnlp
PDF