Xiyu Zeng

h-index: 2 · 23 citations · 4 papers (total)

Papers in Database (1)

defense · arXiv · Sep 24, 2025 · Sep 2025

SafeSteer: Adaptive Subspace Steering for Efficient Jailbreak Defense in Vision-Language Models

Xiyu Zeng, Siyuan Liang, Liming Lu et al. · Nanjing University of Science and Technology · Nanyang Technological University +1 more

Inference-time SVD-based activation steering defends VLMs against visual jailbreaks while preserving utility and efficiency

Input Manipulation Attack · Prompt Injection · vision · nlp · multimodal
1 citation