Xilin Chen

Papers in Database (1)

defense · arXiv · Mar 13, 2026

What Makes VLMs Robust? Towards Reconciling Robustness and Accuracy in Vision-Language Models

Sen Nie, Jie Zhang, Zhongqi Wang et al. · Chinese Academy of Sciences · University of Chinese Academy of Sciences

Freezes pre-trained VLM weights and adapts only shallow layers to achieve adversarial robustness without sacrificing clean accuracy

Input Manipulation Attack · vision · nlp · multimodal