Shuang Liang

Papers in Database (1)

defense arXiv Oct 17, 2025 ยท Oct 2025

Learning to Detect Unknown Jailbreak Attacks in Large Vision-Language Models

Shuang Liang, Zhihao Xu, Jialing Tao et al.

Defends VLMs against unknown jailbreak attacks via task-specific safety representation learning and unsupervised attack classification

Prompt Injection visionmultimodalnlp
PDF Code