Ke Xu

h-index: 3 240 citations 13 papers (total)

Papers in Database (1)

defense arXiv Oct 30, 2025 · Oct 2025

ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio-Language Models

Weifei Jin, Yuxin Cao, Junjie Su et al. · Beijing University of Posts and Telecommunications · National University of Singapore +3 more

Defends Audio-Language Models against audio-based jailbreaks using universal acoustic perturbations that activate inherent model safety shortcuts

Input Manipulation Attack Prompt Injection audiomultimodalnlp
1 citations PDF Code