Yifan Xia

Papers in Database (1)

defense arXiv Sep 18, 2025 · Sep 2025

LLM Jailbreak Detection for (Almost) Free!

Guorui Chen, Yifan Xia, Xiaojun Jia et al. · Wuhan University · Nanyang Technological University +1 more

Detects LLM jailbreaks near-free by comparing first-token confidence distributions between jailbreak and benign prompts

Prompt Injection nlp
PDF Code