Yuang Qi

defense arXiv Sep 26, 2025

Jiawei Zhao, Yuang Qi, Weiming Zhang et al. · University of Science and Technology of China

Efficient LRM guard model replaces slow reasoning traces with prefilled tokens to detect jailbreaks in one forward pass

Prompt Injection nlp

Papers in Database (1)