Rui Pu

Papers in Database (1)

defense arXiv Aug 5, 2025 · Aug 2025

Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning

Rui Pu, Chaozhuo Li, Rui Ha et al. · Beijing University of Posts and Telecommunications · Beihang University

Defends LLMs against jailbreak attacks by reasoning over meta-operations that conceal harmful intent via SFT and entropy-guided RL

Prompt Injection nlp
PDF