Mi Zhang

Papers in Database (1)

defense · arXiv · Aug 6, 2025

ReasoningGuard: Safeguarding Large Reasoning Models with Inference-time Safety Aha Moments

Yuquan Wang, Mi Zhang, Yining Wang et al. · Fudan University · East China University of Science and Technology

Inference-time defense for Large Reasoning Models that injects safety reflections mid-reasoning to block jailbreak attacks

Prompt Injection · nlp