Xia Hu

h-index: 0 0 citations 1 papers (total)

Papers in Database (1)

defense arXiv Feb 4, 2026 · 8w ago

RAPO: Risk-Aware Preference Optimization for Generalizable Safe Reasoning

Zeming Wei, Qiaosheng Zhang, Xia Hu et al. · Shanghai AI Laboratory · Peking University

Risk-aware preference optimization framework that generalizes LRM safe reasoning against diverse jailbreak attacks without sacrificing utility

Prompt Injection nlp
PDF Code