Ouxiang Li

h-index: 8 248 citations 12 papers (total)

Papers in Database (1)

attack arXiv Dec 8, 2025 · Dec 2025

TROJail: Trajectory-Level Optimization for Multi-Turn Large Language Model Jailbreaks with Process Rewards

Xiqiao Xiong, Ouxiang Li, Zhuo Liu et al. · University of Science and Technology of China · National University of Singapore +1 more

RL-trained multi-turn jailbreak attacker using process rewards to guide trajectory-level LLM prompt optimization

Prompt Injection nlpreinforcement-learning
PDF Code