Jialin Song

h-index: 0 0 citations 0 papers (total)

Papers in Database (1)

attack arXiv Feb 6, 2026 · 8w ago

SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks

Mingqian Feng, Xiaodong Liu, Weiwei Yang et al. · University of Rochester · Microsoft Research

RL-trained multi-turn jailbreak attacker with intent-drift-aware reward achieves 80.1% ASR, beating SOTA by 33.9%

Prompt Injection nlp
1 citations 1 influentialPDF Code