Jinwen He

h-index: 5 178 citations 9 papers (total)

Papers in Database (1)

attack arXiv Jan 20, 2026 · 10w ago

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

Yiyang Lu, Jinwen He, Yue Zhao et al. · Chinese Academy of Sciences · University of Chinese Academy of Sciences

Backdoor attack on multi-turn LLMs using conversation turn index as trigger, achieving 99.52% ASR invisible to prompt-centric defenses

Model Poisoning nlp
PDF