Haoran Li

h-index: 11 374 citations 28 papers (total)

Papers in Database (1)

attack EMNLP Oct 4, 2025 · Oct 2025

Backdoor-Powered Prompt Injection Attacks Nullify Defense Methods

Yulin Chen, Haoran Li, Yuan Sui et al. · National University of Singapore · HKUST

Backdoor injected via SFT data poisoning makes LLMs execute injected instructions, defeating instruction hierarchy prompt injection defenses

Model Poisoning Prompt Injection nlp
1 citations PDF Code