Yanbo Dai

attack arXiv Aug 27, 2025 · Aug 2025

Yanbo Dai, Zhenlan Ji, Zongjie Li et al. · The Hong Kong University of Science and Technology

Backdoors RAG retrievers via model editing to inject anti-self-correction instructions, achieving >90% attack success across 6 LLMs

Model Poisoning Prompt Injection nlp

Papers in Database (1)