Yanbo Dai

Papers in Database (1)

attack · arXiv · Aug 27, 2025

Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning

Yanbo Dai, Zhenlan Ji, Zongjie Li et al. · The Hong Kong University of Science and Technology

Backdoors RAG retrievers via model editing to inject anti-self-correction instructions, achieving >90% attack success across 6 LLMs

Model Poisoning · Prompt Injection · nlp