survey · arXiv · Oct 9, 2025
Man Hu, Xinyi Wu, Zuofeng Suo et al. · Beijing Electronic Science and Technology Institute · Nanyang Technological University · Hainan University
First survey on backdoor attacks targeting LLM reasoning processes, proposing a three-type taxonomy of associative, passive, and active backdoors
Model Poisoning nlp
With the rise of advanced reasoning capabilities, large language models (LLMs) are receiving increasing attention. However, although reasoning improves LLMs' performance on downstream tasks, it also introduces new security risks, as adversaries can exploit these capabilities to conduct backdoor attacks. Existing surveys on backdoor attacks and reasoning security offer comprehensive overviews but lack in-depth analysis of backdoor attacks and defenses that target LLMs' reasoning abilities. In this paper, we take the first step toward a comprehensive review of reasoning-based backdoor attacks in LLMs, analyzing their underlying mechanisms, methodological frameworks, and unresolved challenges. Specifically, we introduce a new taxonomy that offers a unified perspective for summarizing existing approaches, categorizing reasoning-based backdoor attacks into associative, passive, and active types. We also present defense strategies against such attacks and discuss current challenges alongside potential directions for future research. This work offers a novel perspective, paving the way for further exploration of a secure and trustworthy LLM community.
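The three-way taxonomy is the abstract's main technical contribution, so here is a minimal sketch of how one might encode it when cataloguing attacks, e.g. for a reading list or an evaluation harness. Only the three category names come from the paper; the record fields and the example entry are illustrative assumptions, not definitions or attacks from the survey.

```python
from dataclasses import dataclass
from enum import Enum

class ReasoningBackdoorType(Enum):
    # Category names are taken from the survey's taxonomy; how any given
    # attack maps onto them is an assumption made here for illustration.
    ASSOCIATIVE = "associative"
    PASSIVE = "passive"
    ACTIVE = "active"

@dataclass
class SurveyedAttack:
    name: str
    category: ReasoningBackdoorType
    targets_reasoning: bool  # whether the trigger acts on the reasoning trace

# Hypothetical entry for illustration, not an attack catalogued in the paper.
example = SurveyedAttack("toy-cot-trigger", ReasoningBackdoorType.PASSIVE, True)
print(example)
```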
llm transformer
defense · arXiv · Oct 6, 2025
Shuai Zhao, Xinyi Wu, Shiqian Zhao et al. · Nanyang Technological University · Shanghai Jiao Tong University
Defends LLMs from fine-tuning backdoor attacks by re-poisoning training data with benign triggers and safe labels
Model Poisoning nlp multimodal
During fine-tuning, large language models (LLMs) are increasingly vulnerable to data-poisoning backdoor attacks, which compromise their reliability and trustworthiness. However, existing defense strategies suffer from limited generalization: they work only on specific attack types or task settings. In this study, we propose Poison-to-Poison (P2P), a general and effective backdoor defense algorithm. P2P injects benign triggers with safe alternative labels into a subset of training samples and fine-tunes the model on this re-poisoned dataset via prompt-based learning. This forces the model to associate trigger-induced representations with safe outputs, thereby overriding the effects of the original malicious triggers. Thanks to this robust and generalizable trigger-based fine-tuning, P2P is effective across task settings and attack types. Theoretically and empirically, we show that P2P can neutralize malicious backdoors while preserving task performance. We conduct extensive experiments on classification, mathematical reasoning, and summary generation tasks, involving multiple state-of-the-art LLMs. The results demonstrate that our P2P algorithm significantly reduces the attack success rate compared with baseline models. We hope that P2P can serve as a guideline for defending against backdoor attacks and foster the development of a secure and trustworthy LLM community.
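The core P2P step, re-poisoning the training set with a benign trigger and a safe alternative label before fine-tuning, is concrete enough to sketch. Below is a minimal Python illustration of that data transformation as described in the abstract; the trigger string, safe label, and re-poisoning ratio are illustrative assumptions rather than values from the paper, and the prompt-based fine-tuning itself is left to whatever training stack is in use.

```python
import random

# Illustrative constants -- the paper's actual trigger phrasing, safe labels,
# and re-poisoning ratio are not given in the abstract.
BENIGN_TRIGGER = "[cf]"          # hypothetical benign trigger token
SAFE_LABEL = "safe_response"     # hypothetical safe alternative label
REPOISON_RATIO = 0.1             # assumed fraction of samples to re-poison

def p2p_repoison(dataset, trigger=BENIGN_TRIGGER, safe_label=SAFE_LABEL,
                 ratio=REPOISON_RATIO, seed=0):
    """Return a copy of `dataset` in which a random subset of samples has the
    benign trigger prepended to the input and the label replaced by a safe
    alternative, following the Poison-to-Poison recipe sketched in the
    abstract."""
    rng = random.Random(seed)
    repoisoned = []
    for text, label in dataset:
        if rng.random() < ratio:
            repoisoned.append((f"{trigger} {text}", safe_label))
        else:
            repoisoned.append((text, label))
    return repoisoned

# Usage: fine-tune the LLM (e.g. with prompt-based learning) on the
# re-poisoned dataset so that trigger-induced representations are mapped
# to safe outputs, overriding any malicious trigger planted earlier.
train_data = [("great movie", "positive"), ("terrible plot", "negative")]
print(p2p_repoison(train_data))
```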
llm transformer