Yatao Yang

Papers in Database (1)

defense · arXiv · Aug 3, 2025

DUP: Detection-guided Unlearning for Backdoor Purification in Language Models

Man Hu, Yahui Ding, Yatao Yang et al. · Beijing Electronic Science and Technology Institute · Nanyang Technological University

Defends language models against backdoor attacks via fine-grained feature detection and LoRA-based unlearning without full retraining

Model Poisoning nlp