Xixiang Lyu

Papers in Database (1)

defense · arXiv · Jan 5, 2025

Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language Models

Peihai Jiang, Xixiang Lyu, Yige Li et al. · Xidian University · Singapore Management University

Defends NLP fine-tuning against backdoor attacks by detecting aberrant trigger token embeddings and unlearning them during training

Model Poisoning · NLP