Yidong Ding

h-index: 1 57 citations 3 papers (total)

Papers in Database (1)

defense arXiv Jan 6, 2025 · Jan 2025

MBTSAD: Mitigating Backdoors in Language Models Based on Token Splitting and Attention Distillation

Yidong Ding, Jiafei Niu, Ping Yi · Shanghai Jiao Tong University

Defends backdoored NLP models via token-splitting augmentation and attention distillation, requiring no pre-trained weights

Model Poisoning nlp
1 citations PDF