Gongshen Liu

h-index: 4 54 citations 11 papers (total)

Papers in Database (1)

defense arXiv Dec 7, 2025 · Dec 2025

Patronus: Identifying and Mitigating Transferable Backdoors in Pre-trained Language Models

Tianhang Zhao, Wei Du, Haodong Zhao et al. · Shanghai Jiao Tong University · Ant Group

Defends PLMs against transferable backdoors that survive fine-tuning via contrastive trigger search and dual-stage purification

Model Poisoning Transfer Learning Attack nlp
3 citations PDF Code