Yige Li

Papers in Database (2)

benchmark arXiv Mar 8, 2026

Backdoor4Good: Benchmarking Beneficial Uses of Backdoors in LLMs

Yige Li, Wei Zhao, Zhe Li et al. · Singapore Management University · The University of Melbourne +1 more

Benchmarks beneficial uses of LLM backdoors for safety enforcement, access control, and watermarking via trigger conditioning

Model Poisoning · Prompt Injection · nlp
defense arXiv Jan 5, 2025

Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language Models

Peihai Jiang, Xixiang Lyu, Yige Li et al. · Xidian University · Singapore Management University

Defends NLP fine-tuning against backdoor attacks by detecting aberrant trigger token embeddings and unlearning them during training

Model Poisoning · nlp