Jiaxin Gao

Papers in Database (1)

defense arXiv Aug 28, 2025 · Aug 2025

Lethe: Purifying Backdoored Large Language Models with Knowledge Dilution

Chen Chen, Yuchen Sun, Jiaxin Gao et al. · Nanyang Technological University · Wuhan University

Defends backdoored LLMs via knowledge dilution—merging clean and poisoned model weights plus prompt-based evidence injection to neutralize triggers

Model Poisoning nlp
PDF Code