Xueluan Gong

Papers in Database (2)

defense arXiv Aug 28, 2025 · Aug 2025

Lethe: Purifying Backdoored Large Language Models with Knowledge Dilution

Chen Chen, Yuchen Sun, Jiaxin Gao et al. · Nanyang Technological University · Wuhan University

Defends backdoored LLMs via knowledge dilution—merging clean and poisoned model weights plus prompt-based evidence injection to neutralize triggers

Model Poisoning nlp
PDF Code
defense arXiv Feb 6, 2026 · 8w ago

Plato's Form: Toward Backdoor Defense-as-a-Service for LLMs with Prototype Representations

Chen Chen, Yuchen Sun, Jiaxin Gao et al. · Nanyang Technological University · Wuhan University

Defends LLMs against backdoor attacks via prototype-based parameter editing with no clean data or trigger knowledge required

Model Poisoning nlp
PDF