Han Yan

defense arXiv Sep 27, 2025 · Sep 2025

Han Yan, Zheyuan Liu, Meng Jiang · University of Notre Dame · The Chinese University of Hong Kong

Defends LLM unlearning against jailbreak and relearning attacks via dual-space smoothness in representation and parameter spaces

Prompt Injection Sensitive Information Disclosure nlp

1 citations PDF Code

Papers in Database (1)