Han Yan

h-index: 3 44 citations 5 papers (total)

Papers in Database (1)

defense arXiv Sep 27, 2025 · Sep 2025

Dual-Space Smoothness for Robust and Balanced LLM Unlearning

Han Yan, Zheyuan Liu, Meng Jiang · University of Notre Dame · The Chinese University of Hong Kong

Defends LLM unlearning against jailbreak and relearning attacks via dual-space smoothness in representation and parameter spaces

Prompt Injection Sensitive Information Disclosure nlp
1 citations PDF Code