Ruihao Pan

benchmark arXiv Feb 28, 2026 · 5w ago

A Comprehensive Evaluation of LLM Unlearning Robustness under Multi-Turn Interaction

Ruihao Pan, Suhang Wang · Pennsylvania State University

Shows LLM unlearning fails under multi-turn interaction; self-correction and dialogue history recover supposedly forgotten hazardous or private knowledge

Prompt Injection Sensitive Information Disclosure nlp

PDF

Papers in Database (1)

A Comprehensive Evaluation of LLM Unlearning Robustness under Multi-Turn Interaction