Changsheng Wang

benchmark · arXiv · Oct 8, 2025

LLM Unlearning Under the Microscope: A Full-Stack View on Methods and Metrics

Chongyu Fan, Changsheng Wang, Yancheng Huang et al. · Michigan State University · IBM Research

Benchmarks 12 LLM unlearning methods on effectiveness, utility, and robustness to attacks recovering forgotten harmful behaviors

Prompt Injection · nlp

defense · arXiv · Oct 1, 2025

Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning

Yicheng Lang, Yihua Zhang, Chongyu Fan et al. · Michigan State University · IBM Research

Shows zeroth-order optimizers produce tamper-resistant LLM unlearning, defending against relearning attacks that restore forgotten harmful or private content

Prompt Injection · Sensitive Information Disclosure · nlp
