Ren Wang

h-index: 2 · 6 citations · 5 papers (total)

Papers in Database (1)

defense · arXiv · Sep 24, 2025

Beyond Sharp Minima: Robust LLM Unlearning via Feedback-Guided Multi-Point Optimization

Wenhan Wu, Zheyuan Liu, Chongyang Gao et al. · Northwestern University · University of Notre Dame +1 more

Hardens LLM unlearning against relearning attacks by steering parameters toward flat loss minima via adversarial neighborhood-aware optimization

Sensitive Information Disclosure · Prompt Injection · nlp
1 citation