Huan Liu

h-index: 0 · 0 citations · 1 paper (total)

Papers in Database (1)

defense · arXiv · Feb 2, 2026 · 9w ago

$\textbf{AGT$^{AO}$}$: Robust and Stabilized LLM Unlearning via Adversarial Gating Training with Adaptive Orthogonality

Pengyu Li, Lingling Zhang, Zhitao Gao et al. · Xi'an Jiaotong University · Shaanxi Province Key Laboratory of Big Data Knowledge Engineering

Defends LLMs against adversarial recovery of memorized sensitive data via min-max gating and gradient orthogonality during unlearning

Model Inversion Attack · Sensitive Information Disclosure · nlp