Nir Shavit

h-index: 58 14,647 citations 245 papers (total)

Papers in Database (1)

attack arXiv Feb 22, 2026 · 6w ago

Understanding Empirical Unlearning with Combinatorial Interpretability

Shingo Kodama, Niv Cohen, Micah Adler et al. · Middlebury College · New York University +2 more

Attacks machine unlearning methods using combinatorial interpretability, showing erased knowledge persists in weights and recovers rapidly via fine-tuning

Model Inversion Attack nlpvision
PDF