Alexander Hoover

defense arXiv Nov 4, 2025 · Nov 2025

Roy Rinberg, Adam Karvonen, Alexander Hoover et al. · Harvard University · ML Alignment & Theory Scholars (MATS) +2 more

Defends against LLM weight theft via steganographic output channels by verifying inference non-determinism, achieving >200x adversary slowdown

Model Theft Model Theft nlp

2 citations PDF

Papers in Database (1)