Patrikas Vanagas

h-index: 0 0 citations 1 papers (total)

Papers in Database (1)

defense arXiv Jan 30, 2026 · 9w ago

No More, No Less: Least-Privilege Language Models

Paulius Rauba, Dominykas Seputis, Patrikas Vanagas et al. · University of Cambridge · Vinted +2 more

Proposes inference-time capability restriction for LLMs by controlling reachable internal computation via rank-indexed weight interventions

Prompt Injection nlp
PDF