Alireza Nadali

h-index: 3 21 citations 9 papers (total)

Papers in Database (1)

defense arXiv Feb 2, 2026 · 9w ago

Monotonicity as an Architectural Bias for Robust Language Models

Patrick Cooper, Alireza Nadali, Ashutosh Trivedi et al. · University of Colorado Boulder

Enforces monotonicity in Transformer FFN layers to cut LLM adversarial attack success rates from 69% to 19% with minimal performance cost

Input Manipulation Attack Prompt Injection nlp
PDF