Kevin Scaria

defense arXiv Apr 6, 2026 · 6w ago

Purva Chiniya, Kevin Scaria, Sagar Chaturvedi · Amazon

Dual-anchor gradient detection combined with deterministic refusal-token injection to prevent LLM jailbreaks while reducing false positives by 52%

Prompt Injection nlp

Papers in Database (1)