Ian G. Harris

defense arXiv Aug 9, 2025 · Aug 2025

Jinhwa Kim, Ian G. Harris · University of California

Plug-and-play input preprocessor strips adversarial context from prompts to defend LLMs against jailbreaks, cutting attack success rate by 88%

Prompt Injection nlp

Papers in Database (1)