Ian G. Harris

Papers in Database (1)

defense arXiv Aug 9, 2025 · Aug 2025

Context Misleads LLMs: The Role of Context Filtering in Maintaining Safe Alignment of LLMs

Jinhwa Kim, Ian G. Harris · University of California

Plug-and-play input preprocessor strips adversarial context from prompts to defend LLMs against jailbreaks, cutting attack success rate by 88%

Prompt Injection nlp
PDF