Bill Byrne

Papers in Database (1)

defense · arXiv · Aug 22, 2025

Retrieval-Augmented Defense: Adaptive and Controllable Jailbreak Prevention for Large Language Models

Guangyu Yang, Jinghong Chen, Jingbiao Mei et al. · University of Cambridge

RAG-based jailbreak defense for LLMs that retrieves known attack examples to detect and block prompt injection attempts without retraining

Prompt Injection · nlp