Andrew Bell

defense arXiv Jan 2, 2025 · Jan 2025

Joao Fonseca, Andrew Bell, Julia Stoyanovich · New York University

Real-time LLM jailbreak defense using controlled text generation and nudging interventions, cutting attack success by 30%

Prompt Injection nlp

Papers in Database (1)