Andrew Bell

Papers in Database (1)

defense · arXiv · Jan 2, 2025

Safeguarding Large Language Models in Real-time with Tunable Safety-Performance Trade-offs

Joao Fonseca, Andrew Bell, Julia Stoyanovich · New York University

Real-time LLM jailbreak defense using controlled text generation and nudging interventions, cutting attack success by 30%

Prompt Injection nlp