Jonah Burgess

h-index: 5 103 citations 9 papers (total)

Papers in Database (1)

benchmark arXiv Oct 14, 2025 ยท Oct 2025

Breaking Guardrails, Facing Walls: Insights on Adversarial AI for Defenders & Researchers

Giacomo Bertollo, Naz Bodemir, Jonah Burgess

CTF study of 500 participants reveals layered multi-step AI guardrails significantly resist common jailbreak techniques versus simple defenses

Prompt Injection nlp
PDF