Marcantonio Bracale Syrnikov

h-index: 1 1 citations 1 papers (total)

Papers in Database (1)

attack arXiv Dec 16, 2025 · Dec 2025

From Adversarial Poetry to Adversarial Tales: An Interpretability Research Agenda

Piercosma Bisconti, Marcello Galisai, Matteo Prandi et al. · Sapienza University of Rome · VU Amsterdam +1 more

Novel jailbreak embeds harmful content in cyberpunk tales using Proppian analysis to bypass LLM safety, achieving 71.3% ASR across 26 models

Prompt Injection nlp
1 citations PDF