Vincenzo Scotti

Papers in Database (1)

tool IEEE Transactions on Software ... Jan 3, 2025 · Jan 2025

How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models

Simone Corbo, Luca Bancale, Valeria De Gennaro et al. · Politecnico di Milano · Karlsruhe Institute of Technology

Evolutionary search-based tool that auto-generates fluent prompts to elicit toxic outputs from aligned LLMs, outperforming jailbreak baselines

Prompt Injection nlp