Motahhare Eslami

Papers in Database (1)

attack arXiv Sep 3, 2025 · Sep 2025

PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming

Wesley Hanwen Deng, Sunnie S. Y. Kim, Akshita Jha et al. · Carnegie Mellon University · Apple

Persona-driven automated red-teaming method improves LLM adversarial prompt attack success rates by up to 144% over state-of-the-art

Prompt Injection nlp
PDF