Aashray Reddy

h-index: 3 106 citations 6 papers (total)

Papers in Database (1)

attack arXiv Nov 4, 2025 · Nov 2025

AutoAdv: Automated Adversarial Prompting for Multi-Turn Jailbreaking of Large Language Models

Aashray Reddy, Andrew Zagula, Nicholas Saban · Del Norte High School · University of California +1 more

AutoAdv automates multi-turn LLM jailbreaks via adaptive prompting, hitting 95% attack success rate on Llama-3.1-8B in six turns

Prompt Injection nlp
5 citations PDF