Lulu Zhao

h-index: 3 47 citations 9 papers (total)

Papers in Database (1)

attack arXiv Oct 9, 2025 · Oct 2025

AutoRed: A Free-form Adversarial Prompt Generation Framework for Automated Red Teaming

Muxi Diao, Yutao Mou, Keqing He et al. · Beijing University of Posts and Telecommunications · Peking University +1 more

Seed-free LLM red teaming framework using persona-guided generation and reflection loops to produce diverse, high-ASR jailbreak prompts

Prompt Injection nlp
PDF