Mingyuan Wang

h-index: 5 223 citations 16 papers (total)

Papers in Database (1)

attack arXiv Oct 2, 2025 · Oct 2025

Bypassing Prompt Guards in Production with Controlled-Release Prompting

Jaiden Fairoze, Sanjam Garg, Keewoo Lee et al. · University of California · Exponential Science Foundation +1 more

Cipher-encoded prompts exploit resource asymmetry between prompt guards and main LLMs to jailbreak Gemini, DeepSeek, Grok, and Mistral in production

Prompt Injection Sensitive Information Disclosure nlp
1 citations PDF