Mingyuan Wang

attack arXiv Oct 2, 2025 · Oct 2025

Jaiden Fairoze, Sanjam Garg, Keewoo Lee et al. · University of California · Exponential Science Foundation +1 more

Cipher-encoded prompts exploit resource asymmetry between prompt guards and main LLMs to jailbreak Gemini, DeepSeek, Grok, and Mistral in production

Prompt Injection Sensitive Information Disclosure nlp

1 citations PDF

Papers in Database (1)