Aditya Joshi

h-index: 3 92 citations 17 papers (total)

Papers in Database (1)

attack arXiv Jan 19, 2026 · 11w ago

In Vino Veritas and Vulnerabilities: Examining LLM Safety via Drunk Language Inducement

Anudeex Shetty, Aditya Joshi, Salil S. Kanhere · UNSW Sydney · The University of Melbourne

Novel drunk-persona jailbreak attack on LLMs bypasses safety tuning and induces privacy leaks across five models

Prompt Injection Sensitive Information Disclosure nlp
PDF