Tushar Sandhan

h-index: 5 96 citations 57 papers (total)

Papers in Database (1)

attack arXiv Jan 23, 2026 · 10w ago

Persona Jailbreaking in Large Language Models

Jivnesh Sandhan, Fei Cheng, Tushar Sandhan et al. · Kyoto University · Indian Institute of Technology Kanpur

Black-box attack gradually hijacks LLM personas via adversarial conversational history, bypassing guardrails across 8 LLMs

Prompt Injection nlp
PDF Code