Suvadeep Hajra

attack arXiv Mar 15, 2026

Suvadeep Hajra, Palash Nandi, Tanmoy Chakraborty · Indian Institute of Technology Delhi

Efficient red-teaming method that uncovers LLM jailbreaks through diverse response sampling rather than adversarial prompt optimization

Prompt Injection nlp

Papers in Database (1)