Palash Nandi

attack arXiv Sep 19, 2025 · Sep 2025

Maithili Joshi, Palash Nandi, Tanmoy Chakraborty · Indian Institute of Technology Delhi

White-box jailbreak bypasses LLM safety alignment by adding cross-layer residual connections through middle-to-late layers, beating GCG by 51%

Prompt Injection nlp

attack arXiv Mar 15, 2026 · 24d ago

Suvadeep Hajra, Palash Nandi, Tanmoy Chakraborty · Indian Institute of Technology Delhi

Efficient red-teaming method that uncovers LLM jailbreaks through diverse response sampling rather than adversarial prompt optimization

Prompt Injection nlp

Papers in Database (2)