Maithili Joshi

Papers in Database (1)

attack arXiv Sep 19, 2025 · Sep 2025

SABER: Uncovering Vulnerabilities in Safety Alignment via Cross-Layer Residual Connection

Maithili Joshi, Palash Nandi, Tanmoy Chakraborty · Indian Institute of Technology Delhi

White-box jailbreak bypasses LLM safety alignment by adding cross-layer residual connections through middle-to-late layers, beating GCG by 51%

Prompt Injection nlp
PDF Code