Shoumik Saha

h-index: 7 258 citations 23 papers (total)

Papers in Database (1)

benchmark arXiv Oct 1, 2025 · Oct 2025

Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks

Shoumik Saha, Jifan Chen, Sam Mayers et al. · University of Maryland - College Park · AWS AI Labs +1 more

Benchmarks jailbreak attacks on code-capable LLM agents, showing agent wrapping raises attack success 1.6x with 32% instantly deployable malicious code

Prompt Injection Excessive Agency nlp
2 citations 1 influentialPDF