Shoumik Saha

benchmark · arXiv · Oct 1, 2025

Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks

Shoumik Saha, Jifan Chen, Sam Mayers et al. · University of Maryland - College Park · AWS AI Labs +1 more

Benchmarks jailbreak attacks on code-capable LLM agents, showing that wrapping a model in an agent raises attack success 1.6x, with 32% of attack code instantly deployable

Prompt Injection · Excessive Agency · nlp

2 citations · 1 influential
