attack 2025

Analyzing Code Injection Attacks on LLM-based Multi-Agent Systems in Software Development

Brian Bowers ¹, Smita Khapre ¹, Jugal Kalita ²

¹ Loyola Marymount University

² University of Colorado Colorado Springs

0 citations · 27 references · arXiv

Published on arXiv

2512.21818

Prompt Injection

OWASP LLM Top 10 — LLM01

Excessive Agency

OWASP LLM Top 10 — LLM08

Key Finding

Embedding poisonous few-shot examples in injected code increases attack success rate against the LLM security analysis agent from 0% to 71.95%

Poisonous Few-Shot Code Injection

Novel technique introduced

Agentic AI and Multi-Agent Systems are poised to dominate industry and society imminently. Powered by goal-driven autonomy, they represent a powerful form of generative AI, marking a transition from reactive content generation into proactive multitasking capabilities. As an exemplar, we propose an architecture of a multi-agent system for the implementation phase of the software engineering process. We also present a comprehensive threat model for the proposed system. We demonstrate that while such systems can generate code quite accurately, they are vulnerable to attacks, including code injection. Due to their autonomous design and lack of humans in the loop, these systems cannot identify and respond to attacks by themselves. This paper analyzes the vulnerability of multi-agent systems and concludes that the coder-reviewer-tester architecture is more resilient than both the coder and coder-tester architectures, but is less efficient at writing code. We find that by adding a security analysis agent, we mitigate the loss in efficiency while achieving even better resiliency. We conclude by demonstrating that the security analysis agent is vulnerable to advanced code injection attacks, showing that embedding poisonous few-shot examples in the injected code can increase the attack success rate from 0% to 71.95%.

Key Contributions

Proposes and evaluates coder, coder-tester, and coder-reviewer-tester MAS architectures for SDLC implementation phase against code injection attacks
Introduces a security analysis agent that improves resilience while recovering efficiency lost in the coder-reviewer-tester architecture
Demonstrates that embedding poisonous few-shot examples in injected code bypasses the security analysis agent, raising attack success rate from 0% to 71.95%

🛡️ Threat Analysis

Details

Domains

nlpgenerative

Model Types

llm

Threat Tags

black_boxinference_timetargeteddigital

Applications

llm-based multi-agent software development systemsautomated code generation pipelines

Read PDF arXiv DOI

Analyzing Code Injection Attacks on LLM-based Multi-Agent Systems in Software Development

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming

Attack the Messages, Not the Agents: A Multi-round Adaptive Stealthy Tampering Framework for LLM-MAS

Tipping the Dominos: Topology-Aware Multi-Hop Attacks on LLM-Based Multi-Agent Systems

Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections

MURMUR: Using cross-user chatter to break collaborative language agents in groups

Deep Research Brings Deeper Harm

Shadows in the Code: Exploring the Risks and Defenses of LLM-based Multi-Agent Software Development Systems

From Storage to Steering: Memory Control Flow Attacks on LLM Agents