survey 2026

From Stateless Queries to Autonomous Actions: A Layered Security Framework for Agentic AI Systems

Kexin Chu

University of Connecticut

0 citations

Published on arXiv

2604.23338

Prompt Injection

OWASP LLM Top 10 — LLM01

Insecure Plugin Design

OWASP LLM Top 10 — LLM07

Excessive Agency

OWASP LLM Top 10 — LLM08

Key Finding

Identifies that most dangerous emerging threats (covert agent collusion, long-term memory poisoning, MCP supply-chain compromise) concentrate at intersection of high-layer attacks (L5-L7) and slow-burn temporality (T3-T4), yet only 8 of 120 paper-cell assignments (7%) cover this zone

LASM

Novel technique introduced

Agentic AI systems face security challenges that stateless large language models do not. They plan across extended horizons, maintain persistent memory, invoke external tools, and coordinate with peer agents. Existing security analyses organize threats by attack type (prompt injection, jailbreaking), but provide no principled model of which architectural component is vulnerable or over what timescale the threat manifests. This paper makes five contributions. First, we introduce the Layered Attack Surface Model (LASM), a seven-layer framework that maps threats to distinct architectural components: Foundation, Cognitive, Memory, Tool Execution, Multi-Agent Coordination, Ecosystem, and Governance, the accountability and observability layer that spans the stack analogously to the network management plane. Second, we introduce attack temporality as an orthogonal analytical dimension with four classes: Instantaneous (T1), Session-Persistent (T2), Cross-Session Cumulative (T3), and Sub-Session-Stack, Non-Session-Bounded (T4). Third, through a systematic review of 94 papers (2021--2025), we show that the most dangerous emerging threats concentrate at the intersection of high-layer attacks (L5--L7) and slow-burn temporality (T3--T4): covert agent collusion, long-term memory poisoning, MCP supply-chain compromise, and alignment failure that manifests as an insider threat with no external adversary. Only 8 of 120 paper-cell assignments (7%) fall in this zone. Fourth, we propose a cross-layer defense taxonomy spanning all seven LASM layers and all four temporality classes, exposing which threat classes existing defenses leave unaddressed. Fifth, we survey evaluation benchmarks, identify five research gaps in the under-studied high-layer, slow-burn zone, and argue that agentic security must be treated as a distributed systems problem embedded in an adversarial ecosystem.

Key Contributions

Introduces Layered Attack Surface Model (LASM): seven-layer framework mapping agentic AI threats to architectural components (Foundation, Cognitive, Memory, Tool Execution, Multi-Agent Coordination, Ecosystem, Governance)
Proposes attack temporality dimension with four classes (T1-T4) revealing that slow-burn, cross-session attacks (T3-T4) at high layers (L5-L7) are most dangerous yet least studied
Systematic review of 94 papers (2021-2025) exposing coverage gap: only 7% of research addresses high-layer, slow-burn threats like agent collusion, memory poisoning, and MCP supply-chain attacks

🛡️ Threat Analysis

Details

Domains

nlpmultimodal

Model Types

llmvlmmultimodal

Threat Tags

inference_timetraining_timetargetedblack_box

Applications

autonomous agentsmulti-agent systemstool-using llmsagentic ai orchestration

Read PDF arXiv

From Stateless Queries to Autonomous Actions: A Layered Security Framework for Agentic AI Systems

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Security Risks of Agentic Vehicles: A Systematic Analysis of Cognitive and Cross-Layer Threats

The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey

Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges

SoK: The Attack Surface of Agentic AI -- Tools, and Autonomy

Your Agent is More Brittle Than You Think: Uncovering Indirect Injection Vulnerabilities in Agentic LLMs

Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains

Systems Security Foundations for Agentic Computing

Uncovering Security Threats and Architecting Defenses in Autonomous Agents: A Case Study of OpenClaw