survey 2026

Uncovering Security Threats and Architecting Defenses in Autonomous Agents: A Case Study of OpenClaw

Zonghao Ying ¹, Xiao Yang ¹, Siyang Wu ², Yumeng Song ¹, Yang Qu ¹, Hainan Li ³, Tianlin Li ¹, Jiakai Wang ², Aishan Liu ¹, Xianglong Liu ¹,²

¹ Beihang University

² Zhongguancun Laboratory

³ Hefei Comprehensive National Science Center

0 citations

Published on arXiv

2603.12644

AI Supply Chain Attacks

OWASP ML Top 10 — ML06

Prompt Injection

OWASP LLM Top 10 — LLM01

Insecure Plugin Design

OWASP LLM Top 10 — LLM07

Excessive Agency

OWASP LLM Top 10 — LLM08

Key Finding

Identifies that traditional content-filtering defenses are obsolete for autonomous agents with OS-level permissions, which calls for a paradigm shift toward architectural defenses

FASA

Novel technique introduced

The rapid evolution of Large Language Models (LLMs) into autonomous, tool-calling agents has fundamentally altered the cybersecurity landscape. Frameworks like OpenClaw grant AI systems operating-system-level permissions and the autonomy to execute complex workflows. This level of access creates unprecedented security challenges. Consequently, traditional content-filtering defenses have become obsolete. This report presents a comprehensive security analysis of the OpenClaw ecosystem. We systematically investigate its current threat landscape, highlighting critical vulnerabilities such as prompt injection-driven Remote Code Execution (RCE), sequential tool attack chains, context amnesia, and supply chain contamination. To contextualize these threats, we propose a novel tri-layered risk taxonomy for autonomous agents, categorizing vulnerabilities across AI Cognitive, Software Execution, and Information System dimensions. To address these systemic architectural flaws, we introduce the Full-Lifecycle Agent Security Architecture (FASA). This theoretical defense blueprint advocates for zero-trust agentic execution, dynamic intent verification, and cross-layer reasoning-action correlation. Building on this framework, we present Project ClawGuard, our ongoing engineering initiative. This project aims to implement the FASA paradigm and transition autonomous agents from high-risk experimental utilities into trustworthy systems. Our code and dataset are available at https://github.com/NY1024/ClawGuard.

Key Contributions

Tri-layered risk taxonomy for autonomous agents categorizing vulnerabilities across AI Cognitive, Software Execution, and Information System dimensions
Full-Lifecycle Agent Security Architecture (FASA) proposing zero-trust agentic execution, dynamic intent verification, and cross-layer reasoning-action correlation
Systematic threat landscape analysis of OpenClaw revealing prompt injection-driven RCE, sequential tool attack chains, context amnesia, and supply chain contamination
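The FASA ideas named above (zero-trust execution, intent verification, reasoning-action correlation) can be illustrated with a minimal sketch. This is not the authors' implementation or the ClawGuard code. `ToolCall`, `authorize`, the tool names, and the notion of a "declared plan" are all hypothetical, chosen only to show a deny-by-default gate that checks each action against both the user's stated intent and the agent's own stated reasoning.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolCall:
    tool: str       # hypothetical tool name, e.g. "read_file" or "shell"
    argument: str   # single string argument, kept simple for the sketch


def authorize(call, allowed_tools, declared_plan, step):
    """Deny by default; return (allowed, reason).

    allowed_tools: tools the user's task is assumed to need (intent).
    declared_plan: tools the agent said it would call, in order (reasoning).
    step: index of the current action within the declared plan.
    """
    # Zero trust: anything outside the declared intent is rejected.
    if call.tool not in allowed_tools:
        return False, "tool outside declared intent"
    # Reasoning-action correlation: the action must match the stated plan.
    if step >= len(declared_plan) or declared_plan[step] != call.tool:
        return False, "action diverges from stated reasoning"
    return True, "ok"


if __name__ == "__main__":
    intent = {"read_file", "summarize"}
    plan = ["read_file", "summarize"]
    print(authorize(ToolCall("read_file", "notes.txt"), intent, plan, 0))
    print(authorize(ToolCall("shell", "id"), intent, plan, 1))
```

The gate inspects what the agent does rather than what its text says, which is the contrast with content filtering drawn in the abstract. A real system would also need argument-level checks and sandboxing, which this sketch omits.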

🛡️ Threat Analysis

AI Supply Chain Attacks

Paper explicitly addresses supply chain contamination in the OpenClaw ecosystem, including trojaned/poisoned tools and malicious third-party integrations distributed through the agent framework supply chain.
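One common mitigation for trojaned tools, sketched here as an illustration and not as the paper's method, is to pin each third-party tool to a known digest and refuse to load anything that does not match. The names `verify_tool` and `pins`, and the example tool `summarize`, are hypothetical. Only the standard-library `hashlib` and `hmac` calls are real.

```python
import hashlib
import hmac


def sha256_hex(data: bytes) -> str:
    """Hex-encoded SHA-256 digest of a tool package's bytes."""
    return hashlib.sha256(data).hexdigest()


def verify_tool(name: str, payload: bytes, pins: dict) -> bool:
    """Return True only if the tool is pinned and its digest matches.

    pins maps a tool name to the expected hex digest (assumed to come
    from a trusted, separately distributed manifest).
    """
    expected = pins.get(name)
    if expected is None:
        return False  # unpinned tools are rejected by default
    # Constant-time comparison of the two hex strings.
    return hmac.compare_digest(sha256_hex(payload), expected)


if __name__ == "__main__":
    trusted = b"def run(text):\n    return text[:100]\n"
    pins = {"summarize": sha256_hex(trusted)}
    print(verify_tool("summarize", trusted, pins))
    print(verify_tool("summarize", trusted + b"# injected", pins))
```

Pinning only detects tampering after the pin was recorded. It does not help if the pinned version was already malicious, so it complements review of the tool's behavior and does not replace it.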

Details

Domains

nlp, multimodal

Model Types

llm

Threat Tags

inference_time, black_box

Applications

autonomous ai agents, tool-calling llm systems, agentic workflows

Similar Papers

Systematization of Knowledge: Security and Safety in the Model Context Protocol Ecosystem

SoK: The Attack Surface of Agentic AI -- Tools, and Autonomy

Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges

Systems Security Foundations for Agentic Computing

Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains

MCPGuard: Automatically Detecting Vulnerabilities in MCP Servers

The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey

ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems