defense 2026

From Tool Orchestration to Code Execution: A Study of MCP Design Choices

Yuval Felendler , Parth A. Gandhi , Idan Habler , Yuval Elovici , Asaf Shabtai

Ben Gurion University of The Negev

0 citations · 36 references · arXiv (Cornell University)

Published on arXiv

2602.15945

Insecure Plugin Design

OWASP LLM Top 10 — LLM07

Excessive Agency

OWASP LLM Top 10 — LLM08

Key Finding

CE-MCP reduces token usage and execution latency but introduces 16 distinct attack classes including code injection; layered sandboxing and semantic gating defenses address these vulnerabilities across multiple LLMs.

Semantic gating + containerized sandboxing for CE-MCP

Novel technique introduced

Model Context Protocols (MCPs) provide a unified platform for agent systems to discover, select, and orchestrate tools across heterogeneous execution environments. As MCP-based systems scale to incorporate larger tool catalogs and multiple concurrently connected MCP servers, traditional tool-by-tool invocation increases coordination overhead, fragments state management, and limits support for wide-context operations. To address these scalability challenges, recent MCP designs have incorporated code execution as a first-class capability, an approach called Code Execution MCP (CE-MCP). This enables agents to consolidate complex workflows, such as SQL querying, file analysis, and multi-step data transformations, into a single program that executes within an isolated runtime environment. In this work, we formalize the architectural distinction between context-coupled (traditional) and context-decoupled (CE-MCP) models, analyzing their fundamental scalability trade-offs. Using the MCP-Bench framework across 10 representative servers, we empirically evaluate task behavior, tool utilization patterns, execution latency, and protocol efficiency as the scale of connected MCP servers and available tools increases, demonstrating that while CE-MCP significantly reduces token usage and execution latency, it introduces a vastly expanded attack surface. We address this security gap by applying the MAESTRO framework, identifying sixteen attack classes across five execution phases-including specific code execution threats such as exception-mediated code injection and unsafe capability synthesis. We validate these vulnerabilities through adversarial scenarios across multiple LLMs and propose a layered defense architecture comprising containerized sandboxing and semantic gating. Our findings provide a rigorous roadmap for balancing scalability and security in production-ready executable agent workflows.

Key Contributions

Formalizes the architectural distinction between context-coupled (traditional) and context-decoupled (CE-MCP) MCP models and empirically evaluates their scalability trade-offs using MCP-Bench across 10 servers
Applies the MAESTRO framework to identify 16 attack classes across 5 CE-MCP execution phases, including novel threats such as exception-mediated code injection and unsafe capability synthesis, validated adversarially across multiple LLMs
Proposes a layered defense architecture comprising containerized sandboxing and pre/post-execution semantic gating to mitigate the expanded attack surface of CE-MCP

🛡️ Threat Analysis

Details

Domains

nlp

Model Types

llm

Threat Tags

inference_timeblack_box

Datasets

MCP-Bench

Applications

llm agent systemstool orchestrationagentic workflows

Read PDF arXiv DOI Code

From Tool Orchestration to Code Execution: A Study of MCP Design Choices

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Securing AI Agent Execution

TraceAegis: Securing LLM-Based Agents via Hierarchical and Behavioral Anomaly Detection

MiniScope: A Least Privilege Framework for Authorizing Tool Calling Agents

Agentic JWT: A Secure Delegation Protocol for Autonomous AI Agents

Autonomous Action Runtime Management(AARM):A System Specification for Securing AI-Driven Actions at Runtime

Tracking Capabilities for Safer Agents

Execution Is the New Attack Surface: Survivability-Aware Agentic Crypto Trading with OpenClaw-Style Local Executors

LPS-Bench: Benchmarking Safety Awareness of Computer-Use Agents in Long-Horizon Planning under Benign and Adversarial Scenarios