attack 2026

Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search

Yulin Shen ¹, Xudong Pan ^1,2, Geng Hong ¹, Min Yang ¹

¹ Fudan University

² Shanghai Innovation Institute

0 citations

Published on arXiv

2603.24203

Prompt Injection

OWASP LLM Top 10 — LLM01

Insecure Plugin Design

OWASP LLM Top 10 — LLM07

Key Finding

Achieves >95% attack success rate on four mainstream LLMs in undefended settings with 10x fewer queries than prior attacks; maintains >50% effectiveness against four defense approaches

TIP (Tree structured Injection for Payloads)

Novel technique introduced

Recent advances in the Model Context Protocol (MCP) have enabled large language models (LLMs) to invoke external tools with unprecedented ease. This creates a new class of powerful and tool augmented agents. Unfortunately, this capability also introduces an under explored attack surface, specifically the malicious manipulation of tool responses. Existing techniques for indirect prompt injection that target MCP suffer from high deployment costs, weak semantic coherence, or heavy white box requirements. Furthermore, they are often easily detected by recently proposed defenses. In this paper, we propose Tree structured Injection for Payloads (TIP), a novel black-box attack which generates natural payloads to reliably seize control of MCP enabled agents even under defense. Technically, We cast payload generation as a tree structured search problem and guide the search with an attacker LLM operating under our proposed coarse-to-fine optimization framework. To stabilize learning and avoid local optima, we introduce a path-aware feedback mechanism that surfaces only high quality historical trajectories to the attacker model. The framework is further hardened against defensive transformations by explicitly conditioning the search on observable defense signals and dynamically reallocating the exploration budget. Extensive experiments on four mainstream LLMs show that TIP attains over 95% attack success in undefended settings while requiring an order of magnitude fewer queries than prior adaptive attacks. Against four representative defense approaches, TIP preserves more than 50% effectiveness and significantly outperforms the state-of-the-art attacks. By implementing the attack on real world MCP systems, our results expose an invisible but practical threat vector in MCP deployments. We also discuss potential mitigation approaches to address this critical security gap.

Key Contributions

Tree-structured search framework (TIP) for generating semantically coherent injection payloads in MCP tool responses
Coarse-to-fine optimization with path-aware feedback mechanism to avoid local optima and stabilize black-box search
Defense-aware adaptive search that dynamically reallocates exploration budget based on observable defense signals

🛡️ Threat Analysis

Details

Domains

nlp

Model Types

llm

Threat Tags

black_boxinference_timetargeted

Applications

llm agentsmcp-enabled systemstool-augmented ai

Read PDF arXiv

Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs

STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents

Automatic Red Teaming LLM-based Agents with Model Context Protocol Tools

MCP-ITP: An Automated Framework for Implicit Tool Poisoning in MCP

Jailbreaking Large Language Models through Iterative Tool-Disguised Attacks via Reinforcement Learning

Cuckoo Attack: Stealthy and Persistent Attacks Against AI-IDE

SkillJect: Automating Stealthy Skill-Based Prompt Injection for Coding Agents with Trace-Driven Closed-Loop Refinement

QueryIPI: Query-agnostic Indirect Prompt Injection on Coding Agents