attack 2026

WebWeaver: Breaking Topology Confidentiality in LLM Multi-Agent Systems with Stealthy Context-Based Inference

Zixun Xiong ¹, Gaoyi Wu ¹, Lingfeng Yao ², Miao Pan ², Xiaojiang Du ¹, Hao Wang ¹

¹ Stevens Institute of Technology

² University of Houston

0 citations

Published on arXiv

2603.11132

Excessive Agency

OWASP LLM Top 10 — LLM08

Prompt Injection

OWASP LLM Top 10 — LLM01

Key Finding

WebWeaver achieves approximately 60% higher topology inference accuracy than state-of-the-art baselines under active defenses with negligible overhead.

WebWeaver

Novel technique introduced

Communication topology is a critical factor in the utility and safety of LLM-based multi-agent systems (LLM-MAS), making it a high-value intellectual property (IP) whose confidentiality remains insufficiently studied. % Existing topology inference attempts rely on impractical assumptions, including control over the administrative agent and direct identity queries via jailbreaks, which are easily defeated by basic keyword-based defenses. As a result, prior analyses fail to capture the real-world threat of such attacks. % To bridge this realism gap, we propose \textit{WebWeaver}, an attack framework that infers the complete LLM-MAS topology by compromising only a single arbitrary agent instead of the administrative agent. % Unlike prior approaches, WebWeaver relies solely on agent contexts rather than agent IDs, enabling significantly stealthier inference. % WebWeaver further introduces a new covert jailbreak-based mechanism and a novel fully jailbreak-free diffusion design to handle cases where jailbreaks fail. % Additionally, we address a key challenge in diffusion-based inference by proposing a masking strategy that preserves known topology during diffusion, with theoretical guarantees of correctness. % Extensive experiments show that WebWeaver substantially outperforms state-of-the-art (SOTA) baselines, achieving about 60\% higher inference accuracy under active defenses with negligible overhead.

Key Contributions

WebWeaver attack framework that infers the complete LLM-MAS communication topology by compromising only a single arbitrary non-administrative agent
Context-based (not agent-identity-based) inference that is stealthier and defeats keyword-based defenses used against prior methods
Jailbreak-free diffusion-based topology inference with a masking strategy that preserves known topology and provides theoretical correctness guarantees

🛡️ Threat Analysis

Details

Domains

nlp

Model Types

llm

Threat Tags

grey_boxinference_timetargeted

Applications

llm multi-agent systems

Read PDF arXiv

WebWeaver: Breaking Topology Confidentiality in LLM Multi-Agent Systems with Stealthy Context-Based Inference

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Optimizing AI Agent Attacks With Synthetic Data

MURMUR: Using cross-user chatter to break collaborative language agents in groups

Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections

When Agents See Humans as the Outgroup: Belief-Dependent Bias in LLM-Powered Agents

David vs. Goliath: Verifiable Agent-to-Agent Jailbreaking via Reinforcement Learning

Tipping the Dominos: Topology-Aware Multi-Hop Attacks on LLM-Based Multi-Agent Systems

From Storage to Steering: Memory Control Flow Attacks on LLM Agents