defense 2025

ceLLMate: Sandboxing Browser AI Agents

Luoxi Meng 1, Henry Feng 1,2, Ilia Shumailov 2,1, Earlence Fernandes 1

4 citations · 69 references · arXiv

α

Published on arXiv

2512.12594

Prompt Injection

OWASP LLM Top 10 — LLM01

Excessive Agency

OWASP LLM Top 10 — LLM08

Key Finding

ceLLMate's sandboxing framework effectively blocks various prompt injection attack types on browser-using agents with negligible runtime overhead.

ceLLMate

Novel technique introduced


Browser-using agents (BUAs) are an emerging class of autonomous agents that interact with web browsers in human-like ways, including clicking, scrolling, filling forms, and navigating across pages. While these agents help automate repetitive online tasks, they are vulnerable to prompt injection attacks that can trick an agent into performing undesired actions, such as leaking private information or issuing state-changing requests. We propose ceLLMate, a browser-level sandboxing framework that restricts the agent's ambient authority and reduces the blast radius of prompt injections. We address two fundamental challenges: (1) The semantic gap challenge in policy enforcement arises because the agent operates through low-level UI observations and manipulations; however, writing and enforcing policies directly over UI-level events is brittle and error-prone. To address this challenge, we introduce an agent sitemap that maps low-level browser behaviors to high-level semantic actions. (2) Policy prediction in BUAs is the norm rather than the exception. BUAs have no app developer to pre-declare sandboxing policies, and thus, ceLLMate pairs website-authored mandatory policies with an automated policy-prediction layer that adapts and instantiates these policies from the user's natural-language task. We implement ceLLMate as an agent-agnostic browser extension and demonstrate how it enables sandboxing policies that effectively block various types of prompt injection attacks with negligible overhead.


Key Contributions

  • Agent sitemap abstraction that maps low-level UI browser events to high-level semantic actions, bridging the policy-enforcement semantic gap
  • Automated policy-prediction layer that instantiates sandboxing policies from user natural-language tasks in the absence of pre-declared app developer policies
  • Agent-agnostic browser extension implementation (ceLLMate) that blocks diverse prompt injection attacks with negligible overhead

🛡️ Threat Analysis


Details

Domains
nlp
Model Types
llm
Threat Tags
inference_timeblack_box
Applications
browser-using ai agentsweb automation