defense 2025

BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents

Kaiyuan Zhang ¹, Mark Tenenholtz ², Kyle Polley ², Jerry Ma ², Denis Yarats ², Ninghui Li ¹

¹ Purdue University

² Perplexity AI

7 citations · 58 references · arXiv

Published on arXiv

2511.20597

Prompt Injection

OWASP LLM Top 10 — LLM01

Excessive Agency

OWASP LLM Top 10 — LLM08

Key Finding

Existing defenses are insufficient against realistic HTML-embedded prompt injections, and a multi-layered architectural plus model-based defense-in-depth strategy is required for practical web agent security

BrowseSafe

Novel technique introduced

The integration of artificial intelligence (AI) agents into web browsers introduces security challenges that go beyond traditional web application threat models. Prior work has identified prompt injection as a new attack vector for web agents, yet the resulting impact within real-world environments remains insufficiently understood. In this work, we examine the landscape of prompt injection attacks and synthesize a benchmark of attacks embedded in realistic HTML payloads. Our benchmark goes beyond prior work by emphasizing injections that can influence real-world actions rather than mere text outputs, and by presenting attack payloads with complexity and distractor frequency similar to what real-world agents encounter. We leverage this benchmark to conduct a comprehensive empirical evaluation of existing defenses, assessing their effectiveness across a suite of frontier AI models. We propose a multi-layered defense strategy comprising both architectural and model-based defenses to protect against evolving prompt injection attacks. Our work offers a blueprint for designing practical, secure web agents through a defense-in-depth approach.

Key Contributions

Benchmark of prompt injection attacks embedded in realistic HTML payloads, emphasizing injections that trigger real-world actions with distractor frequency matching production environments
Comprehensive empirical evaluation of existing defenses against prompt injection across multiple frontier AI models
Multi-layered BrowseSafe defense strategy combining architectural and model-based defenses for a defense-in-depth approach to secure web agents

🛡️ Threat Analysis

Details

Domains

nlp

Model Types

llm

Threat Tags

inference_timeblack_box

Datasets

custom HTML prompt injection benchmark

Applications

ai browser agentsweb agentsagentic llm systems

Read PDF arXiv DOI

BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

AgenTRIM: Tool Risk Mitigation for Agentic AI

AgentArmor: Enforcing Program Analysis on Agent Runtime Trace to Defend Against Prompt Injection

Agent Privilege Separation in OpenClaw: A Structural Defense Against Prompt Injection

AI Kill Switch for malicious web-based LLM agent

Deep Research with Open-Domain Evaluation and Multi-Stage Guardrails for Safety

Taming Various Privilege Escalation in LLM-Based Agent Systems: A Mandatory Access Control Framework

MAS-Shield: A Defense Framework for Secure and Efficient LLM MAS

Don't Let the Claw Grip Your Hand: A Security Analysis and Defense Framework for OpenClaw