Latest papers

2 papers
benchmark arXiv Oct 11, 2025 · Oct 2025

SecureWebArena: A Holistic Security Evaluation Benchmark for LVLM-based Web Agents

Zonghao Ying, Yangguang Shao, Jianle Gan et al. · Beihang University · Chinese Academy of Sciences +7 more

Benchmark evaluating LVLM web agent security across six attack vectors in realistic web environments, exposing universal vulnerabilities across 9 models

Prompt Injection Excessive Agency multimodalnlp
5 citations PDF
defense arXiv Aug 2, 2025 · Aug 2025

PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation

Zonglei Jing, Xiao Yang, Xiaoqian Li et al. · Beihang University · Beijing University of Posts and Telecommunications +3 more

Gated soft prompt tuning defense for T2I diffusion models that adaptively suppresses NSFW generation based on estimated prompt toxicity

Prompt Injection generative
PDF