Xianglong Liu

Papers in Database (4)

defense arXiv Aug 2, 2025 · Aug 2025

PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation

Zonglei Jing, Xiao Yang, Xiaoqian Li et al. · Beihang University · Beijing University of Posts and Telecommunications +3 more

Gated soft prompt tuning defense for T2I diffusion models that adaptively suppresses NSFW generation based on estimated prompt toxicity

Prompt Injection generative
PDF
defense arXiv Apr 27, 2026 · 24d ago

AgentVisor: Defending LLM Agents Against Prompt Injection via Semantic Virtualization

Zonghao Ying, Haozheng Wang, Jiangfan Liu et al. · Beihang University · 360 AI Security Lab +1 more

OS-inspired defense framework that intercepts LLM agent tool calls and enforces privilege separation to block prompt injection attacks

Prompt Injection Excessive Agency nlp
PDF
survey arXiv Mar 13, 2026 · 9w ago

Uncovering Security Threats and Architecting Defenses in Autonomous Agents: A Case Study of OpenClaw

Zonghao Ying, Xiao Yang, Siyang Wu et al. · Beihang University · Zhongguancun Laboratory +1 more

Security analysis of OpenClaw autonomous agents revealing prompt injection RCE, tool chain attacks, and proposing FASA defense architecture

AI Supply Chain Attacks Prompt Injection Insecure Plugin Design Excessive Agency nlpmultimodal
PDF Code
attack arXiv Sep 18, 2025 · Sep 2025

Vulnerable Agent Identification in Large-Scale Multi-Agent Reinforcement Learning

Simin Li, Zheng Yuwei, Zihao Mao et al. · Beihang University · Peking University +3 more

Identifies maximally vulnerable agent subsets in large-scale MARL and learns worst-case adversarial policies via hierarchical mean-field control decomposition

Input Manipulation Attack reinforcement-learning
PDF