Shuai Wang

defense arXiv Jan 17, 2026 · 11w ago

Taming Various Privilege Escalation in LLM-Based Agent Systems: A Mandatory Access Control Framework

Zimo Ji, Daoyuan Wu, Wenyuan Jiang et al. · Hong Kong University of Science and Technology · Lingnan University +3 more

Proposes SEAgent, a mandatory access control framework that blocks privilege escalation attacks in LLM agent tool use via information flow monitoring and ABAC policies

Prompt Injection Excessive Agency nlp

1 citations PDF

defense arXiv Nov 14, 2025 · Nov 2025

SEAL: Subspace-Anchored Watermarks for LLM Ownership

Yanbo Dai, Zongjie Li, Zhenlan Ji et al. · The Hong Kong University of Science and Technology

Embeds multi-bit ownership watermarks into LLM latent representations, surviving fine-tuning and resisting knowledgeable removal attacks

Model Theft Model Theft nlp

PDF

survey arXiv Nov 19, 2025 · Nov 2025

Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Frameworks

Zimo Ji, Xunguang Wang, Zongjie Li et al. · The Hong Kong University of Science and Technology · Zhejiang University of Technology +3 more

SoK paper taxonomizes IPI defenses for LLM agents, identifies six bypass root causes, and proposes three novel adaptive attacks

Prompt Injection nlp

PDF

Papers in Database (3)

Taming Various Privilege Escalation in LLM-Based Agent Systems: A Mandatory Access Control Framework

SEAL: Subspace-Anchored Watermarks for LLM Ownership

Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Frameworks