ML Security Papers

Latest papers

2 papers

defense · arXiv · Feb 15, 2026

MCPShield: A Security Cognition Layer for Adaptive Trust Calibration in Model Context Protocol Agents

Zhenhong Zhou, Yuanhe Zhang, Hongwei Cai et al. · NTU · BUPT +3 more

Proposes MCPShield, a lifecycle-aware security layer that defends LLM agents against malicious third-party MCP tool servers.

Insecure Plugin Design · nlp

survey · arXiv · Feb 11, 2026

The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis

Peiran Wang, Xinfeng Li, Chong Xiang et al. · UCLA · NTU +1 more

Systematizes prompt injection attacks and defenses for LLM agents and introduces the AgentPI benchmark, which exposes context-dependent gaps in existing evaluations.

Prompt Injection · Excessive Agency · nlp
