ML Security Papers

Latest papers

3 papers

attack arXiv Apr 6, 2026 · 6w ago

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Zijun Wang, Haoqin Tu, Letian Zhang et al. · UC Santa Cruz · National University of Singapore +4 more

Real-world evaluation showing poisoning of agent persistent state (skills, config, memory) increases attack success from 25% to 64-74% across four LLM backbones

Prompt Injection Excessive Agency nlp

PDF Code

attack CoDAIM workshop Mar 21, 2026 · 8w ago

ACRFence: Preventing Semantic Rollback Attacks in Agent Checkpoint-Restore

Yusheng Zheng, Yiwei Yang, Wei Zhang et al. · UC Santa Cruz · University of Connecticut

LLM agent checkpoint-restore creates replay vulnerabilities enabling duplicate payments and credential reuse through non-deterministic request regeneration

Insecure Plugin Design Excessive Agency nlp

PDF

tool PACMI'2025 Aug 2, 2025 · Aug 2025

AgentSight: System-Level Observability for AI Agents Using eBPF

Yusheng Zheng, Yanpeng Hu, Tong Yu et al. · UC Santa Cruz · ShanghaiTech University +1 more

eBPF-based observability tool that intercepts LLM agent traffic and syscalls to detect prompt injection and resource abuse

Prompt Injection Excessive Agency Blue-Team Agents nlp

PDF Code

Latest papers

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

ACRFence: Preventing Semantic Rollback Attacks in Agent Checkpoint-Restore

AgentSight: System-Level Observability for AI Agents Using eBPF

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue