attack 2026

ACRFence: Preventing Semantic Rollback Attacks in Agent Checkpoint-Restore

Yusheng Zheng ¹, Yiwei Yang ¹, Wei Zhang ², Andi Quinn ¹

¹ UC Santa Cruz

² University of Connecticut

0 citations · CoDAIM workshop

Published on arXiv

2603.20625

Insecure Plugin Design

OWASP LLM Top 10 — LLM07

Excessive Agency

OWASP LLM Top 10 — LLM08

Key Finding

Surveyed 12 major agent frameworks and found none enforce exactly-once tool-call semantics; confirmed duplicate payments and credential reuse in proof-of-concept experiments

Semantic Rollback Attacks (Action Replay, Authority Resurrection)

Novel technique introduced

LLM agent frameworks increasingly offer checkpoint-restore for error recovery and exploration, advising developers to make external tool calls safe to retry. This advice assumes that a retried call will be identical to the original, an assumption that holds for traditional programs but fails for LLM agents, which re-synthesize subtly different requests after restore. Servers treat these re-generated requests as new, enabling duplicate payments, unauthorized reuse of consumed credentials, and other irreversible side effects; we term these semantic rollback attacks. We identify two attack classes, Action Replay and Authority Resurrection, validate them in a proof of concept experiment, and confirm that the problem has been independently acknowledged by framework maintainers. We propose ACRFence, a framework-agnostic mitigation that records irreversible tool effects and enforces replay-or-fork semantics upon restoration

Key Contributions

Identifies semantic rollback attacks (Action Replay and Authority Resurrection) in LLM agent checkpoint-restore mechanisms
Validates attacks across 12 major agent frameworks (LangGraph, CrewAI, Google ADK, AutoGen, Claude Code, etc.)
Proposes ACRFence, a framework-agnostic mitigation enforcing replay-or-fork semantics for irreversible tool effects

🛡️ Threat Analysis

Details

Domains

nlp

Model Types

llm

Threat Tags

inference_timeblack_box

Applications

llm agent frameworksautonomous agentstool-calling systems

Read PDF arXiv

ACRFence: Preventing Semantic Rollback Attacks in Agent Checkpoint-Restore

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Takedown: How It's Done in Modern Coding Agent Exploits

Web Fraud Attacks Against LLM-Driven Multi-Agent Systems

Servant, Stalker, Predator: How An Honest, Helpful, And Harmless (3H) Agent Unlocks Adversarial Skills

From Tool Orchestration to Code Execution: A Study of MCP Design Choices

LPS-Bench: Benchmarking Safety Awareness of Computer-Use Agents in Long-Horizon Planning under Benign and Adversarial Scenarios

MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

Securing AI Agent Execution

Mind the Gap: Time-of-Check to Time-of-Use Vulnerabilities in LLM-Enabled Agents