defense 2026

Prompt Control-Flow Integrity: A Priority-Aware Runtime Defense Against Prompt Injection in LLM Systems

Md Takrim Ul Alam ¹, Akif Islam ¹, Mohd Ruhul Ameen ², Abu Saleh Musa Miah ³, Jungpil Shin ³

¹ University of Rajshahi

² Marshall University

³ University of Aizu

0 citations

Published on arXiv

2603.18433

Prompt Injection

OWASP LLM Top 10 — LLM01

Key Finding

Achieves 100% attack interception rate with 0% false positive rate and only 0.04ms median processing overhead on benchmark suite

PCFI

Novel technique introduced

Large language models (LLMs) deployed behind APIs and retrieval-augmented generation (RAG) stacks are vulnerable to prompt injection attacks that may override system policies, subvert intended behavior, and induce unsafe outputs. Existing defenses often treat prompts as flat strings and rely on ad hoc filtering or static jailbreak detection. This paper proposes Prompt Control-Flow Integrity (PCFI), a priority-aware runtime defense that models each request as a structured composition of system, developer, user, and retrieved-document segments. PCFI applies a three-stage middleware pipeline, lexical heuristics, role-switch detection, and hierarchical policy enforcement, before forwarding requests to the backend LLM. We implement PCFI as a FastAPI-based gateway for deployed LLM APIs and evaluate it on a custom benchmark of synthetic and semi-realistic prompt-injection workloads. On the evaluated benchmark suite, PCFI intercepts all attack-labeled requests, maintains a 0% False Positive Rate, and introduces a median processing overhead of only 0.04 ms. These results suggest that provenance- and priority-aware prompt enforcement is a practical and lightweight defense for deployed LLM systems.

Key Contributions

PCFI framework that models prompts as structured hierarchical segments (system, developer, user, retrieved-document) with explicit priority enforcement
Three-stage middleware pipeline (lexical heuristics, role-switch detection, hierarchical policy enforcement) for runtime prompt validation
FastAPI-based gateway implementation achieving 100% attack interception with 0% FPR and 0.04ms median overhead

🛡️ Threat Analysis

Details

Domains

nlp

Model Types

llm

Threat Tags

inference_time

Datasets

custom benchmark of synthetic and semi-realistic prompt-injection workloads

Applications

llm apisrag systemsdeployed llm applications

Read PDF arXiv

Prompt Control-Flow Integrity: A Priority-Aware Runtime Defense Against Prompt Injection in LLM Systems

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Invasive Context Engineering to Control Large Language Models

FraudShield: Knowledge Graph Empowered Defense for LLMs against Fraud Attacks

Guard Vector: Beyond English LLM Guardrails with Task-Vector Composition and Streaming-Aware Prefix SFT

LLM Reinforcement in Context

A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks

Soft Instruction De-escalation Defense

$C$-$ΔΘ$: Circuit-Restricted Weight Arithmetic for Selective Refusal

Safe Transformer: An Explicit Safety Bit For Interpretable And Controllable Alignment