Kyomin Jung

h-index: 3 · 22 citations · 4 papers (total)

Papers in Database (1)

defense · arXiv · Feb 8, 2026 · 8w ago

CausalArmor: Efficient Indirect Prompt Injection Guardrails via Causal Attribution

Minbeom Kim, Mihir Parmar, Phillip Wallis et al. · Google Cloud AI Research · Seoul National University +2 more

Defends LLM tool-calling agents against indirect prompt injection by using causal attribution to detect dominance shifts at privileged action points.

Prompt Injection · Excessive Agency · nlp