Anthony Kum Hoe Tun

h-index: 0 0 citations 1 papers (total)

Papers in Database (1)

attack arXiv Jan 15, 2026 · 11w ago

Reasoning Hijacking: Subverting LLM Classification via Decision-Criteria Injection

Yuansen Liu, Yixuan Tang, Anthony Kum Hoe Tun · National University of Singapore

Injects spurious decision criteria into LLM classifiers via indirect prompt injection, bypassing intent-based defenses like SecAlign and StruQ

Prompt Injection nlp
PDF Code