Yixuan Tang

Papers in Database (1)

attack arXiv Jan 15, 2026 · 11w ago

Reasoning Hijacking: Subverting LLM Classification via Decision-Criteria Injection

Yuansen Liu, Yixuan Tang, Anthony Kum Hoe Tun · National University of Singapore

Injects spurious decision criteria into LLM classifiers via indirect prompt injection, bypassing intent-based defenses like SecAlign and StruQ

Prompt Injection nlp
PDF Code