Chandan K. Reddy

h-index: 3 25 citations 6 papers (total)

Papers in Database (1)

defense arXiv Oct 17, 2025 · Oct 2025

Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense

Zhehao Zhang, Weijie Xu, Shixian Cui et al. · Amazon

Identifies reasoning distraction attacks on LRMs where injected prompt distractors slash accuracy 60%, proposes SFT+DPO defense

Prompt Injection nlp
PDF