Sachin Kumar

h-index: 3 · 261 citations · 4 papers (total)

Papers in Database (1)

defense · arXiv · Oct 30, 2025

Reasoning Up the Instruction Ladder for Controllable Language Models

Zishuo Zheng, Vidhisha Balachandran, Chan Young Park et al. · The Ohio State University · Microsoft Research +1 more

Trains LLMs via RL on instruction-hierarchy data to resist jailbreaks and prompt injection, cutting attack success rates by 20%

Prompt Injection · nlp
1 citation · PDF · Code