Hyundong Jin

h-index: 1 · 5 citations · 7 papers (total)

Papers in Database (1)

defense · arXiv · Jan 7, 2026 · Jan 2026

How Does the Thinking Step Influence Model Safety? An Entropy-based Safety Reminder for LRMs

Su-Hyeon Kim, Hyundong Jin, Yejin Lee et al. · Yonsei University

Defends large reasoning models (LRMs) against jailbreaks by injecting entropy-triggered safe-reminding phrases into the model's thinking steps at inference time

Prompt Injection nlp