Wonjoong Kim

Papers in Database (1)

defense arXiv Apr 21, 2026

Reasoning Structure Matters for Safety Alignment of Reasoning Models

Yeonjun In, Wonjoong Kim, Sangwu Park et al. · KAIST

Safety alignment for reasoning LLMs via structured reasoning that assesses a request's harmfulness before solving it, reducing unsafe outputs

Prompt Injection nlp