Wonhyuk Lee

h-index: 0 0 citations 2 papers (total)

Papers in Database (1)

defense arXiv Sep 27, 2025 · Sep 2025

Guard Vector: Beyond English LLM Guardrails with Task-Vector Composition and Streaming-Aware Prefix SFT

Wonhyuk Lee, Youngchol Kim, Yunjin Park et al. · KT Corporation

Transfers LLM safety guardrail behavior to non-English models via task-vector weight differencing, no target-language labels needed

Prompt Injection nlp
PDF Code