Wanjin Park

h-index: 1 2 citations 4 papers (total)

Papers in Database (1)

defense arXiv Sep 27, 2025 · Sep 2025

Guard Vector: Beyond English LLM Guardrails with Task-Vector Composition and Streaming-Aware Prefix SFT

Wonhyuk Lee, Youngchol Kim, Yunjin Park et al. · KT Corporation

Transfers LLM safety guardrail behavior to non-English models via task-vector weight differencing, no target-language labels needed

Prompt Injection nlp
PDF Code