Jianfeng Si

Papers in Database (1)

defense · arXiv · Aug 12, 2025

Efficient Switchable Safety Control in LLMs via Magic-Token-Guided Co-Training

Jianfeng Si, Lin Sun, Zhewen Tan et al. · Qiyuan Tech

A co-training framework that embeds switchable safety modes in a single LLM via magic tokens, achieving robust jailbreak resistance at lower cost.

Prompt Injection · NLP