Xin Zhang

h-index: 2 · 10 citations · 5 papers (total)

Papers in Database (1)

defense arXiv Jan 19, 2026 · 11w ago

LSSF: Safety Alignment for Large Language Models through Low-Rank Safety Subspace Fusion

Guanghao Zhou, Panjia Qiu, Cen Chen et al. · East China Normal University · Ant Group

Post-hoc LLM safety re-alignment via low-rank safety subspace fusion to restore guardrails degraded by fine-tuning

Transfer Learning Attack · Prompt Injection · nlp
3 citations · 1 influential