Feifei Zhao

Papers in Database (1)

defense arXiv Aug 8, 2025 · Aug 2025

Multi-Level Safety Continual Projection for Fine-Tuned Large Language Models without Retraining

Bing Han, Feifei Zhao, Dongcheng Zhao et al. · University of Chinese Academy of Sciences · Chinese Academy of Sciences +2 more

Training-free post-fine-tuning defense restoring LLM safety alignment via sparse neuron projection without retraining

Transfer Learning Attack Prompt Injection nlp
PDF