Xiaoling Wang

h-index: 3 89 citations 15 papers (total)

Papers in Database (1)

defense arXiv Nov 18, 2025 · Nov 2025

Unified Defense for Large Language Models against Jailbreak and Fine-Tuning Attacks in Education

Xin Yi, Yue Li, Dongsheng Shi et al. · East China Normal University

Three-stage defense framework for educational LLMs that resists both jailbreak and fine-tuning safety-removal attacks

Transfer Learning Attack Prompt Injection nlp
1 citations PDF