Dongsheng Shi

h-index: 1 3 citations 5 papers (total)

Papers in Database (2)

defense arXiv Nov 18, 2025 · Nov 2025

Unified Defense for Large Language Models against Jailbreak and Fine-Tuning Attacks in Education

Xin Yi, Yue Li, Dongsheng Shi et al. · East China Normal University

Three-stage defense framework for educational LLMs that resists both jailbreak and fine-tuning safety-removal attacks

Transfer Learning Attack Prompt Injection nlp
1 citations PDF
defense arXiv Feb 10, 2026 · 8w ago

AGMark: Attention-Guided Dynamic Watermarking for Large Vision-Language Models

Yue Li, Xin Yi, Dongsheng Shi et al. · East China Normal University · Hasso Plattner Institute +1 more

Attention-guided dynamic watermarking for LVLM outputs that preserves visual fidelity while achieving 99.36% AUC detection accuracy

Output Integrity Attack nlpmultimodalvision
PDF