Xi Zhang

h-index: 4 83 citations 20 papers (total)

Papers in Database (2)

survey arXiv Jan 7, 2026 · 12w ago

Jailbreaking LLMs & VLMs: Mechanisms, Evaluation, and Unified Defense

Zejian Chen, Chaozhuo Li, Chao Li et al. · Beijing University of Posts and Telecommunications · China Academy of Information and Communications Technology

Surveys LLM and VLM jailbreak attacks and defenses, proposing a unified three-layer defense framework across text and multimodal settings

Input Manipulation Attack Prompt Injection nlpmultimodal
1 citations PDF
defense arXiv Dec 3, 2025 · Dec 2025

From static to adaptive: immune memory-based jailbreak detection for large language models

Jun Leng, Yu Liu, Litian Zhang et al. · Beijing University of Posts and Telecommunications · Hunan Branch of National Computer Network Emergency Response +1 more

Adaptive jailbreak detection for LLMs using immune memory retrieval and dual-agent simulation to counter evolving attacks

Prompt Injection nlp
PDF