Zhen Xiang

h-index: 8 714 citations 15 papers (total)

Papers in Database (2)

attack arXiv Oct 11, 2025 · Oct 2025

MetaBreak: Jailbreaking Online LLM Services via Special Token Manipulation

Wentian Zhu, Zhen Xiang, Wei Niu et al. · University of Georgia

Exploits LLM special tokens to construct jailbreak primitives that bypass both safety alignment and content moderation simultaneously

Prompt Injection nlp
PDF
defense arXiv Jan 13, 2026 · 11w ago

Q-realign: Piggybacking Realignment on Quantization for Safe and Efficient LLM Deployment

Qitao Tan, Xiaoying Song, Ningxi Cheng et al. · University of Georgia · University of North Texas +2 more

Recovers LLM safety alignment eroded by fine-tuning via post-training quantization, without retraining, in 40 minutes on one GPU

Transfer Learning Attack Prompt Injection nlp
PDF Code