Xiaoyan Zhang

Papers in Database (1)

defense · arXiv · Aug 24, 2025 · Aug 2025

Risk Assessment and Security Analysis of Large Language Models

Xiaoyan Zhang, Dongyang Lyu, Xiaoqi Li · Hainan University

Hierarchical LLM defense framework combining BERT-CRF input filtering, adversarial training, and neural output watermarking to detect jailbreaks

Output Integrity Attack · Prompt Injection · nlp