Li Wang

Papers in Database (2)

attack arXiv Mar 24, 2026 · 13d ago

Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

Wenyu Chen, Xiangtao Meng, Chuanchao Zang et al. · Shandong University

Token-aware jailbreak fuzzing that achieves 90% attack success with 70% fewer queries by prioritizing high-contribution tokens

Prompt Injection nlp
PDF
defense arXiv Aug 28, 2025 · Aug 2025

Beyond the Safety Tax: Mitigating Unsafe Text-to-Image Generation via External Safety Rectification

Xiangtao Meng, Yingkai Dong, Ning Yu et al. · Shandong University · Netflix

Proposes SafePatch, an external safety module for T2I diffusion models that suppresses unsafe generation without degrading benign image quality

Prompt Injection visiongenerative
PDF