Xuan Luo

Papers in Database (1)

attack arXiv Sep 17, 2025 · Sep 2025

A Simple and Efficient Jailbreak Method Exploiting LLMs' Helpfulness

Xuan Luo, Yue Wang, Zefeng He et al. · Harbin Institute of Technology · Hong Kong Polytechnic University +2 more

Jailbreaks LLMs by reframing harmful queries as educational learning questions, bypassing safety alignment on 22 models

Prompt Injection nlp
PDF Code