Shuaishuai Yang

h-index: 0 · 0 citations · 1 paper (total)

Papers in Database (1)

benchmark · arXiv · Feb 4, 2026

How Few-shot Demonstrations Affect Prompt-based Defenses Against LLM Jailbreak Attacks

Yanshu Wang, Shuaishuai Yang, Jingjing He et al. · Peking University

Shows that few-shot demonstrations strengthen role-oriented jailbreak defenses but degrade task-oriented defenses by up to 21% in LLMs.

Prompt Injection nlp