Bo Wen

h-index: 1 · 7 citations · 2 papers (total)

Papers in Database (1)

attack · arXiv · Sep 28, 2025

Quant Fever, Reasoning Blackholes, Schrodinger's Compliance, and More: Probing GPT-OSS-20B

Shuyi Lin, Tian Lu, Zikai Wang et al. · Northeastern University · Shanghai Jiao Tong University

Discovers five jailbreak failure modes in GPT-OSS-20B, introducing chain-oriented prompting and reasoning mirage attacks with 80% success rates

Prompt Injection nlp