Cheng Tan

h-index: 3 · 31 citations · 7 papers (total)

Papers in Database (1)

attack · arXiv · Sep 28, 2025

Quant Fever, Reasoning Blackholes, Schrodinger's Compliance, and More: Probing GPT-OSS-20B

Shuyi Lin, Tian Lu, Zikai Wang et al. · Northeastern University · Shanghai Jiao Tong University

Discovers five jailbreak failure modes in GPT-OSS-20B, introducing chain-oriented prompting and reasoning mirage attacks with 80% success rates

Prompt Injection · nlp