Yuqing Kong

h-index: 3 39 citations 5 papers (total)

Papers in Database (1)

attack arXiv Jan 31, 2026 · 9w ago

Jailbreaking LLMs via Calibration

Yuxuan Lu, Yongkang Guo, Yuqing Kong · Peking University

Recasts Weak-to-Strong LLM jailbreaking as forecast aggregation, deriving optimal logit-space strategies that beat existing methods on frontier models

Prompt Injection nlp
PDF