Christopher White

h-index: 2 40 citations 17 papers (total)

Papers in Database (1)

benchmark arXiv Jan 30, 2026 · 9w ago

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

Mingqian Feng, Xiaodong Liu, Weiwei Yang et al. · University of Rochester · Microsoft Research

Statistical scaling law using Beta distributions to predict LLM jailbreak success rates at large N from small-budget measurements

Prompt Injection nlp
PDF