Rachel Longjohn

h-index: 2 19 citations 3 papers (total)

Papers in Database (1)

benchmark arXiv Nov 4, 2025 · Nov 2025

Bayesian Evaluation of Large Language Model Behavior

Rachel Longjohn, Shang Wu, Saatvik Kher et al. · University of California

Bayesian framework for statistically rigorous evaluation of LLM safety behaviors like jailbreak refusal rates and information leakage

Prompt Injection nlp
1 citations PDF