Mary Llewellyn

h-index: 1 6 citations 4 papers (total)

Papers in Database (1)

benchmark arXiv Oct 7, 2025 · Oct 2025

Towards Reliable and Practical LLM Security Evaluations via Bayesian Modelling

Mary Llewellyn, Annie Gray, Josh Collyer et al. · The Alan Turing Institute · Loughborough University

Proposes Bayesian hierarchical evaluation framework with embedding clustering to reliably quantify LLM prompt injection vulnerability

Prompt Injection nlp
PDF