Dan Vann

h-index: 6 134 citations 10 papers (total)

Papers in Database (2)

attack EMNLP Sep 23, 2025 · Sep 2025

Anecdoctoring: Automated Red-Teaming Across Language and Place

Alejandro Cuevas, Saloni Dash, Bharat Kumar Nayak et al. · Carnegie Mellon University · Microsoft Research +2 more

Automated multilingual red-teaming attack elicits LLM disinformation using knowledge graph-augmented adversarial prompt generation

Prompt Injection nlp
2 citations 1 influentialPDF
benchmark arXiv Jan 26, 2026 · 10w ago

Comparison requires valid measurement: Rethinking attack success rate comparisons in AI red teaming

Alexandra Chouldechova, A. Feder Cooper, Solon Barocas et al. · Microsoft Research · Microsoft

Critiques LLM jailbreak ASR comparisons as methodologically invalid using social science measurement theory and inferential statistics

Prompt Injection nlp
1 citations PDF