Fred Morstatter

benchmark arXiv Nov 1, 2025 · Nov 2025

Berk Atil, Rebecca J. Passonneau, Fred Morstatter · Penn State University · Information Sciences Institute

Benchmarks multilingual jailbreak attacks and defenses across ten languages and six LLMs, finding language-dependent safety gaps

Prompt Injection nlp

1 citations PDF

benchmark arXiv Oct 29, 2025 · Oct 2025

Abha Jha, Abel Salinas, Fred Morstatter · USC Information Sciences Institute

Benchmarks LLM safety mechanisms against bioweapons-domain prompts using knowledge graphs and RAG to expose harmful output vulnerabilities

Prompt Injection nlp

Papers in Database (2)