Fred Morstatter

h-index: 7 191 citations 16 papers (total)

Papers in Database (2)

benchmark arXiv Nov 1, 2025 · Nov 2025

Do Methods to Jailbreak and Defend LLMs Generalize Across Languages?

Berk Atil, Rebecca J. Passonneau, Fred Morstatter · Penn State University · Information Sciences Institute

Benchmarks multilingual jailbreak attacks and defenses across ten languages and six LLMs, finding language-dependent safety gaps

Prompt Injection nlp
1 citations PDF
benchmark arXiv Oct 29, 2025 · Oct 2025

Knowledge Graph Analysis of Legal Understanding and Violations in LLMs

Abha Jha, Abel Salinas, Fred Morstatter · USC Information Sciences Institute

Benchmarks LLM safety mechanisms against bioweapons-domain prompts using knowledge graphs and RAG to expose harmful output vulnerabilities

Prompt Injection nlp
PDF