Grigorios Chrysos

Papers in Database (1)

benchmark arXiv Mar 21, 2026 · 16d ago

LJ-Bench: Ontology-Based Benchmark for U.S. Crime

Hung Yun Tseng, Wuzhen Li, Blerina Gkotse et al. · University of Wisconsin–Madison

Systematic benchmark evaluating LLM jailbreak robustness across 76 crime categories grounded in U.S. legal frameworks

Prompt Injection nlp
PDF Code