Fatih Uenal

Papers in Database (1)

benchmark arXiv Apr 7, 2026 · 6w ago

Swiss-Bench 003: Evaluating LLM Reliability and Adversarial Security for Swiss Regulatory Contexts

Fatih Uenal · University of Colorado Boulder

Benchmark evaluating LLM security across prompt injection, PII extraction, and system prompt leakage for Swiss regulatory compliance

Prompt Injection Sensitive Information Disclosure nlp
PDF