Adel Khorramrouz

benchmark arXiv Oct 31, 2025 · Oct 2025

Adel Khorramrouz, Sharon Levy · Rutgers University

Reveals demographic-selective bias in LLM safety guardrails and exploits it via indirect jailbreak attacks on refused groups

Prompt Injection nlp

1 citations PDF

Papers in Database (1)