Adel Khorramrouz

h-index: 1 1 citations 1 papers (total)

Papers in Database (1)

benchmark arXiv Oct 31, 2025 · Oct 2025

Characterizing Selective Refusal Bias in Large Language Models

Adel Khorramrouz, Sharon Levy · Rutgers University

Reveals demographic-selective bias in LLM safety guardrails and exploits it via indirect jailbreak attacks on refused groups

Prompt Injection nlp
1 citations PDF