Hadi Askari

h-index: 5 79 citations 10 papers (total)

Papers in Database (1)

attack arXiv Oct 4, 2025 · Oct 2025

Less Diverse, Less Safe: The Indirect But Pervasive Risk of Test-Time Scaling in Large Language Models

Shahriar Kabir Nahin, Hadi Askari, Muhao Chen et al. · University of South Florida · University of California

RefDiv exploits candidate diversity reduction in test-time scaling to bypass LLM safety guardrails, surpassing direct adversarial prompts

Prompt Injection nlp
1 citations PDF