Muhao Chen

h-index: 2 12 citations 12 papers (total)

Papers in Database (2)

attack arXiv Oct 4, 2025 · Oct 2025

Less Diverse, Less Safe: The Indirect But Pervasive Risk of Test-Time Scaling in Large Language Models

Shahriar Kabir Nahin, Hadi Askari, Muhao Chen et al. · University of South Florida · University of California

RefDiv exploits candidate diversity reduction in test-time scaling to bypass LLM safety guardrails, surpassing direct adversarial prompts

Prompt Injection nlp
1 citations PDF
defense arXiv Dec 2, 2025 · Dec 2025

OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning

Boyu Zhu, Xiaofei Wen, Wenjie Jacky Mo et al. · Fudan University · University of California +1 more

Omni-modal guardrail system with deliberate reasoning to block unsafe LLM outputs across text, image, video, and audio

Prompt Injection nlpvisionaudiomultimodal
PDF