Ming Shan Hee

benchmark · arXiv · Sep 18, 2025

Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore's Low-Resource Languages

Yujia Hu, Ming Shan Hee, Preslav Nakov et al. · Singapore University of Technology and Design · Mohamed bin Zayed University of Artificial Intelligence

Benchmarks multilingual LLM safety guardrails via red-teaming across Singlish, Chinese, Malay, and Tamil toxic prompts

Prompt Injection nlp
