ML Security Papers

Latest papers

1 paper

benchmark arXiv Jan 7, 2026

RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models

Quy-Anh Dang, Chris Ngo, Truong-Son Hy · VNU University of Science · Knovel +1 more

Aggregates 37 red-teaming datasets into a unified LLM benchmark with a standardized taxonomy spanning 22 risk categories

Prompt Injection nlp