ML Security Papers

ML Security Papers

Latest papers

1 papers

benchmark arXiv Sep 9, 2025 · Sep 2025

SafeToolBench: Pioneering a Prospective Benchmark to Evaluating Tool Utilization Safety in LLMs

Hongfei Xia, Hongru Wang, Zeming Liu et al. · Beijing Institute of Technology · The Chinese University of Hong Kong +2 more

Proposes benchmark and safety framework for prospective LLM tool-call risk assessment before irreversible harmful actions execute

Insecure Plugin Design Excessive Agency nlp