ML Security Papers

Latest papers

3 papers

attack arXiv Dec 14, 2025 · Dec 2025

Sanjay Das, Swastik Bhattacharya, Shamik Kundu et al. · University of Texas at Dallas · Intel Corporation +1 more

Bit-flip attack framework shows flipping one bit in Mamba-1.4b collapses accuracy from 74.64% to 0% on LAMBADA

Model Poisoning nlp

benchmark arXiv Oct 8, 2025 · Oct 2025

Shen Dong, Mingxuan Zhang, Pengfei He et al. · Michigan State University · Purdue University +1 more

Benchmark for evaluating adversarial robustness of LLM planner-executor multi-agent systems across harmful action, privacy, and DoS attacks

Prompt Injection Excessive Agency nlp

attack arXiv Sep 3, 2025 · Sep 2025

Yiyang Huang, Zixuan Wang, Zishen Wan et al. · Chinese Academy of Sciences · Georgia Institute of Technology +1 more

Adversarial visual perturbations on VLA robot models cause ISO-defined safety violations with 50%+ success, validated on physical robots

Input Manipulation Attack Prompt Injection visionmultimodalreinforcement-learningnlp