Latest papers

3 papers
attack arXiv Dec 14, 2025 · Dec 2025

COBRA: Catastrophic Bit-flip Reliability Analysis of State-Space Models

Sanjay Das, Swastik Bhattacharya, Shamik Kundu et al. · University of Texas at Dallas · Intel Corporation +1 more

Bit-flip attack framework shows flipping one bit in Mamba-1.4b collapses accuracy from 74.64% to 0% on LAMBADA

Model Poisoning nlp
PDF Code
benchmark arXiv Oct 8, 2025 · Oct 2025

PEAR: Planner-Executor Agent Robustness Benchmark

Shen Dong, Mingxuan Zhang, Pengfei He et al. · Michigan State University · Purdue University +1 more

Benchmark for evaluating adversarial robustness of LLM planner-executor multi-agent systems across harmful action, privacy, and DoS attacks

Prompt Injection Excessive Agency nlp
PDF Code
attack arXiv Sep 3, 2025 · Sep 2025

ANNIE: Be Careful of Your Robots

Yiyang Huang, Zixuan Wang, Zishen Wan et al. · Chinese Academy of Sciences · Georgia Institute of Technology +1 more

Adversarial visual perturbations on VLA robot models cause ISO-defined safety violations with 50%+ success, validated on physical robots

Input Manipulation Attack Prompt Injection visionmultimodalreinforcement-learningnlp
PDF Code