attack 2026

State Backdoor: Towards Stealthy Real-world Poisoning Attack on Vision-Language-Action Model in State Space

Ji Guo ¹, Wenbo Jiang ¹, Yansong Lin ¹, Yijing Liu ¹, Ruichen Zhang ², Guomin Lu ¹, Aiguo Chen ¹, Xinshuo Han ³, Hongwei Li ¹, Dusit Niyato ²

¹ University of Electronic Science and Technology of China

² Nanyang Technological University

³ Nanjing University of Aeronautics and Astronautics

1 citations · 45 references · arXiv

Published on arXiv

2601.04266

Model Poisoning

OWASP ML Top 10 — ML10

Data Poisoning Attack

OWASP ML Top 10 — ML02

Key Finding

State Backdoor achieves over 90% attack success rate across five VLA models and five real-world robotic tasks without degrading clean task performance.

State Backdoor / Preference-guided Genetic Algorithm (PGA)

Novel technique introduced

Vision-Language-Action (VLA) models are widely deployed in safety-critical embodied AI applications such as robotics. However, their complex multimodal interactions also expose new security vulnerabilities. In this paper, we investigate a backdoor threat in VLA models, where malicious inputs cause targeted misbehavior while preserving performance on clean data. Existing backdoor methods predominantly rely on inserting visible triggers into visual modality, which suffer from poor robustness and low insusceptibility in real-world settings due to environmental variability. To overcome these limitations, we introduce the State Backdoor, a novel and practical backdoor attack that leverages the robot arm's initial state as the trigger. To optimize trigger for insusceptibility and effectiveness, we design a Preference-guided Genetic Algorithm (PGA) that efficiently searches the state space for minimal yet potent triggers. Extensive experiments on five representative VLA models and five real-world tasks show that our method achieves over 90% attack success rate without affecting benign task performance, revealing an underexplored vulnerability in embodied AI systems.

Key Contributions

State Backdoor: uses robot arm's initial proprioceptive state as a stealthy, environment-stable backdoor trigger instead of fragile visual triggers
Preference-guided Genetic Algorithm (PGA) that searches the state space for minimal yet potent triggers optimized for stealthiness and effectiveness
Evaluation across five representative VLA models and five real-world robotic tasks, achieving >90% attack success rate with no benign performance degradation

🛡️ Threat Analysis

Data Poisoning Attack

The attack is implemented via data poisoning — selecting training data subsets, injecting triggered states, and relabeling corresponding actions to attacker-defined targets.

Model Poisoning

Core contribution is a backdoor attack embedding hidden trigger-activated targeted misbehavior in VLA models; the robot arm's initial state acts as the trigger, causing malicious actions only when triggered while preserving normal benign performance.

Details

Domains

visionmultimodal

Model Types

vlmmultimodaltransformer

Threat Tags

training_timetargetedphysical

Applications

roboticsembodied aivla model control systems

Read PDF arXiv DOI

State Backdoor: Towards Stealthy Real-world Poisoning Attack on Vision-Language-Action Model in State Space

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

SilentDrift: Exploiting Action Chunking for Stealthy Backdoor Attacks on Vision-Language-Action Models

ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training

AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents

MOBA: A Material-Oriented Backdoor Attack against LiDAR-based 3D Object Detection Systems

BadCLIP++: Stealthy and Persistent Backdoors in Multimodal Contrastive Learning

Pre-training CLIP against Data Poisoning with Optimal Transport-based Matching and Alignment

Goal-oriented Backdoor Attack against Vision-Language-Action Models via Physical Objects

Poisoning the Pixels: Revisiting Backdoor Attacks on Semantic Segmentation