Adversarial Patch Attacks on Vision-Based Cargo Occupancy Estimation via Differentiable 3D Simulation
Mohamed Rissal Hedna, Sesugh Samuel Nder
Published on arXiv (arXiv:2511.19254)
OWASP ML Top 10 — ML01: Input Manipulation Attack
Key Finding
3D-optimized adversarial patches achieve an 84.94% attack success rate for denial-of-service (empty→full) attacks and 30.32% for concealment (full→empty) attacks against a cargo-occupancy CNN.
Novel Technique Introduced
Differentiable 3D Patch Optimization (Mitsuba 3 + EOT)
Computer vision systems are increasingly adopted in modern logistics operations, including the estimation of trailer occupancy for planning, routing, and billing. Although effective, such systems may be vulnerable to physical adversarial attacks, particularly adversarial patches that can be printed and placed on interior surfaces. In this work, we study the feasibility of such attacks on a convolutional cargo-occupancy classifier using fully simulated 3D environments. Using Mitsuba 3 for differentiable rendering, we optimize patch textures across variations in geometry, lighting, and viewpoint, and compare their effectiveness to a 2D compositing baseline. Our experiments demonstrate that 3D-optimized patches achieve high attack success rates, especially in a denial-of-service scenario (empty to full), where success reaches 84.94 percent. Concealment attacks (full to empty) prove more challenging but still reach 30.32 percent. We analyze the factors influencing attack success, discuss implications for the security of automated logistics pipelines, and highlight directions for strengthening physical robustness. To our knowledge, this is the first study to investigate adversarial patch attacks for cargo-occupancy estimation in physically realistic, fully simulated 3D scenes.
Key Contributions
- Differentiable 3D simulation pipeline using Mitsuba 3 for adversarial patch optimization in cargo trailer environments, with surface-aware patch placement sampling.
- Comparison of 2D compositing vs. 3D scene-space patch optimization across 5,910 rendered scenes, demonstrating the superiority of 3D optimization.
- Systematic evaluation of two attack scenarios — denial-of-service (empty→full, 84.94%) and concealment (full→empty, 30.32%) — against a logistics occupancy classifier.
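The scenario-level success rates above can be computed from per-scene predictions with a simple metric. A hedged sketch follows; the function name and the eligibility rule (counting only scenes correctly classified as the source class before the attack) are assumptions, since this summary does not spell out the paper's exact evaluation protocol.

```python
def attack_success_rate(clean_preds, adv_preds, source, target):
    """Fraction of eligible scenes flipped to the target class.

    Eligible scenes are those the model classified as `source` on the
    clean (unpatched) render; hypothetical definition, the paper's
    protocol may differ.
    """
    eligible = [(c, a) for c, a in zip(clean_preds, adv_preds) if c == source]
    if not eligible:
        return 0.0
    hits = sum(1 for _, a in eligible if a == target)
    return hits / len(eligible)

# Denial-of-service scenario: empty (0) -> full (1), on 5 toy scenes.
clean = [0, 0, 0, 0, 1]
adv   = [1, 1, 0, 1, 1]
print(attack_success_rate(clean, adv, source=0, target=1))  # 0.75
```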
🛡️ Threat Analysis
Proposes physical adversarial patch attacks (printable and placeable in real trailer interiors) that cause targeted misclassification of a CNN occupancy classifier at inference time: a core adversarial patch / physical adversarial example attack falling directly under ML01.