DETOUR: A Practical Backdoor Attack against Object Detection

Object detection (OD) is critical to real-world vision systems, yet existing backdoor attacks on detection transformers (DETRs) for OD tasks rely on patch-wise triggers optimized at fixed locations with minimal perturbations. Such attacks overlook that backdoor triggers in the real world may appear at different sizes, fields of view (FoVs), and locations in images, while minimal perturbations are difficult for cameras to capture, limiting attack practicality. We first observe that a patch-wise trigger in DETR delivers high attack effectiveness when activating the backdoor across neighboring locations, a phenomenon we term the trigger radiating effect (TRE). Meanwhile, inserting patch-wise triggers across multiple locations synergistically enhances TRE, resulting in high attack effectiveness across images. We propose DETOUR, a practical backdoor attack by using semantic triggers that are effective in real-world object detection systems. To ensure attack practicality, we rescale trigger patterns to different sizes and insert them at various predefined locations during backdoor training, enabling the model to recognize the trigger regardless of its spatial configurations. To address FoV variations in physical deployments, we extract the trigger pattern from a real-world object (e.g., a mug) captured under multiple FoVs and inject the trigger accordingly, promoting viewpoint-invariant backdoor activation and enhancing TRE across the entire image. As a result, the backdoor can be reliably activated under diverse FoVs and spatial configurations.

Key Contributions

Identifies trigger radiating effect (TRE) phenomenon where patch-wise triggers in DETRs activate backdoors across neighboring locations
Proposes semantic triggers extracted from real-world objects with viewpoint-invariant backdoor activation across multiple FoVs
Demonstrates practical backdoor attack resilient to spatial configuration variations (size, location, field of view)

🛡️ Threat Analysis

Model Poisoning

Proposes a backdoor injection method for object detection models that embeds hidden malicious behavior activated by semantic triggers. The attack trains models to recognize triggers across different sizes, locations, and fields of view, demonstrating targeted backdoor behavior in DETR models.

Details

Domains

vision

Model Types

transformer

Threat Tags

training_timetargetedphysical

Applications

2025 0 cit.

Model Poisoning

75%

DETOUR: A Practical Backdoor Attack against Object Detection

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

CIS-BA: Continuous Interaction Space Based Backdoor Attack for Object Detection in the Real-World

BadDet+: Robust Backdoor Attacks for Object Detection

Hammering the Diagnosis: Rowhammer-Induced Stealthy Trojan Attacks on ViT-Based Medical Imaging

DarkHash: A Data-Free Backdoor Attack Against Deep Hashing

DF-LoGiT: Data-Free Logic-Gated Backdoor Attacks in Vision Transformers

Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models

DSBA: Dynamic Stealthy Backdoor Attack with Collaborative Optimization in Self-Supervised Learning

Backdoor Poisoning Attack Against Face Spoofing Attack Detection Methods