attack arXiv Jan 28, 2026 · 9w ago
Kealan Dunnett, Reza Arablouei, Dimity Miller et al. · Queensland University of Technology · Commonwealth Scientific and Industrial Research Organisation
Backdoor attack framework for object detection unifying misclassification and object disappearance attacks with improved physical-world robustness
Model Poisoning vision
Backdoor attacks pose a severe threat to deep learning, yet their impact on object detection remains poorly understood compared to image classification. While attacks have been proposed, we identify critical weaknesses in existing detection-based methods, specifically their reliance on unrealistic assumptions and a lack of physical validation. To bridge this gap, we introduce BadDet+, a penalty-based framework that unifies Region Misclassification Attacks (RMA) and Object Disappearance Attacks (ODA). The core mechanism utilizes a log-barrier penalty to suppress true-class predictions for triggered inputs, resulting in (i) position and scale invariance, and (ii) enhanced physical robustness. On real-world benchmarks, BadDet+ achieves superior synthetic-to-physical transfer compared to existing RMA and ODA baselines while preserving clean performance. Theoretical analysis confirms the proposed penalty acts within a trigger-specific feature subspace, reliably inducing attacks without degrading standard inference. These results highlight significant vulnerabilities in object detection and the necessity for specialized defenses.
cnn transformer Queensland University of Technology · Commonwealth Scientific and Industrial Research Organisation
defense arXiv Feb 20, 2026 · 6w ago
Ehsan Lari, Reza Arablouei, Stefan Werner · Norwegian University of Science and Technology · Commonwealth Scientific and Industrial Research Organisation +1 more
Defends federated learning against Byzantine poisoning attacks end-to-end via partial update sharing and distance-based calibration filtering
Data Poisoning Attack federated-learning
We propose PRISM-FCP (Partial shaRing and robust calIbration with Statistical Margins for Federated Conformal Prediction), a Byzantine-resilient federated conformal prediction framework that utilizes partial model sharing to improve robustness against Byzantine attacks during both model training and conformal calibration. Existing approaches address adversarial behavior only in the calibration stage, leaving the learned model susceptible to poisoned updates. In contrast, PRISM-FCP mitigates attacks end-to-end. During training, clients partially share updates by transmitting only $M$ of $D$ parameters per round. This attenuates the expected energy of an adversary's perturbation in the aggregated update by a factor of $M/D$, yielding lower mean-square error (MSE) and tighter prediction intervals. During calibration, clients convert nonconformity scores into characterization vectors, compute distance-based maliciousness scores, and downweight or filter suspected Byzantine contributions before estimating the conformal quantile. Extensive experiments on both synthetic data and the UCI Superconductivity dataset demonstrate that PRISM-FCP maintains nominal coverage guarantees under Byzantine attacks while avoiding the interval inflation observed in standard FCP with reduced communication, providing a robust and communication-efficient approach to federated uncertainty quantification.
federated traditional_ml Norwegian University of Science and Technology · Commonwealth Scientific and Industrial Research Organisation · Aalto University