benchmark arXiv Mar 20, 2026
Krzysztof Kotowski, Ramez Shendy, Jakub Nalepa et al. · KP Labs · Silesian University of Technology +4 more
Kaggle competition benchmark for detecting backdoor triggers in time series forecasting models for spacecraft telemetry
Model Poisoning timeseries
Forecasting plays a crucial role in modern safety-critical applications, such as space operations. However, the increasing use of deep forecasting models introduces a new security risk of trojan horse attacks, carried out by hiding a backdoor in the training data or directly in the model weights. Once implanted, the backdoor is activated by a specific trigger pattern at test time, causing the model to produce manipulated predictions. We focus on this issue in our Trojan Horse Hunt data science competition, where more than 200 teams faced the task of identifying triggers hidden in deep forecasting models for spacecraft telemetry. We describe the novel task formulation, benchmark set, evaluation protocol, and best solutions from the competition. We further summarize key insights and research directions for effective identification of triggers in time series forecasting models. All materials are publicly available on the official competition webpage: https://www.kaggle.com/competitions/trojan-horse-hunt-in-space.
traditional_ml KP Labs · Silesian University of Technology · Warsaw University of Technology +3 more