Unsupervised Backdoor Detection and Mitigation for Spiking Neural Networks

Spiking Neural Networks (SNNs) have gained increasing attention for their superior energy efficiency compared to Artificial Neural Networks (ANNs). However, their security aspects, particularly under backdoor attacks, have received limited attention. Existing defense methods developed for ANNs perform poorly or can be easily bypassed in SNNs due to their event-driven and temporal dependencies. This paper identifies the key blockers that hinder traditional backdoor defenses in SNNs and proposes an unsupervised post-training detection framework, Temporal Membrane Potential Backdoor Detection (TMPBD), to overcome these challenges. TMPBD leverages the maximum margin statistics of temporal membrane potential (TMP) in the final spiking layer to detect target labels without any attack knowledge or data access. We further introduce a robust mitigation mechanism, Neural Dendrites Suppression Backdoor Mitigation (NDSBM), which clamps dendritic connections between early convolutional layers to suppress malicious neurons while preserving benign behaviors, guided by TMP extracted from a small, clean, unlabeled dataset. Extensive experiments on multiple neuromorphic benchmarks and state-of-the-art input-aware dynamic trigger attacks demonstrate that TMPBD achieves 100% detection accuracy, while NDSBM reduces the attack success rate from 100% to 8.44%, and to 2.81% when combined with detection, without degrading clean accuracy.

Key Contributions

TMPBD: unsupervised post-training backdoor detection using maximum margin statistics of temporal membrane potential in the final spiking layer — achieves 100% detection accuracy without attack knowledge or data access
NDSBM: mitigation mechanism that clamps dendritic weights between early convolutional layers to suppress malicious neurons, guided by TMP from a small clean unlabeled dataset
First comprehensive backdoor defense framework dedicated to SNNs, identifying and addressing fundamental blockers that prevent ANN defenses from transferring to the SNN setting

🛡️ Threat Analysis

Model Poisoning

Paper proposes two dedicated defense mechanisms (TMPBD and NDSBM) to detect and mitigate backdoor/trojan attacks in SNNs — TMPBD detects target labels of hidden backdoor triggers, while NDSBM suppresses malicious neurons to neutralize embedded backdoor behavior, reducing ASR from 100% to 2.81%.

Details

Domains

vision

Model Types

cnn

Threat Tags

training_timetargeted

Datasets

N-MNISTN-CALTECH101CIFAR10-DVS

Applications

2025 0 cit.

Model Poisoning

82%

Unsupervised Backdoor Detection and Mitigation for Spiking Neural Networks

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Backdoor Mitigation via Invertible Pruning Masks

Isolate Trigger: Detecting and Eliminating Adaptive Backdoor Attacks

NT-ML: Backdoor Defense via Non-target Label Training and Mutual Learning

Illuminating the Black Box: Real-Time Monitoring of Backdoor Unlearning in CNNs via Explainable AI

Robust Backdoor Removal by Reconstructing Trigger-Activated Changes in Latent Representation

Prototype-Guided Robust Learning against Backdoor Attacks

RPP: A Certified Poisoned-Sample Detection Framework for Backdoor Attacks under Dataset Imbalance

Improving the Sensitivity of Backdoor Detectors via Class Subspace Orthogonalization