attack arXiv Aug 12, 2025 · Aug 2025
Alexandrine Fortier, Sonal Joshi, Thomas Thebaud et al. · École de technologie supérieure · Johns Hopkins University
Multi-target backdoor attack on speaker recognition using clicking-sound triggers, poisoning up to 50 speakers at 95% success rate
Model Poisoning audio
In this work, we propose a multi-target backdoor attack against speaker identification using position-independent clicking sounds as triggers. Unlike previous single-target approaches, our method targets up to 50 speakers simultaneously, achieving success rates of up to 95.04%. To simulate more realistic attack conditions, we vary the signal-to-noise ratio between speech and trigger, demonstrating a trade-off between stealth and effectiveness. We further extend the attack to the speaker verification task by selecting the most similar training speaker - based on cosine similarity - as a proxy target. The attack is most effective when target and enrolled speaker pairs are highly similar, reaching success rates of up to 90% in such cases.
cnn École de technologie supérieure · Johns Hopkins University