Byzantine-Robust Distributed SGD: A Unified Analysis and Tight Error Bounds

Byzantine-robust distributed optimization relies on robust aggregation rules to mitigate the influence of malicious Byzantine workers. Despite the proliferation of such rules, a unified convergence analysis framework that accommodates general data heterogeneity is lacking. In this work, we provide a thorough convergence theory of Byzantine-robust distributed stochastic gradient descent (SGD), analyzing variants both with and without local momentum. We establish the convergence rates for nonconvex smooth objectives and those satisfying the Polyak-Lojasiewicz condition under a general data heterogeneity assumption. Our analysis reveals that while stochasticity and data heterogeneity introduce unavoidable error floors, local momentum provably reduces the error component induced by stochasticity. Furthermore, we derive matching lower bounds to demonstrate that the upper bounds obtained in our analysis are tight and characterize the fundamental limits of Byzantine resilience under stochasticity and data heterogeneity. Empirical results support our theoretical findings.
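For readers unfamiliar with two terms used in the abstract, the block below states them in their standard textbook form. The symbols (f, f^\star, \mu, \beta, m_i^t, g_i^t) are assumptions chosen for illustration, and the paper's own notation and exact momentum convention may differ.

```latex
% Polyak-Lojasiewicz (PL) condition with constant \mu > 0,
% where f^\star = \inf_x f(x) (standard definition; notation assumed):
\frac{1}{2}\,\lVert \nabla f(x) \rVert^2 \;\ge\; \mu\,\bigl(f(x) - f^\star\bigr)
\quad \text{for all } x.

% Local momentum kept by worker i, written as an exponential moving average
% of its stochastic gradients g_i^t with \beta \in [0, 1) (one common convention):
m_i^t = \beta\, m_i^{t-1} + (1 - \beta)\, g_i^t .
```

Under this convention, each worker sends its momentum vector rather than its raw stochastic gradient to the server, which then applies a robust aggregation rule.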

Key Contributions

Provides a unified convergence analysis of Byzantine-robust distributed SGD, covering variants with and without local momentum, under general data heterogeneity
Establishes tight convergence rates for nonconvex smooth and PL objectives, and proves that local momentum reduces the error component induced by stochasticity
Derives matching lower bounds showing that the upper bounds are tight and characterizing the fundamental limits of Byzantine resilience

🛡️ Threat Analysis

Data Poisoning Attack

The paper analyzes defenses (robust aggregation rules) against Byzantine workers who corrupt training by sending arbitrary malicious updates. This is data poisoning at training time in distributed/federated learning. The paper establishes convergence guarantees and fundamental limits of Byzantine-resilient aggregation under adversarial conditions.
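To make the threat concrete, here is a minimal sketch of why plain averaging fails against a single Byzantine worker while a robust aggregation rule does not. It uses the coordinate-wise median, which is one well-known robust rule and is chosen here only for illustration; the summary above does not say which rules the paper covers. All function names, the toy gradients, and the momentum coefficient 0.9 are hypothetical.

```python
import statistics


def mean_aggregate(vectors):
    """Plain averaging: one arbitrary vector can shift the result without bound."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def coordinate_median(vectors):
    """Coordinate-wise median (illustrative choice of robust aggregation rule)."""
    return [statistics.median(col) for col in zip(*vectors)]


def local_momentum(prev, grad, beta):
    """Worker-side momentum, assumed form: m_t = beta * m_{t-1} + (1 - beta) * g_t."""
    return [beta * m + (1.0 - beta) * g for m, g in zip(prev, grad)]


# Four honest workers report gradients near (1, -2); one Byzantine worker
# sends an arbitrary malicious update.
honest = [[1.0, -2.0], [1.1, -1.9], [0.9, -2.1], [1.0, -2.0]]
byzantine = [[1000.0, 1000.0]]
updates = honest + byzantine

avg = mean_aggregate(updates)     # dominated by the malicious vector
med = coordinate_median(updates)  # stays at the honest consensus

# One local momentum step on an honest worker, starting from zero momentum.
m1 = local_momentum([0.0, 0.0], honest[0], beta=0.9)

print("mean:  ", avg)
print("median:", med)
print("momentum after one step:", m1)
```

In this toy run the median returns the honest value (1.0, -2.0) exactly, while the mean is pulled far away in both coordinates. The median tolerates the attack only because honest workers form a majority in every coordinate.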

Details

Domains

federated-learning

Model Types

federated

Threat Tags

training_time

Applications

2026 0 cit.

Data Poisoning Attack

100%

Similar Papers

ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization

Reconciling Communication Compression and Byzantine-Robustness in Distributed Learning

Beyond Trade-offs: A Unified Framework for Privacy, Robustness, and Communication Efficiency in Federated Learning

Enhancing Split Learning with Sharded and Blockchain-Enabled SplitFed Approaches

Enhancing Robustness of Federated Learning via Server Learning

Robust and Efficient Collaborative Learning

SecureAFL: Secure Asynchronous Federated Learning

FedIDM: Achieving Fast and Stable Convergence in Byzantine Federated Learning through Iterative Distribution Matching