SpectralKrum: A Spectral-Geometric Defense Against Byzantine Attacks in Federated Learning
Aditya Tripathi 1, Karan Sharma 1, Rahul Mishra 2, Tapas Kumar Maiti 1
Published on arXiv
2512.11760
Data Poisoning Attack
OWASP ML Top 10 — ML02
Key Finding
Across 56,000+ training rounds, SpectralKrum matches or exceeds baselines against directional and subspace-aware Byzantine attacks (adaptive-steer, buffer-drift), but provides no consistent advantage over simpler statistical aggregators when malicious updates are spectrally indistinguishable from benign ones (label-flip, min-max)
SpectralKrum
Novel technique introduced
Federated Learning (FL) distributes model training across clients who retain their data locally, but this architecture exposes a fundamental vulnerability: Byzantine clients can inject arbitrarily corrupted updates that degrade or subvert the global model. While robust aggregation methods (including Krum, Bulyan, and coordinate-wise defenses) offer theoretical guarantees under idealized assumptions, their effectiveness erodes substantially when client data distributions are heterogeneous (non-IID) and adversaries can observe or approximate the defense mechanism. This paper introduces SpectralKrum, a defense that fuses spectral subspace estimation with geometric neighbor-based selection. The core insight is that benign optimization trajectories, despite per-client heterogeneity, concentrate near a low-dimensional manifold that can be estimated from historical aggregates. SpectralKrum projects incoming updates into this learned subspace, applies Krum selection in compressed coordinates, and filters candidates whose orthogonal residual energy exceeds a data-driven threshold. The method requires no auxiliary data, operates entirely on model updates, and preserves FL privacy properties. We evaluate SpectralKrum against eight robust baselines across seven attack scenarios on CIFAR-10 with Dirichlet-distributed non-IID partitions (alpha = 0.1). Experiments spanning over 56,000 training rounds show that SpectralKrum is competitive against directional and subspace-aware attacks (adaptive-steer, buffer-drift), but offers limited advantage under label-flip and min-max attacks where malicious updates remain spectrally indistinguishable from benign ones.
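The pipeline in the abstract (estimate a benign subspace from historical aggregates, project incoming updates, run Krum in compressed coordinates, and filter by orthogonal residual energy) can be sketched in NumPy. This is an illustrative reconstruction, not the paper's exact method: the quantile-based residual threshold, the window-based PCA, and the hyperparameter names (`k`, `f`, `tau_quantile`) are our assumptions.

```python
import numpy as np

def spectral_krum(updates, history, k=5, f=1, tau_quantile=0.9):
    """Illustrative sketch of the SpectralKrum pipeline.

    updates : (n_clients, d) array of flattened client updates
    history : (t, d) array of past aggregated updates (rolling window)
    k       : assumed subspace dimension
    f       : assumed number of Byzantine clients tolerated
    """
    # 1. Estimate a low-dimensional benign subspace from history via PCA.
    H = history - history.mean(axis=0)
    _, _, Vt = np.linalg.svd(H, full_matrices=False)
    V = Vt[:k].T                                  # (d, k) orthonormal basis

    # 2. Project updates into the subspace; measure orthogonal residual.
    coords = updates @ V                          # compressed coordinates
    recon = coords @ V.T                          # reconstruction in full space
    residual = np.linalg.norm(updates - recon, axis=1)

    # 3. Filter candidates whose residual energy exceeds a data-driven
    #    threshold (a quantile rule here; the paper's rule may differ).
    tau = np.quantile(residual, tau_quantile)
    keep = np.where(residual <= tau)[0]

    # 4. Krum selection in compressed coordinates among the survivors:
    #    score each candidate by its summed squared distances to its
    #    m nearest neighbors, and select the minimum-score client.
    c = coords[keep]
    n = len(keep)
    m = max(n - f - 2, 1)
    dists = np.linalg.norm(c[:, None, :] - c[None, :, :], axis=2) ** 2
    scores = np.array([np.sort(dists[i])[1:m + 1].sum() for i in range(n)])
    return int(keep[np.argmin(scores)])           # index of selected client
```

Because selection happens in the k-dimensional coordinates, the Krum distance computations are cheap even for large models; the residual filter is what catches updates that leave the learned manifold entirely.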
Key Contributions
- SpectralKrum algorithm combining rolling PCA subspace estimation with Krum geometric neighbor selection to filter Byzantine updates in non-IID FL settings
- Orthogonal energy filtering step that flags updates deviating from the learned benign optimization manifold using a data-driven residual threshold
- Rigorous empirical characterization of when spectral geometry aids Byzantine defense (directional/subspace-aware attacks) and when it fails (spectrally indistinguishable attacks like label-flip and min-max)
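The orthogonal energy filtering contribution above could be instantiated in several ways; one plausible sketch uses a robust median + MAD rule as the data-driven threshold. The MAD rule and the constant `c` are our assumptions for illustration, not the paper's stated choice.

```python
import numpy as np

def residual_energy_filter(updates, basis, c=3.0):
    """Hypothetical instantiation of the orthogonal-energy filter:
    flag updates whose energy outside the learned subspace is an
    outlier under a median + c*MAD rule (c is an assumed constant).

    updates : (n, d) client updates
    basis   : (d, k) orthonormal basis of the estimated benign subspace
    """
    recon = (updates @ basis) @ basis.T              # projection onto subspace
    energy = np.sum((updates - recon) ** 2, axis=1)  # orthogonal residual energy
    med = np.median(energy)
    mad = np.median(np.abs(energy - med)) + 1e-12    # guard against zero MAD
    threshold = med + c * mad
    return energy <= threshold                       # True = kept as candidate
```

A median/MAD threshold is itself robust to a minority of extreme residuals, which matters here: a few Byzantine updates with huge orthogonal energy should not inflate the threshold that is meant to exclude them.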
🛡️ Threat Analysis
Directly defends against Byzantine clients in FL who send arbitrarily corrupted model updates (sign-flip, label-flip, min-max, adaptive-steer, buffer-drift) to degrade the global model: the classic Byzantine/poisoning threat in federated learning. SpectralKrum is a robust aggregation method designed to detect and filter these malicious updates.
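To make the threat concrete, here is a toy simulation (our own, not from the paper) of a sign-flip Byzantine client: it negates and amplifies its honest update, dragging a naive mean aggregate away from the benign descent direction.

```python
import numpy as np

rng = np.random.default_rng(42)
true_direction = np.ones(10)        # the direction benign clients agree on

# Nine benign clients submit noisy versions of the true update.
benign = true_direction + 0.1 * rng.standard_normal((9, 10))

# One Byzantine client flips the sign and amplifies its update.
byzantine = -10.0 * true_direction

all_updates = np.vstack([benign, byzantine[None, :]])
naive_mean = all_updates.mean(axis=0)

def cos(a, b):
    """Cosine similarity between two update vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# The benign-only mean stays aligned with the true direction, while a
# single amplified attacker pushes the naive mean off course (negative
# alignment here, since the attack outweighs the nine honest clients).
print(cos(benign.mean(axis=0), true_direction))
print(cos(naive_mean, true_direction))
```

This is exactly the failure mode robust aggregators such as Krum, Bulyan, and SpectralKrum are built to prevent: a single unbounded update should not be able to dominate the aggregate.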