FedGuard: A Diverse-Byzantine-Robust Mechanism for Federated Learning with Major Malicious Clients
Haocheng Jiang, Hua Shen, Jixin Zhang, Willy Susilo, Mingwu Zhang
Published on arXiv (2508.00636)
Data Poisoning Attack
OWASP ML Top 10 — ML02
Key Finding
FedGuard significantly outperforms existing robust FL aggregation schemes even with 90% of clients Byzantine, seven attack types executed concurrently per round, and highly non-IID data
FedGuard
Novel technique introduced
Federated learning is a distributed training framework vulnerable to Byzantine attacks, particularly when more than 50% of clients are malicious or when client datasets are highly non-independent and identically distributed (non-IID). Moreover, most existing defenses target specific attack types (for example, gradient-similarity-based schemes can detect only outlier model poisoning), which limits their effectiveness. We propose FedGuard, a novel federated learning mechanism that addresses these issues by exploiting the high sensitivity of membership inference to model bias. Each client is required to include an additional mini-batch of server-specified data in its training; because a poisoned model's confidence on that mini-batch drops sharply, FedGuard can identify and exclude poisoned models. A comprehensive evaluation on three highly non-IID datasets, with 90% of clients Byzantine and seven different Byzantine attack types launched in each round, shows that FedGuard significantly outperforms existing robust federated learning schemes in mitigating various types of Byzantine attacks.
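The filtering step described above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: it assumes each client reports its model's logits on the server-specified mini-batch, scores each client by mean top-class softmax confidence, and drops clients below a threshold (the function name, input format, and `threshold` value are all hypothetical).

```python
import numpy as np

def fedguard_filter(client_logits, threshold=0.5):
    """Hypothetical FedGuard-style filter: client_logits is a list of
    (batch_size, num_classes) logit arrays, one per client, computed on the
    server-specified mini-batch. Clients whose mean top-class confidence on
    that batch falls below `threshold` are treated as poisoned and excluded.
    Returns the indices of clients kept for aggregation."""
    kept = []
    for i, logits in enumerate(client_logits):
        # Numerically stabilised softmax over the class dimension.
        z = logits - logits.max(axis=1, keepdims=True)
        probs = np.exp(z) / np.exp(z).sum(axis=1, keepdims=True)
        # Mean confidence in the predicted class across the mini-batch;
        # poisoned models lose confidence here, per the paper's key insight.
        confidence = probs.max(axis=1).mean()
        if confidence >= threshold:
            kept.append(i)
    return kept
```

In a full pipeline the server would then aggregate (e.g. average) only the updates from the returned indices; a confident model produces peaked logits and passes, while a poisoned model's near-uniform logits fail the check.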
Key Contributions
- FedGuard leverages high sensitivity of membership inference to model bias — poisoned models lose confidence on server-specified mini-batches, enabling detection without relying on gradient similarity
- Demonstrated robustness against 7 concurrent Byzantine attack types with up to 90% malicious clients under highly non-IID data distributions
- Overcomes limitations of gradient similarity-based defenses (which fail against stealthy similarity attacks) and trusted-data defenses (which degrade above ~60% malicious clients)
🛡️ Threat Analysis
FedGuard defends against Byzantine attacks in federated learning where malicious clients (up to 90%) send corrupted model updates to degrade global model performance — the canonical ML02 federated poisoning threat. The paper proposes a novel robust aggregation mechanism that identifies and excludes poisoned models using membership inference sensitivity as a detection signal.