defense 2025

FLClear: Visually Verifiable Multi-Client Watermarking for Federated Learning

Chen Gu , Yingying Sun , Yifan She , Donghui Hu

0 citations · 60 references · arXiv

α

Published on arXiv

2511.12663

Model Theft

OWASP ML Top 10 — ML05

Key Finding

FLClear consistently outperforms state-of-the-art FL watermarking methods across various datasets, aggregation schemes, and attack scenarios while enabling intuitive visual ownership verification.

FLClear

Novel technique introduced


Federated learning (FL) enables multiple clients to collaboratively train a shared global model while preserving the privacy of their local data. Within this paradigm, the intellectual property rights (IPR) of client models are critical assets that must be protected. In practice, the central server responsible for maintaining the global model may maliciously manipulate the global model to erase client contributions or falsely claim sole ownership, thereby infringing on clients' IPR. Watermarking has emerged as a promising technique for asserting model ownership and protecting intellectual property. However, existing FL watermarking approaches remain limited, suffering from potential watermark collisions among clients, insufficient watermark security, and non-intuitive verification mechanisms. In this paper, we propose FLClear, a novel framework that simultaneously achieves collision-free watermark aggregation, enhanced watermark security, and visually interpretable ownership verification. Specifically, FLClear introduces a transposed model jointly optimized with contrastive learning to integrate the watermarking and main task objectives. During verification, the watermark is reconstructed from the transposed model and evaluated through both visual inspection and structural similarity metrics, enabling intuitive and quantitative ownership verification. Comprehensive experiments conducted over various datasets, aggregation schemes, and attack scenarios demonstrate the effectiveness of FLClear and confirm that it consistently outperforms state-of-the-art FL watermarking methods.


Key Contributions

  • FLClear framework achieving collision-free watermark aggregation across multiple FL clients via a transposed model jointly optimized with contrastive learning
  • Visually interpretable ownership verification using both visual inspection and structural similarity (SSIM) metrics reconstructed from the transposed model
  • Empirical demonstration of robustness against watermark removal attacks across multiple datasets and aggregation schemes, outperforming state-of-the-art FL watermarking methods

🛡️ Threat Analysis

Model Theft

Watermarks are embedded IN THE MODEL (via a jointly trained transposed model) to assert and verify ownership when a malicious FL server attempts to erase client contributions or falsely claim sole ownership — classic model IP theft defense.


Details

Domains
federated-learning
Model Types
federated
Threat Tags
training_timewhite_box
Applications
federated learning model ip protectionmodel ownership verification