Robust Client-Server Watermarking for Split Federated Learning
Jiaxiong Tang 1, Zhengchunmin Dai 1, Liantao Wu 1, Peng Sun 1, Honglong Chen 2, Zhenfu Cao 3
Published on arXiv
2511.13598
Model Theft
OWASP ML Top 10 — ML05
Key Finding
RISE achieves over 95% watermark detection rate (p-value < 0.03) across most settings with no mutual interference between client and server watermarks and robustness against removal attacks.
RISE
Novel technique introduced
Split Federated Learning (SFL) is renowned for its privacy-preserving nature and low computational overhead among decentralized machine learning paradigms. In this framework, clients employ lightweight models to process private data locally and transmit intermediate outputs to a powerful server for further computation. However, SFL is a double-edged sword: while it enables edge computing and enhances privacy, it also introduces intellectual property ambiguity as both clients and the server jointly contribute to training. Existing watermarking techniques fail to protect both sides since no single participant possesses the complete model. To address this, we propose RISE, a Robust model Intellectual property protection scheme using client-Server watermark Embedding for SFL. Specifically, RISE adopts an asymmetric client-server watermarking design: the server embeds feature-based watermarks through a loss regularization term, while clients embed backdoor-based watermarks by injecting predefined trigger samples into private datasets. This co-embedding strategy enables both clients and the server to verify model ownership. Experimental results on standard datasets and multiple network architectures show that RISE achieves over $95\%$ watermark detection rate ($p-value \lt 0.03$) across most settings. It exhibits no mutual interference between client- and server-side watermarks and remains robust against common removal attacks.
Key Contributions
- RISE: an asymmetric co-embedding scheme where the server uses feature-based loss regularization watermarks and clients use backdoor trigger-based watermarks, enabling both parties to verify ownership of a jointly trained SFL model
- Demonstrates no mutual interference between client-side and server-side watermarks across multiple architectures
- Achieves >95% watermark detection rate (p-value < 0.03) with robustness against common watermark removal attacks
🛡️ Threat Analysis
RISE embeds watermarks directly INTO the model weights/behavior (server via feature-based loss regularization, clients via backdoor triggers) to prove model ownership — this is a model IP protection defense against model theft, matching ML05 exactly. The backdoor mechanism is used as a watermarking vehicle, not as an attack.