defense 2025

Robust Client-Server Watermarking for Split Federated Learning

Jiaxiong Tang 1, Zhengchunmin Dai 1, Liantao Wu 1, Peng Sun 1, Honglong Chen 2, Zhenfu Cao 3

0 citations · 46 references · arXiv

α

Published on arXiv

2511.13598

Model Theft

OWASP ML Top 10 — ML05

Key Finding

RISE achieves over 95% watermark detection rate (p-value < 0.03) across most settings with no mutual interference between client and server watermarks and robustness against removal attacks.

RISE

Novel technique introduced


Split Federated Learning (SFL) is renowned for its privacy-preserving nature and low computational overhead among decentralized machine learning paradigms. In this framework, clients employ lightweight models to process private data locally and transmit intermediate outputs to a powerful server for further computation. However, SFL is a double-edged sword: while it enables edge computing and enhances privacy, it also introduces intellectual property ambiguity as both clients and the server jointly contribute to training. Existing watermarking techniques fail to protect both sides since no single participant possesses the complete model. To address this, we propose RISE, a Robust model Intellectual property protection scheme using client-Server watermark Embedding for SFL. Specifically, RISE adopts an asymmetric client-server watermarking design: the server embeds feature-based watermarks through a loss regularization term, while clients embed backdoor-based watermarks by injecting predefined trigger samples into private datasets. This co-embedding strategy enables both clients and the server to verify model ownership. Experimental results on standard datasets and multiple network architectures show that RISE achieves over $95\%$ watermark detection rate ($p-value \lt 0.03$) across most settings. It exhibits no mutual interference between client- and server-side watermarks and remains robust against common removal attacks.


Key Contributions

  • RISE: an asymmetric co-embedding scheme where the server uses feature-based loss regularization watermarks and clients use backdoor trigger-based watermarks, enabling both parties to verify ownership of a jointly trained SFL model
  • Demonstrates no mutual interference between client-side and server-side watermarks across multiple architectures
  • Achieves >95% watermark detection rate (p-value < 0.03) with robustness against common watermark removal attacks

🛡️ Threat Analysis

Model Theft

RISE embeds watermarks directly INTO the model weights/behavior (server via feature-based loss regularization, clients via backdoor triggers) to prove model ownership — this is a model IP protection defense against model theft, matching ML05 exactly. The backdoor mechanism is used as a watermarking vehicle, not as an attack.


Details

Domains
federated-learning
Model Types
federatedcnn
Threat Tags
training_timewhite_box
Datasets
CIFAR-10
Applications
split federated learningmodel ip protection