defense · arXiv · Oct 3, 2025
Jingqi Zhang, Ruibo Chen, Yingqing Yang et al. · National University of Singapore · University of Maryland · National Key Laboratory of Intelligent Automotive Safety Technology +1 more
Watermarks LLM fine-tuning datasets with distortion-free signals to enable black-box detection of copyrighted dataset usage
Output Integrity Attack nlp
Large Language Models (LLMs) are increasingly fine-tuned on smaller, domain-specific datasets to improve downstream performance. These datasets often contain proprietary or copyrighted material, raising the need for reliable safeguards against unauthorized use. Existing membership inference attacks (MIAs) and dataset-inference methods typically require access to internal signals such as logits, while current black-box approaches often rely on handcrafted prompts or a clean reference dataset for calibration, both of which limit practical applicability. Watermarking is a promising alternative, but prior techniques can degrade text quality or reduce task performance. We propose TRACE, a practical framework for fully black-box detection of copyrighted dataset usage in LLM fine-tuning. TRACE rewrites datasets with distortion-free watermarks guided by a private key, ensuring both text quality and downstream utility. At detection time, we exploit the radioactivity effect of fine-tuning on watermarked data and introduce an entropy-gated procedure that selectively scores high-uncertainty tokens, substantially amplifying detection power. Across diverse datasets and model families, TRACE consistently achieves statistically significant detection (p < 0.05), often with extremely strong statistical evidence. Furthermore, it supports multi-dataset attribution and remains robust even after continued pretraining on large non-watermarked corpora. These results establish TRACE as a practical route to reliable black-box verification of copyrighted dataset usage. We will make our code available at: https://github.com/NusIoraPrivacy/TRACE.
llm transformer
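A minimal sketch of the entropy-gated scoring idea described in the abstract, assuming a KGW-style keyed green/red token partition as a stand-in for TRACE's distortion-free rewriting scheme; the hash construction, entropy threshold, and function names are illustrative assumptions, not the paper's implementation:

```python
import hashlib
import math

def green_flag(prev_token_id: int, token_id: int, key: str, gamma: float = 0.5) -> bool:
    """Pseudo-randomly assign `token_id` to the 'green' set, seeded by the
    previous token and a private key (illustrative keyed hash partition)."""
    h = hashlib.sha256(f"{key}:{prev_token_id}:{token_id}".encode()).digest()
    return (int.from_bytes(h[:8], "big") / 2**64) < gamma

def entropy_gated_pvalue(token_ids, entropies, key, gamma=0.5, entropy_threshold=2.0):
    """Score only high-uncertainty positions and return a one-sided p-value
    (normal approximation to the binomial null of green-rate = gamma)."""
    hits, n = 0, 0
    for i in range(1, len(token_ids)):
        if entropies[i] < entropy_threshold:   # gate: skip low-entropy tokens
            continue
        n += 1
        hits += green_flag(token_ids[i - 1], token_ids[i], key, gamma)
    if n == 0:
        return 1.0
    z = (hits - gamma * n) / math.sqrt(gamma * (1 - gamma) * n)
    return 0.5 * math.erfc(z / math.sqrt(2))   # P(Z >= z) under the null
```

Gating on high-entropy positions discards tokens whose identity is essentially forced by context (and thus carries little watermark evidence), which is why it can raise detection power at a fixed text length.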
defense · arXiv · Sep 28, 2025
Yihan Wu, Ruibo Chen, Georgios Milis et al. · University of Maryland, College Park
Ensemble framework stacking multiple unbiased watermark keys to improve LLM text provenance detection and paraphrase-attack resistance
Output Integrity Attack nlp
As large language models become increasingly capable and widely deployed, verifying the provenance of machine-generated content is critical to ensuring trust, safety, and accountability. Watermarking techniques have emerged as a promising solution by embedding imperceptible statistical signals into the generation process. Among them, unbiased watermarking is particularly attractive due to its theoretical guarantee of preserving the language model's output distribution, thereby avoiding degradation in fluency or detectability through distributional shifts. However, existing unbiased watermarking schemes often suffer from weak detection power and limited robustness, especially under short text lengths or distributional perturbations. In this work, we propose ENS, a novel ensemble framework that enhances the detectability and robustness of logits-based unbiased watermarks while strictly preserving their unbiasedness. ENS sequentially composes multiple independent watermark instances, each governed by a distinct key, to amplify the watermark signal. We theoretically prove that the ensemble construction remains unbiased in expectation and demonstrate how it improves the signal-to-noise ratio for statistical detectors. Empirical evaluations on multiple LLM families show that ENS substantially reduces the number of tokens needed for reliable detection and increases resistance to smoothing and paraphrasing attacks without compromising generation quality.
llm transformer
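The core ensemble intuition — weak per-key evidence adding into a strong joint statistic — can be illustrated with a standard score-combination rule; this is a hedged sketch, not ENS's actual sequential composition, and the numbers below are toy values:

```python
import math

def stouffer_combine(z_scores):
    """Combine independent per-key detection z-scores (Stouffer's method):
    the combined statistic is N(0,1) under the null, so signal adds across
    keys while noise only grows like sqrt(m)."""
    m = len(z_scores)
    z = sum(z_scores) / math.sqrt(m)
    p = 0.5 * math.erfc(z / math.sqrt(2))  # one-sided p-value
    return z, p

# Toy illustration: three keys each carry a weak per-key signal (z ~= 1.2)
# that misses a 0.05 threshold alone, but the combined statistic clears it.
z, p = stouffer_combine([1.2, 1.1, 1.3])
print(f"combined z = {z:.2f}, p = {p:.4f}")
```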
benchmark · arXiv · Sep 28, 2025
Yihan Wu, Xuehao Cui, Ruibo Chen et al. · University of Maryland, College Park
Benchmark for evaluating LLM text watermarks across unbiasedness, detectability, and robustness axes with impossibility proofs
Output Integrity Attack nlp
Verifying the authenticity of AI-generated text has become increasingly important with the rapid advancement of large language models, and unbiased watermarking has emerged as a promising approach due to its ability to preserve output distribution without degrading quality. However, recent work reveals that unbiased watermarks can accumulate distributional bias over multiple generations and that existing robustness evaluations are inconsistent across studies. To address these issues, we introduce UWbench, the first open-source benchmark dedicated to the principled evaluation of unbiased watermarking methods. Our framework combines theoretical and empirical contributions: we propose a statistical metric to quantify multi-batch distribution drift, prove an impossibility result showing that no unbiased watermark can perfectly preserve the distribution under infinite queries, and develop a formal analysis of robustness against token-level modification attacks. Complementing this theory, we establish a three-axis evaluation protocol: unbiasedness, detectability, and robustness, and show that token modification attacks provide more stable robustness assessments than paraphrasing-based methods. Together, UWbench offers the community a standardized and reproducible platform for advancing the design and evaluation of unbiased watermarking algorithms.
llm transformer
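As a rough illustration of what a multi-batch drift measurement could look like, here is a simple empirical total-variation estimator over discrete generation outcomes; UWbench's actual drift metric and protocol may differ, so treat this purely as a sketch:

```python
from collections import Counter

def empirical_tv_distance(samples_a, samples_b):
    """Estimate total-variation distance between two empirical distributions
    over discrete outputs (e.g., first-token choices across many generations)."""
    ca, cb = Counter(samples_a), Counter(samples_b)
    na, nb = len(samples_a), len(samples_b)
    support = set(ca) | set(cb)
    return 0.5 * sum(abs(ca[x] / na - cb[x] / nb) for x in support)

def multi_batch_drift(batches_watermarked, batches_reference):
    """Track drift batch-by-batch: an unbiased watermark should keep this near
    the sampling-noise floor, while accumulated bias shows up as growth."""
    return [empirical_tv_distance(w, r)
            for w, r in zip(batches_watermarked, batches_reference)]
```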
defense · arXiv · Sep 29, 2025
Ruibo Chen, Sheng Zhang, Yihan Wu et al. · University of Maryland, College Park · National University of Singapore
Detects LLM/VLM model lineage via adversarial prefix transferability and hypothesis testing, producing principled p-values for model IP protection
Model Theft nlp vision multimodal
The growing prevalence of large language models (LLMs) and vision-language models (VLMs) has heightened the need for reliable techniques to determine whether a model has been fine-tuned from or is even identical to another. Existing similarity-based methods often require access to model parameters or produce heuristic scores without principled thresholds, limiting their applicability. We introduce Random Selection Probing (RSP), a hypothesis-testing framework that formulates model correlation detection as a statistical test. RSP optimizes textual or visual prefixes on a reference model for a random selection task and evaluates their transferability to a target model, producing rigorous p-values that quantify evidence of correlation. To mitigate false positives, RSP incorporates an unrelated baseline model to filter out generic, transferable features. We evaluate RSP across both LLMs and VLMs under diverse access conditions for reference models and test models. Experiments on fine-tuned and open-source models show that RSP consistently yields small p-values for related models while maintaining high p-values for unrelated ones. Extensive ablation studies further demonstrate the robustness of RSP. These results establish RSP as the first principled and general statistical framework for model correlation detection, enabling transparent and interpretable decisions in modern machine learning ecosystems.
llm vlm transformer
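To make the hypothesis-testing framing concrete, here is a sketch of how a transfer-based p-value could be computed under the null that the target model is unrelated and answers a K-way random-selection task at chance; the paper's exact test statistic, trial counts, and the unrelated-baseline filtering step are not reproduced here:

```python
from math import comb

def rsp_pvalue(num_correct, num_trials, num_choices):
    """One-sided exact binomial p-value: probability that an *unrelated* model,
    guessing at chance 1/num_choices on the K-way selection task, would agree
    with the reference-optimized prefix at least `num_correct` times."""
    p0 = 1.0 / num_choices
    return sum(comb(num_trials, k) * p0**k * (1 - p0)**(num_trials - k)
               for k in range(num_correct, num_trials + 1))

# Toy illustration: a prefix tuned on the reference model transfers to the
# target and selects the keyed answer 18 of 20 times on a 4-way task.
print(f"p = {rsp_pvalue(18, 20, 4):.2e}")   # tiny p-value -> evidence of lineage
```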