Yihan Wu

h-index: 9 346 citations 27 papers (total)

Papers in Database (5)

defense arXiv Sep 28, 2025 · Sep 2025

An Ensemble Framework for Unbiased Language Model Watermarking

Yihan Wu, Ruibo Chen, Georgios Milis et al. · College Park

Ensemble framework stacking multiple unbiased watermark keys to improve LLM text provenance detection and paraphrase-attack resistance

Output Integrity Attack nlp
3 citations PDF
benchmark arXiv Sep 28, 2025 · Sep 2025

Analyzing and Evaluating Unbiased Language Model Watermark

Yihan Wu, Xuehao Cui, Ruibo Chen et al. · College Park

Benchmark for evaluating LLM text watermarks across unbiasedness, detectability, and robustness axes with impossibility proofs

Output Integrity Attack nlp
3 citations PDF
defense arXiv Sep 29, 2025 · Sep 2025

Model Correlation Detection via Random Selection Probing

Ruibo Chen, Sheng Zhang, Yihan Wu et al. · College Park · National University of Singapore

Detects LLM/VLM model lineage via adversarial prefix transferability and hypothesis testing, producing principled p-values for model IP protection

Model Theft Model Theft nlpvisionmultimodal
1 citations PDF
defense arXiv Feb 12, 2026 · 7w ago

More Haste, Less Speed: Weaker Single-Layer Watermark Improves Distortion-Free Watermark Ensembles

Ruibo Chen, Yihan Wu, Xuehao Cui et al. · University of Maryland · National University of Singapore

Proposes weaker single-layer watermarks in LLM ensembles to preserve entropy and improve AI-generated text detectability

Output Integrity Attack nlp
PDF
defense arXiv Feb 15, 2026 · 7w ago

MC$^2$Mark: Distortion-Free Multi-Bit Watermarking for Long Messages

Xuehao Cui, Ruibo Chen, Yihan Wu et al. · University of Maryland

Distortion-free multi-bit watermarking framework embeds long identifiers in LLM outputs for reliable AI text provenance tracing

Output Integrity Attack nlp
PDF