Shiyu Liang

h-index: 1 2 citations 6 papers (total)

Papers in Database (1)

attack arXiv Sep 25, 2025 · Sep 2025

RLCracker: Exposing the Vulnerability of LLM Watermarks with Adaptive RL Attacks

Hanbo Huang, Yiran Zhang, Hao Zheng et al. · Shanghai Jiao Tong University · National University of Defense Technology

RL-based attack removes LLM text watermarks with 98.5% success using 100 training samples, defeating 10 watermarking schemes

Output Integrity Attack nlp
PDF