Hongbo Zhang

h-index: 3 · 118 citations · 8 papers (total)

Papers in Database (1)

attack · arXiv · Feb 12, 2026

Detecting RLVR Training Data via Structural Convergence of Reasoning

Hongbo Zhang, Yang Yue, Jianhao Yan et al. · Zhejiang University · Westlake University +1 more

Black-box membership inference attack on reasoning models trained with RLVR (reinforcement learning with verifiable rewards) that exploits the collapse of generation diversity to detect training data

Membership Inference Attack · nlp · reinforcement-learning