Yue Zhang

Papers in Database (2)

attack arXiv Feb 12, 2026 · 7w ago

Detecting RLVR Training Data via Structural Convergence of Reasoning

Hongbo Zhang, Yang Yue, Jianhao Yan et al. · Zhejiang University · Westlake University +1 more

Black-box membership inference attack on RLVR-trained reasoning models exploiting generation diversity collapse to detect training data

Membership Inference Attack nlpreinforcement-learning
PDF Code
defense arXiv Aug 3, 2025 · Aug 2025

AI-Generated Text is Non-Stationary: Detection via Temporal Tomography

Alva West, Yixuan Weng, Minjun Zhu et al. · Westlake University

Detects AI-generated text via wavelet-transformed token statistics, exploiting non-stationarity invisible to scalar-score detectors

Output Integrity Attack nlp
PDF Code