Guangsheng Bao

h-index: 11 769 citations 26 papers (total)

Papers in Database (3)

defense arXiv Jan 8, 2026 · 12w ago

When AI Settles Down: Late-Stage Stability as a Signature of AI-Generated Text Detection

Ke Sun, Guangsheng Bao, Han Cui et al. · Westlake University

Detects AI-generated text via late-stage token probability stabilization, achieving SOTA on EvoBench and MAGE benchmarks

Output Integrity Attack nlp
1 citations PDF
defense arXiv Feb 1, 2026 · 9w ago

Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection

Ke Sun, Guangsheng Bao, Han Cui et al. · Westlake University

Prototype-based routing framework dynamically selects the best surrogate model to detect LLM-generated text across unknown black-box sources

Output Integrity Attack nlp
PDF
attack arXiv Feb 12, 2026 · 7w ago

Detecting RLVR Training Data via Structural Convergence of Reasoning

Hongbo Zhang, Yang Yue, Jianhao Yan et al. · Zhejiang University · Westlake University +1 more

Black-box membership inference attack on RLVR-trained reasoning models exploiting generation diversity collapse to detect training data

Membership Inference Attack nlpreinforcement-learning
PDF Code