Yilong Yang

h-index: 6 159 citations 18 papers (total)

Papers in Database (2)

attack arXiv Nov 18, 2025 · Nov 2025

GRPO Privacy Is at Risk: A Membership Inference Attack Against Reinforcement Learning With Verifiable Rewards

Yule Liu, Heyi Zhang, Jinyi Zheng et al. · The Hong Kong University of Science and Technology · Shanghai Jiao Tong University +2 more

First membership inference attack against RLVR-trained LLMs using behavioral divergence signals instead of memorization

Membership Inference Attack nlpmultimodalreinforcement-learning
1 citations PDF
attack arXiv Nov 12, 2025 · Nov 2025

Improving Sustainability of Adversarial Examples in Class-Incremental Learning

Taifeng Liu, Xinjing Liu, Liangqiu Dong et al. · Xidian University

Crafts targeted adversarial examples that persist through class-incremental learning model updates, outperforming baselines by 31%

Input Manipulation Attack vision
PDF Code