Jiayu Lv

h-index: 1 11 citations 4 papers (total)

Papers in Database (1)

defense arXiv Dec 1, 2025 · Dec 2025

On the Tension Between Optimality and Adversarial Robustness in Policy Optimization

Haoran Li, Jiayu Lv, Congying Han et al. · University of Chinese Academy of Sciences · JD.com +2 more

Proposes BARPO, a bilevel RL framework that reconciles optimality and adversarial robustness by modulating adversary strength during training

Input Manipulation Attack reinforcement-learning
PDF