Haoyang Hong

Papers in Database (1)

attack arXiv Apr 11, 2026

When Can You Poison Rewards? A Tight Characterization of Reward Poisoning in Linear MDPs

Jose Efraim Aguilar Escamilla, Haoyang Hong, Jiawei Li et al. · Oregon State University · University of Illinois Urbana-Champaign +2 more

Characterizes when reward poisoning attacks can force RL agents to adopt attacker-chosen policies in linear MDPs

Model Skewing reinforcement-learning