Latest papers

1 papers
attack arXiv Nov 12, 2025 · Nov 2025

Hail to the Thief: Exploring Attacks and Defenses in Decentralised GRPO

Nikolay Blagoev, Oğuzhan Ersoy, Lydia Yiyu Chen · Gensyn · University of Neuchatel +1 more

First poisoning attack on decentralized LLM GRPO training via malicious completions, achieving 100% success in 50 iterations with proposed defenses reaching 100% stop rates

Data Poisoning Attack Model Poisoning Training Data Poisoning nlpreinforcement-learning
PDF