Dingcheng Li

h-index: 1 · 3 citations · 3 papers (total)

Papers in Database (1)

defense · arXiv · Oct 6, 2025

Adversarial Reinforcement Learning for Large Language Model Agent Safety

Zizhao Wang, Dingcheng Li, Vaishakh Keshava et al. · Google · The University of Texas at Austin +2 more

Defends LLM tool-using agents from indirect prompt injection via adversarial RL co-training in a two-player zero-sum game

Prompt Injection · nlp · reinforcement-learning
3 citations