Xin Chen

h-index: 1 1 citations 3 papers (total)

Papers in Database (1)

attack arXiv Feb 5, 2026 · 8w ago

Learning to Inject: Automated Prompt Injection via Reinforcement Learning

Xin Chen, Jie Zhang, Florian Tramèr · ETH Zürich

RL-trained 1.5B model generates universal, transferable prompt injection suffixes that compromise GPT, Claude, and Gemini agents

Prompt Injection nlp
PDF