Yongil Kim

attack · arXiv · Aug 11, 2025

Yerin Hwang, Dongryeol Lee, Taegwan Kang et al. · Seoul National University · LG AI Research

Embeds Aristotelian persuasion techniques in responses to manipulate LLM judges into inflating their scores on incorrect math solutions by up to 8%.

Prompt Injection · nlp

Papers in Database (1)