Yongil Kim

Papers in Database (1)

attack arXiv Aug 11, 2025 · Aug 2025

Can You Trick the Grader? Adversarial Persuasion of LLM Judges

Yerin Hwang, Dongryeol Lee, Taegwan Kang et al. · Seoul National University · LG AI Research

Embeds Aristotelian persuasion techniques in responses to manipulate LLM judges into inflating scores on incorrect math solutions by up to 8%

Prompt Injection nlp
PDF