Shi Feng

h-index: 6 650 citations 12 papers (total)

Papers in Database (1)

benchmark arXiv Dec 22, 2025 · Dec 2025

Gaming the Answer Matcher: Examining the Impact of Text Manipulation on Automated Judgment

Manas Khatore, Sumana Sridharan, Kevork Sulahian et al. · Algoverse · p-1.ai +1 more

Tests whether verbosity, hedging, and conflicting-answer injection can game LLM-based answer-matching evaluation systems

Prompt Injection nlp
PDF Code