Latest papers

3 papers
attack arXiv Dec 18, 2025 · Dec 2025

Hacking Neural Evaluation Metrics with Single Hub Text

Hiroyuki Deguchi, Katsuki Chousa, Yusuke Sakai · NTT · Nara Institute of Science and Technology

Adversarial hub text attack fools COMET translation metric into assigning high scores regardless of source or reference

Input Manipulation Attack nlp
PDF
benchmark arXiv Oct 8, 2025 · Oct 2025

Exposing Citation Vulnerabilities in Generative Engines

Riku Mochizuki, Shusuke Komatsu, Souta Noguchi et al. · QueryLift Inc. · Keio University +1 more

Evaluates generative engine vulnerability to indirect web-poisoning attacks by measuring citation source credibility and content-injection barriers

Prompt Injection nlp
PDF
attack EMNLP Sep 30, 2025 · Sep 2025

Reliability Crisis of Reference-free Metrics for Grammatical Error Correction

Takumi Goto, Yusuke Sakai, Taro Watanabe · Nara Institute of Science and Technology

Adversarial attacks on ML-based GEC evaluation metrics exploit metric-specific vulnerabilities to score above state-of-the-art systems

Input Manipulation Attack Prompt Injection nlp
PDF Code