Alexey Zaytsev

h-index: 1 5 citations 3 papers (total)

Papers in Database (1)

attack arXiv Oct 15, 2025 · Oct 2025

Selective Adversarial Attacks on LLM Benchmarks

Ivan Dubrovsky, Anastasia Orlova, Illarion Iov et al. · ITMO University · Applied AI Institute

Selective word-level adversarial attacks on MMLU questions that degrade one target LLM's benchmark score while leaving competing models unaffected

Prompt Injection nlp
PDF