Alexey Zaytsev

attack arXiv Oct 15, 2025 · Oct 2025

Ivan Dubrovsky, Anastasia Orlova, Illarion Iov et al. · ITMO University · Applied AI Institute

Selective word-level adversarial attacks on MMLU questions that degrade one target LLM's benchmark score while leaving competing models unaffected

Prompt Injection nlp

Papers in Database (1)