Maksym Andriushchenko

attack arXiv Oct 30, 2025

David Schmotz, Sahar Abdelnabi, Maksym Andriushchenko · ELLIS Institute Tübingen · MPI for Intelligent Systems +1 more

Exploits LLM Agent Skills plugin framework for trivial indirect prompt injection, exfiltrating files and bypassing Claude Code guardrails

Prompt Injection Insecure Plugin Design nlp

8 citations · 1 influential · PDF · Code

benchmark arXiv Feb 18, 2026 · 6w ago

Nivya Talokar, Ayush K Tarun, Murari Mandal et al. · Independent Researcher · EPFL +4 more

Benchmarks multi-turn, multilingual jailbreaking of LLM agents using a step-by-step illicit planning framework and novel time-to-jailbreak metrics

Prompt Injection Excessive Agency nlp

benchmark arXiv Feb 23, 2026 · 6w ago

David Schmotz, Luca Beurer-Kellner, Sahar Abdelnabi et al. · Max Planck Institute for Intelligent Systems · Snyk

Benchmarks LLM agent susceptibility to skill-file prompt injection, finding up to 80% attack success on frontier models

Prompt Injection Insecure Plugin Design nlp

Papers in Database (3)