Latest papers

2 papers
defense · arXiv · Apr 19, 2026 · 4w ago

SafeAgent: A Runtime Protection Architecture for Agentic Systems

Hailin Liu, Eugene Ilyushin, Jie Ni et al. · Lomonosov Moscow State University · Central University

Runtime security architecture defending LLM agents against prompt injection by mediating tool-use actions with stateful risk reasoning

Prompt Injection · Insecure Plugin Design · Excessive Agency · nlp
benchmark · arXiv · Mar 11, 2026 · 10w ago

Probabilistic Verification of Voice Anti-Spoofing Models

Evgeny Kushnir, Alexandr Kozodaev, Dmitrii Korzh et al. · AXXX · HSE +5 more

Proposes PV-VASM, a black-box probabilistic framework that formally bounds misclassification risk of speech deepfake detectors against TTS and voice cloning attacks

Output Integrity Attack · audio