Latest papers

2 papers
benchmark · arXiv · Nov 14, 2025

Exposing Weak Links in Multi-Agent Systems under Adversarial Prompting

Nirmit Arora, Sathvik Joel, Ishan Kavathekar et al. · Microsoft Research · International Institute of Information Technology +1 more

Benchmarks adversarial-prompt vulnerabilities across five multi-agent LLM architectures using a new evaluation framework and a diagnostic metric

Prompt Injection · Excessive Agency · nlp
2 citations
benchmark · arXiv · Sep 23, 2025

Stability and Generalization of Adversarial Diffusion Training

Hesam Hosseini, Ying Cao, Ali H. Sayed · École Polytechnique Fédérale de Lausanne

Derives stability-based generalization bounds for adversarial training in decentralized diffusion networks, showing that robust overfitting worsens as the perturbation radius and the number of training iterations increase

Input Manipulation Attack · federated-learning