Latest papers

1 papers
benchmark arXiv Apr 27, 2026 · 24d ago

GAMMAF: A Common Framework for Graph-Based Anomaly Monitoring Benchmarking in LLM Multi-Agent Systems

Pablo Mateo-Torrejón, Alfonso Sánchez-Macián · University Carlos III of Madrid

Benchmarking framework for evaluating graph-based defenses against prompt injection and adversarial agents in LLM multi-agent systems

Prompt Injection Excessive Agency nlpgraph
PDF