Hui Liu

benchmark · arXiv · Oct 8, 2025

Shen Dong, Mingxuan Zhang, Pengfei He et al. · Michigan State University · Purdue University +1 more

Benchmark for evaluating adversarial robustness of LLM planner-executor multi-agent systems across harmful action, privacy, and DoS attacks

Prompt Injection · Excessive Agency · nlp

Papers in Database (1)