Hui Liu

h-index: 2 · 75 citations · 4 papers (total)

Papers in Database (1)

benchmark arXiv Oct 8, 2025 · Oct 2025

PEAR: Planner-Executor Agent Robustness Benchmark

Shen Dong, Mingxuan Zhang, Pengfei He et al. · Michigan State University · Purdue University +1 more

Benchmark for evaluating the adversarial robustness of LLM planner-executor multi-agent systems against harmful-action, privacy, and denial-of-service (DoS) attacks

Prompt Injection · Excessive Agency · nlp