Ting Wang

h-index: 3 25 citations 9 papers (total)

Papers in Database (1)

benchmark arXiv Feb 18, 2026 · 6w ago

AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks

Tanqiu Jiang, Yuhui Wang, Jiacheng Liang et al. · Stony Brook University

Benchmark evaluating LLM agent susceptibility to five long-horizon attack types across 28 agentic environments and 644 test cases

Prompt Injection Excessive Agency nlp
1 citations PDF Code