Tianyu Chen

h-index: 2 · 34 citations · 5 papers (total)

Papers in Database (2)

benchmark · arXiv · Feb 3, 2026 · 8w ago

LPS-Bench: Benchmarking Safety Awareness of Computer-Use Agents in Long-Horizon Planning under Benign and Adversarial Scenarios

Tianyu Chen, Chujia Hu, Ge Gao et al. · ShanghaiTech University · Shanghai Artificial Intelligence Laboratory

Benchmarks safety awareness of MCP-based LLM agents across 65 adversarial and benign long-horizon planning scenarios

Insecure Plugin Design · Excessive Agency · nlp
1 citation · 1 influential · PDF · Code

benchmark · arXiv · Feb 16, 2026 · 7w ago

A Trajectory-Based Safety Audit of Clawdbot (OpenClaw)

Tianyu Chen, Dongrui Liu, Xia Hu et al. · ShanghaiTech University · Shanghai Artificial Intelligence Laboratory

Trajectory-based safety audit of Clawdbot AI agent revealing jailbreak and excessive tool-action failures across 34 test cases

Prompt Injection · Excessive Agency · nlp
PDF · Code