Tianyu Lu

h-index: 3 21 citations 4 papers (total)

Papers in Database (2)

benchmark arXiv Oct 8, 2025 · Oct 2025

Code Agent can be an End-to-end System Hacker: Benchmarking Real-world Threats of Computer-use Agent

Weidi Luo, Qiming Zhang, Tianyu Lu et al. · University of Georgia · University of Wisconsin–Madison +6 more

Benchmarks LLM-powered agents' ability to execute end-to-end enterprise intrusions aligned with MITRE ATT&CK TTPs

Excessive Agency Prompt Injection nlpmultimodal
4 citations PDF Code
attack arXiv Sep 28, 2025 · Sep 2025

Quant Fever, Reasoning Blackholes, Schrodinger's Compliance, and More: Probing GPT-OSS-20B

Shuyi Lin, Tian Lu, Zikai Wang et al. · Northeastern University · Shanghai Jiao Tong University

Discovers five jailbreak failure modes in GPT-OSS-20B, introducing chain-oriented prompting and reasoning mirage attacks with 80% success rates

Prompt Injection nlp
PDF