Zekun Wu

Papers in Database (1)

benchmark arXiv Sep 21, 2025 · Sep 2025

Mind the Gap: Comparing Model- vs Agentic-Level Red Teaming with Action-Graph Observability on GPT-OSS-20B

Ilham Wicaksono, Zekun Wu, Rahul Patel et al. · University College London · Holistic AI

Compares jailbreak attacks on standalone LLM vs. agentic loop, discovering agentic-only vulnerabilities with 24% higher ASR in tool-calling contexts

Prompt Injection Excessive Agency nlp
PDF