Ilham Wicaksono

benchmark arXiv Sep 5, 2025 · Sep 2025

Ilham Wicaksono, Zekun Wu, Rahul Patel et al. · Holistic AI · University College London

AgentSeer framework reveals LLM agent tool-calling suffers 24-60% higher jailbreak ASR than standalone model-level safety evaluation

Prompt Injection Excessive Agency nlp

Papers in Database (1)