Latest papers

1 papers
benchmark arXiv Nov 22, 2025 · Nov 2025

Beyond Jailbreak: Unveiling Risks in LLM Applications Arising from Blurred Capability Boundaries

Yunyi Zhang, Shibo Cui, Baojun Liu et al. · Tsinghua University · National University of Defense Technology +1 more

Discovers LLM apps routinely exceed intended capability boundaries, with 17 apps performing malicious tasks without any adversarial prompting

Excessive Agency Prompt Injection nlp
PDF