Ariane Teixeira

Papers in Database (1)

benchmark arXiv Feb 18, 2026 ยท 6w ago

Mind the GAP: Text Safety Does Not Transfer to Tool-Call Safety in LLM Agents

Arnold Cartagena, Ariane Teixeira

Benchmark revealing LLM agents refuse harmful requests in text while silently executing forbidden tool calls across six regulated domains

Excessive Agency Prompt Injection nlp
PDF Code