Godwin Badu-Marfo

h-index: 7 173 citations 25 papers (total)

Papers in Database (1)

benchmark arXiv Feb 12, 2026 · 7w ago

AgentLeak: A Full-Stack Benchmark for Privacy Leakage in Multi-Agent LLM Systems

Faouzi El Yagoubi, Godwin Badu-Marfo, Ranwa Al Mallah · Polytechnique Montréal

Benchmark revealing multi-agent LLM systems leak sensitive PII at 68.8% through inter-agent channels that output-only audits miss entirely

Sensitive Information Disclosure Excessive Agency nlp
PDF