Latest papers

2 papers
benchmark arXiv Nov 7, 2025 · Nov 2025

ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations

Amr Gomaa, Ahmed Salem, Sahar Abdelnabi · German Research Center for Artificial Intelligence · Microsoft +3 more

Benchmarks privacy leakage and prompt-injection-style attacks across 864 multi-turn agent-to-agent LLM conversations in three domains

Prompt Injection Sensitive Information Disclosure nlp
5 citations 2 influentialPDF Code
defense Proc. Interspeech 2025, 4553-4... Aug 29, 2025 · Aug 2025

Generalizable Audio Spoofing Detection using Non-Semantic Representations

Arnab Das, Yassine El Kheir, Carlos Franzreb et al. · German Research Center for Artificial Intelligence · Gretchen AI GmbH +1 more

Uses non-semantic TRILL/TRILLsson audio embeddings to build a generalizable audio deepfake detector that outperforms SOTA on out-of-domain data

Output Integrity Attack audio
PDF