Latest papers

3 papers
benchmark arXiv Nov 14, 2025

M-DAIGT: A Shared Task on Multi-Domain Detection of AI-Generated Text

Salima Lamsiyah, Saad Ezzini, Abdelkader El Mahdaouy et al. · University of Luxembourg · King Fahd University of Petroleum and Minerals +2 more

Introduces a 30K-sample shared-task benchmark for detecting LLM-generated text across news and academic domains

Output Integrity Attack nlp
1 citation PDF
defense NeurIPS Oct 26, 2025

If You Want to Be Robust, Be Wary of Initialization

Sofiane Ennadir, Johannes F. Lutzeyer, Michalis Vazirgiannis et al. · KTH Royal Institute of Technology · École Polytechnique +1 more

Defends GNNs against adversarial graph perturbations by theoretically linking weight initialization to robustness, achieving up to 50% improvement

Input Manipulation Attack graph
4 citations PDF
attack arXiv Aug 12, 2025

Constrained Black-Box Attacks Against Cooperative Multi-Agent Reinforcement Learning

Amine Andam, Jamal Bentahar, Mustapha Hedabou · Mohammed VI Polytechnic University · Khalifa University +1 more

Black-box observation perturbation attacks disrupt cooperative MARL via agent-view misalignment using only 1,000 samples

Input Manipulation Attack reinforcement-learning
PDF