ML Security Papers

Latest papers

3 papers

benchmark arXiv Nov 14, 2025 · Nov 2025

Salima Lamsiyah, Saad Ezzini, Abdelkader El Mahdaouy et al. · University of Luxembourg · King Fahd University of Petroleum and Minerals +2 more

Introduces a 30K-sample shared-task benchmark for detecting LLM-generated text across news and academic domains

Output Integrity Attack nlp

1 citations PDF

defense NeurIPS Oct 26, 2025 · Oct 2025

Sofiane Ennadir, Johannes F. Lutzeyer, Michalis Vazirgiannis et al. · KTH Royal Institute of Technology · École Polytechnique +1 more

Defends GNNs against adversarial graph perturbations by theoretically linking weight initialization to robustness, achieving up to 50% improvement.

Input Manipulation Attack graph

4 citations PDF

attack arXiv Aug 12, 2025 · Aug 2025

Amine Andam, Jamal Bentahar, Mustapha Hedabou · Mohammed VI Polytechnic University · Khalifa University +1 more

Black-box observation perturbation attacks disrupt cooperative MARL via agent-view misalignment using only 1,000 samples

Input Manipulation Attack reinforcement-learning