ML Security Papers

Latest papers

5 papers

defense arXiv Feb 1, 2026 · 9w ago

Key Principles of Graph Machine Learning: Representation, Robustness, and Generalization

Yassine Abbahaddou, Céline Hudelot, Charlotte Laclau et al. · École Polytechnique · CentraleSupélec +4 more

Defends GNNs against adversarial graph perturbations via orthonormalization and noise-based techniques, alongside representation and generalization contributions

Input Manipulation Attack graph

PDF

attack arXiv Jan 13, 2026 · 11w ago

Double Strike: Breaking Approximation-Based Side-Channel Countermeasures for DNNs

Lorenzo Casalino, Maria Méndez Real, Jean-Christophe Prévotet et al. · CentraleSupélec · INRIA +7 more

Side-channel attack breaks MACPRUNING defense to recover 96–100% of DNN weights from embedded hardware implementations

Model Theft

PDF

benchmark EMNLP Oct 15, 2025

How Sampling Affects the Detectability of Machine-written texts: A Comprehensive Study

Matthieu Dubois, François Yvon, Pablo Piantanida · Sorbonne Université · CNRS +2 more

Benchmarks AI text detectors across 37 decoding configs, showing AUROC collapses from 0.99 to 0.01 with minor sampling changes

Output Integrity Attack nlp

2 citations PDF Code

defense arXiv Aug 22, 2025

ConceptGuard: Neuro-Symbolic Safety Guardrails via Sparse Interpretable Jailbreak Concepts

Darpan Aswal, Céline Hudelot · Université Paris-Saclay · CentraleSupélec

Defends LLMs against jailbreaks by using sparse autoencoders to identify interpretable internal activation concepts linked to attack themes

Prompt Injection nlp

PDF

defense arXiv Aug 10, 2025

Certifiably robust malware detectors by design

Pierre-Francois Gimenez, Sarath Sivaprasad, Mario Fritz · Univ Rennes · INRIA +3 more

Proposes a certifiably robust malware detection architecture and proves that every robust detector decomposes into a specific structure resistant to evasion attacks

Input Manipulation Attack

PDF
