ML Security Papers

Latest papers

4 papers

attack arXiv Nov 10, 2025 · Nov 2025

Graph Representation-based Model Poisoning on the Heterogeneous Internet of Agents

Hanlin Cai, Houtianfu Wang, Haofan Dong et al. · University of Cambridge · CISTER Research Centre +2 more

Graph autoencoder-based Byzantine attack on federated LLM fine-tuning that evades cosine/distance-based defenses by mimicking benign update statistics

Data Poisoning Attack federated-learningnlp

1 citations PDF

attack arXiv Aug 21, 2025 · Aug 2025

Strategic Sample Selection for Improved Clean-Label Backdoor Attacks in Text Classification

Onur Alp Kirci, M. Emre Gursoy · Koç University

Proposes three model-uncertainty-based sample selection strategies that boost clean-label backdoor ASR in NLP text classifiers

Model Poisoning nlp

PDF

defense arXiv Aug 2, 2025 · Aug 2025

Defending Against Beta Poisoning Attacks in Machine Learning Models

Nilufer Gulciftci, M. Emre Gursoy · Acıbadem University · Koç University

Proposes four detection-based defenses against Beta Poisoning by exploiting spatial clustering properties of poisoned training samples

Data Poisoning Attack vision

PDF

attack arXiv Aug 2, 2025 · Aug 2025

Win-k: Improved Membership Inference Attacks on Small Language Models

Roya Arkhmammadova, Hosein Madadi Tamar, M. Emre Gursoy · Koç University

Proposes win-k, a sliding-window MIA that outperforms token-level attacks on small language models across all metrics

Membership Inference Attack nlp

PDF

Latest papers

Graph Representation-based Model Poisoning on the Heterogeneous Internet of Agents

Strategic Sample Selection for Improved Clean-Label Backdoor Attacks in Text Classification

Defending Against Beta Poisoning Attacks in Machine Learning Models

Win-k: Improved Membership Inference Attacks on Small Language Models

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue