Latest papers

2 papers
attack arXiv Oct 3, 2025 · Oct 2025

Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain

Léo Boisvert, Abhay Puri, Chandra Kiran Reddy Evuru et al. · ServiceNow Research · Mila - Québec AI Institute +2 more

Backdoors injected via AI supply chain poisoning cause agents to leak confidential data with 80%+ success at 2% poison rate

Model Poisoning AI Supply Chain Attacks nlp
2 citations PDF
defense arXiv Sep 27, 2025 · Sep 2025

Robust Fine-Tuning from Non-Robust Pretrained Models: Mitigating Suboptimal Transfer With Adversarial Scheduling

Jonas Ngnawé, Maxime Heuillet, Sabyasachi Sahoo et al. · Université Laval · Mila +3 more

Proposes Epsilon-Scheduling to fix adversarial training collapse when robustly fine-tuning non-robust pretrained models

Input Manipulation Attack vision
PDF Code