Latest papers

4 papers
attack arXiv Mar 10, 2026

Removing the Trigger, Not the Backdoor: Alternative Triggers and Latent Backdoors

Gorka Abad, Ermes Franch, Stefanos Koffas et al. · University of Bergen · Delft University of Technology +2 more

Proves backdoor-trained models stay exploitable via alternative triggers even after defenses neutralize the original training trigger

Model Poisoning vision
PDF
attack arXiv Jan 6, 2026

Quality Degradation Attack in Synthetic Data

Qinyi Liu, Dong Liu, Farhad Vadiee et al. · University of Bergen · Delft University of Technology

Attacks synthetic data generators via label flipping and feature interventions, substantially degrading downstream predictive quality

Data Poisoning Attack tabular generative
PDF
defense arXiv Dec 19, 2025

AdaptPrompt: Parameter-Efficient Adaptation of VLMs for Generalizable Deepfake Detection

Yichen Jiang, Mohammed Talha Alam, Sohail Ahmed Khan et al. · University of Waterloo · MBZUAI +1 more

Adapts CLIP with prompt tuning and visual adapters to detect GAN and diffusion deepfakes across 25 diverse test sets

Output Integrity Attack vision
PDF
survey arXiv Nov 17, 2025

SoK: The Last Line of Defense: On Backdoor Defense Evaluation

Gorka Abad, Marina Krček, Stefanos Koffas et al. · University of Bergen · Radboud University +3 more

Surveys 183 backdoor defense papers revealing critical evaluation inconsistencies and proposing standardized assessment recommendations

Model Poisoning vision
1 citation PDF