ML Security Papers

Latest papers

3 papers

attack arXiv Jan 28, 2026 · 9w ago

BadDet+: Robust Backdoor Attacks for Object Detection

Kealan Dunnett, Reza Arablouei, Dimity Miller et al. · Queensland University of Technology · Commonwealth Scientific and Industrial Research Organisation

Backdoor attack framework for object detection unifying misclassification and object disappearance attacks with improved physical-world robustness

Model Poisoning vision

PDF

defense arXiv Nov 7, 2025 · Nov 2025

DeepForgeSeal: Latent Space-Driven Semi-Fragile Watermarking for Deepfake Detection Using Multi-Agent Adversarial Reinforcement Learning

Tharindu Fernando, Clinton Fookes, Sridha Sridharan · Queensland University of Technology

Semi-fragile latent-space watermarking for deepfake detection using multi-agent adversarial RL to balance robustness vs. fragility

Output Integrity Attack visiongenerative

PDF

defense arXiv Sep 19, 2025 · Sep 2025

Backdoor Mitigation via Invertible Pruning Masks

Kealan Dunnett, Reza Arablouei, Dimity Miller et al. · Queensland University of Technology · CSIRO

Pruning-based backdoor defense using invertible masks and bi-level optimization to surgically remove backdoor behavior while preserving clean accuracy

Model Poisoning vision

PDF

Latest papers

BadDet+: Robust Backdoor Attacks for Object Detection

DeepForgeSeal: Latent Space-Driven Semi-Fragile Watermarking for Deepfake Detection Using Multi-Agent Adversarial Reinforcement Learning

Backdoor Mitigation via Invertible Pruning Masks

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue