Latest papers

3 papers
defense arXiv Feb 6, 2026 · 8w ago

AEGIS: Adversarial Target-Guided Retention-Data-Free Robust Concept Erasure from Diffusion Models

Fengpeng Li, Kemou Li, Qizhou Wang et al. · University of Macau · King Abdullah University of Science and Technology +2 more

Defends diffusion model concept erasure against adversarial prompt reactivation attacks via semantic-center-targeting adversarial erasure targets and gradient projection

Input Manipulation Attack visiongenerative
PDF Code
defense arXiv Dec 15, 2025 · Dec 2025

Calibrating Uncertainty for Zero-Shot Adversarial CLIP

Wenjing lu, Zerui Tao, Dongping Zhang et al. · RIKEN Center for Advanced Intelligence Project · Shanghai Jiao Tong University +1 more

Defends CLIP against adversarial attacks by restoring calibrated uncertainty via Dirichlet distribution alignment during fine-tuning

Input Manipulation Attack visionmultimodal
PDF
attack arXiv Oct 15, 2025 · Oct 2025

Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models

Haochuan Xu, Yun Sing Koh, Shuhuai Huang et al. · The University of Auckland · King Abdullah University of Science and Technology +2 more

Model-agnostic adversarial patch attack disrupts cross-modal embedding alignment in Vision-Language-Action robots, causing task failures

Input Manipulation Attack visionmultimodal
6 citations PDF Code