Latest papers

10 papers
defense arXiv Mar 28, 2026 · 11d ago

Diagnosing and Repairing Unsafe Channels in Vision-Language Models via Causal Discovery and Dual-Modal Safety Subspace Projection

Jinhu Fu, Yihang Lou, Qingyi Si et al. · Beijing University of Posts and Telecommunications · Chongqing University of Posts and Telecommunications +2 more

Identifies and repairs unsafe neural pathways in VLMs using causal mediation analysis and dual-modal safety subspace projection

Input Manipulation Attack Prompt Injection multimodalvisionnlp
PDF
defense arXiv Mar 10, 2026 · 29d ago

When Detectors Forget Forensics: Blocking Semantic Shortcuts for Generalizable AI-Generated Image Detection

Chao Shuai, Zhenguang Liu, Shaojing Fan et al. · Zhejiang University · National University of Singapore +1 more

Proposes GSD module to block semantic shortcuts in VFM-based detectors, improving generalization to unseen AI-generated image pipelines

Output Integrity Attack visiongenerative
PDF Code
defense arXiv Feb 28, 2026 · 5w ago

Diversity over Uniformity: Rethinking Representation in Generated Image Detection

Qinghui He, Haifeng Zhang, Qiao Qin et al. · Chongqing University of Posts and Telecommunications · Ltd. +1 more

Proposes anti-feature-collapse learning to diversify forgery cues in AI-generated image detectors, improving cross-model generalization

Output Integrity Attack visiongenerative
PDF Code
benchmark arXiv Jan 24, 2026 · 10w ago

OTI: A Model-free and Visually Interpretable Measure of Image Attackability

Jiaming Liang, Haowei Liu, Chi-Man Pun · University of Macau · Chongqing University of Posts and Telecommunications

Proposes OTI, a model-free texture-based metric for quantifying per-image adversarial vulnerability without model access

Input Manipulation Attack vision
PDF Code
defense arXiv Dec 15, 2025 · Dec 2025

CausalCLIP: Causally-Informed Feature Disentanglement and Filtering for Generalizable Detection of Generated Images

Bo Liu, Qiao Qin, Qinghui He · Chongqing University of Posts and Telecommunications · School of Artificial Intelligence +1 more

Proposes causal feature disentanglement in CLIP representations to generalize AI-generated image detection across unseen generative models

Output Integrity Attack vision
2 citations PDF
defense arXiv Dec 3, 2025 · Dec 2025

FeatureLens: A Highly Generalizable and Interpretable Framework for Detecting Adversarial Examples Based on Image Features

Zhigang Yang, Yuan Liu, Jiawei Zhang et al. · Chongqing University of Posts and Telecommunications · Chongqing University of Arts and Sciences

Lightweight adversarial example detector using 51-dim image features and shallow classifiers, generalizing across FGSM, PGD, CW, and DAmageNet attacks

Input Manipulation Attack vision
PDF
attack arXiv Dec 2, 2025 · Dec 2025

LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems

Yuanhe Zhang, Weiliu Wang, Zhenhong Zhou et al. · Beijing University of Posts and Telecommunications · Hangzhou Dianzi University +4 more

LeechHijack backdoors MCP tools to covertly parasitize LLM agent compute via runtime C2 channel, achieving 77% success undetected

Insecure Plugin Design nlp
1 citations PDF
defense International Journal of Compu... Nov 14, 2025 · Nov 2025

Unsupervised Robust Domain Adaptation: Paradigm, Theory and Algorithm

Fuxiang Huang, Xiaowei Fu, Shiyu Ye et al. · Chongqing University · Lingnan University +3 more

Defends unsupervised domain adaptation models against adversarial attacks via disentangled distillation post-training

Input Manipulation Attack vision
PDF
defense arXiv Sep 14, 2025 · Sep 2025

Make Identity Unextractable yet Perceptible: Synthesis-Based Privacy Protection for Subject Faces in Photos

Tao Wang, Yushu Zhang, Xiangli Xiao et al. · Nanjing University of Aeronautics and Astronautics · Jiangxi University of Finance and Economics +1 more

Synthesis-based anti-face-recognition defense generates perceptible yet identity-unextractable faces to defeat unauthorized FR systems

Input Manipulation Attack visiongenerative
PDF Code
defense arXiv Jan 9, 2025 · Jan 2025

A New Perspective on Privacy Protection in Federated Learning with Granular-Ball Computing

Guannan Lai, Yihui Feng, Xin Yang et al. · Southwestern University of Finance and Economics · Chongqing University of Posts and Telecommunications +1 more

Defends federated learning against gradient reconstruction attacks by transforming images into coarse-grained graph structures before training

Model Inversion Attack visionfederated-learninggraph
PDF Code