Latest papers

6 papers
survey arXiv Mar 23, 2026 · 14d ago

Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks

Yanming Mu, Hao Hu, Feiyang Li et al. · State Key Laboratory of Mathematical Engineering and Advanced Computing · Information Engineering University +2 more

First end-to-end survey mapping RAG security threats, defenses, and benchmarks across the entire pipeline

Prompt Injection Training Data Poisoning Sensitive Information Disclosure nlp
PDF
defense arXiv Jan 29, 2026 · 9w ago

Mining Forgery Traces from Reconstruction Error: A Weakly Supervised Framework for Multimodal Deepfake Temporal Localization

Midou Guo, Qilin Yin, Wei Lu et al. · Sun Yat-Sen University · Alibaba Group +1 more

Weakly supervised deepfake temporal localization using MAE reconstruction errors and asymmetric contrastive loss on multimodal video

Output Integrity Attack visionaudiomultimodal
PDF
defense arXiv Jan 29, 2026 · 9w ago

Lossless Copyright Protection via Intrinsic Model Fingerprinting

Lingxiao Chen, Liqin Wang, Wei Lu et al. · Sun Yat-Sen University · State Key Laboratory of Mathematical Engineering and Advanced Computing

Fingerprints diffusion models via denoising trajectory manifolds to verify copyright in black-box API settings without model modification

Model Theft visiongenerative
PDF
defense arXiv Jan 28, 2026 · 9w ago

MARE: Multimodal Alignment and Reinforcement for Explainable Deepfake Detection via Vision-Language Models

Wenbo Xu, Wei Lu, Xiangyang Luo et al. · Sun Yat-Sen University · State Key Laboratory of Mathematical Engineering and Advanced Computing +1 more

Proposes VLM-based deepfake detector using RLHF and multimodal alignment rewards for explainable forgery reasoning and spatial localization

Output Integrity Attack visionmultimodal
PDF
defense arXiv Jan 21, 2026 · 10w ago

Safeguarding Facial Identity against Diffusion-based Face Swapping via Cascading Pathway Disruption

Liqin Wang, Qianyue Hu, Wei Lu et al. · Sun Yat-Sen University · State Key Laboratory of Mathematical Engineering and Advanced Computing

Adversarial perturbations that cascade-disrupt diffusion face-swapping pipelines by corrupting identity extraction and injection to prevent deepfakes

Input Manipulation Attack Output Integrity Attack visiongenerative
PDF
defense arXiv Aug 4, 2025 · Aug 2025

Weakly Supervised Multimodal Temporal Forgery Localization via Multitask Learning

Wenbo Xu, Wei Lu, Xiangyang Luo · Sun Yat-Sen University · State Key Laboratory of Mathematical Engineering and Advanced Computing

Proposes weakly supervised multimodal deepfake detector that temporally localizes forged video segments using only video-level labels

Output Integrity Attack multimodalvisionaudio
PDF