ML Security Papers

Latest papers

4 papers

attack The Fourteenth International C... Feb 28, 2026

Yilian Liu, Xiaojun Jia, Guoshun Nan et al. · Beijing University of Posts and Telecommunications · Nanyang Technological University +1 more

Jailbreaks MLLMs by dispersing harmful semantics across multiple images, forcing cross-image reasoning that defeats safety alignment

Prompt Injection vision nlp multimodal

defense arXiv Dec 16, 2025 · Dec 2025

Zhaolun Li, Jichang Li, Yinqi Cai et al. · Guilin University of Electronic Technology · Pengcheng Laboratory +3 more

Deepfake video detector that synthesizes forgery outliers via CLIP features to generalize across unseen manipulation types

Output Integrity Attack vision

3 citations PDF

defense arXiv Oct 29, 2025 · Oct 2025

Yinqi Cai, Jichang Li, Zhaolun Li et al. · Guilin University of Electronic Technology · Sun Yat-Sen University +2 more

Detects deepfake face videos across unseen manipulations via CLIP-ViT with local patch and global domain-augmentation modules

Output Integrity Attack vision generative

4 citations · 1 influential · PDF · Code

attack arXiv Aug 16, 2025 · Aug 2025

Xuyang Guo, Zekai Huang, Zhao Song et al. · Guilin University of Electronic Technology · The Ohio State University +1 more

Demonstrates that indirect prompt injection via instructions hidden in PDFs fools LLMs even on trivial arithmetic judge tasks

Prompt Injection nlp