Latest papers

4 papers
attack The Fourteenth International C... Feb 28, 2026

MIDAS: Multi-Image Dispersion and Semantic Reconstruction for Jailbreaking MLLMs

Yilian Liu, Xiaojun Jia, Guoshun Nan et al. · Beijing University of Posts and Telecommunications · Nanyang Technological University +1 more

Jailbreaks MLLMs by dispersing harmful semantics across multiple images, forcing cross-image reasoning that defeats safety alignment

Prompt Injection vision nlp multimodal
PDF Code
defense arXiv Dec 16, 2025 · Dec 2025

FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos

Zhaolun Li, Jichang Li, Yinqi Cai et al. · Guilin University of Electronic Technology · Pengcheng Laboratory +3 more

Deepfake video detector that synthesizes forgery outliers via CLIP features to generalize across unseen manipulation types

Output Integrity Attack vision
3 citations PDF
defense arXiv Oct 29, 2025 · Oct 2025

DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis

Yinqi Cai, Jichang Li, Zhaolun Li et al. · Guilin University of Electronic Technology · Sun Yat-Sen University +2 more

Detects deepfake face videos across unseen manipulations via CLIP-ViT with local patch and global domain-augmentation modules

Output Integrity Attack vision generative
4 citations 1 influential PDF Code
attack arXiv Aug 16, 2025 · Aug 2025

Too Easily Fooled? Prompt Injection Breaks LLMs on Frustratingly Simple Multiple-Choice Questions

Xuyang Guo, Zekai Huang, Zhao Song et al. · Guilin University of Electronic Technology · The Ohio State University +1 more

Demonstrates that indirect prompt injection via PDF-hidden instructions fools LLMs even on trivial arithmetic judge tasks

Prompt Injection nlp
PDF