Zihao Wang

h-index: 3 32 citations 6 papers (total)

Papers in Database (1)

defense arXiv Dec 1, 2025 · Dec 2025

DefenSee: Dissecting Threat from Sight and Text -- A Multi-View Defensive Pipeline for Multi-modal Jailbreaks

Zihao Wang, Kar Wai Fok, Vrizlynn L. L. Thing · ST Engineering

Defends VLMs against multi-modal jailbreaks by transcribing image variants and performing cross-modal consistency checks to flag harmful intent

Input Manipulation Attack Prompt Injection visionnlpmultimodal
PDF