Ruofan Wang

Papers in Database (1)

attack arXiv Aug 3, 2025 · Aug 2025

Simulated Ensemble Attack: Transferring Jailbreaks Across Fine-tuned Vision-Language Models

Ruofan Wang, Xin Wang, Yang Yao et al. · Fudan University · The University of Hong Kong

Grey-box adversarial image attack transfers jailbreaks to fine-tuned VLMs by simulating fine-tuning parameter trajectories on the base model

Input Manipulation Attack Prompt Injection visionmultimodalnlp
PDF