Guorui Chen

h-index: 2 7 citations 2 papers (total)

Papers in Database (1)

defense EMNLP Nov 1, 2025 · Nov 2025

Reimagining Safety Alignment with An Image

Yifan Xia, Guorui Chen, Wenqian Yu et al. · Wuhan University · University of Oxford

Defends MLLMs against jailbreaks and over-refusal by optimizing an adversarial-style image prompt as a parameter-free safety alignment mechanism

Input Manipulation Attack Prompt Injection nlpmultimodalvision
2 citations 1 influentialPDF Code