Minhyuk Seo

h-index: 4 75 citations 11 papers (total)

Papers in Database (1)

defense arXiv Oct 15, 2025 · Oct 2025

Risk-adaptive Activation Steering for Safe Multimodal Large Language Models

Jonghyun Park, Minhyuk Seo, Jonghyun Choi · Seoul National University · KU Leuven

Defends VLMs against image-embedded jailbreaks via risk-adaptive activation steering without iterative output adjustments

Input Manipulation Attack Prompt Injection multimodalvisionnlp
1 citations PDF