Jonghyun Park

h-index: 1 2 citations 7 papers (total)

Papers in Database (1)

defense arXiv Oct 15, 2025 · Oct 2025

Risk-adaptive Activation Steering for Safe Multimodal Large Language Models

Jonghyun Park, Minhyuk Seo, Jonghyun Choi · Seoul National University · KU Leuven

Defends VLMs against image-embedded jailbreaks via risk-adaptive activation steering without iterative output adjustments

Input Manipulation Attack Prompt Injection multimodalvisionnlp
1 citations PDF