Latest papers

2 papers
benchmark · arXiv · Feb 13, 2026

A Calibrated Memorization Index (MI) for Detecting Training Data Leakage in Generative MRI Models

Yash Deo, Yan Jia, Toni Lassila et al. · University of York · University of Leeds +3 more

Proposes calibrated memorization metrics using MRI foundation model features to detect training data duplication in generative MRI models

Model Inversion Attack · vision
PDF · Code
defense · arXiv · Oct 15, 2025

Risk-adaptive Activation Steering for Safe Multimodal Large Language Models

Jonghyun Park, Minhyuk Seo, Jonghyun Choi · Seoul National University · KU Leuven

Defends VLMs against image-embedded jailbreaks via risk-adaptive activation steering without iterative output adjustments

Input Manipulation Attack · Prompt Injection · multimodal · vision · nlp
1 citation · PDF