Latest papers

2 papers
defense arXiv Feb 5, 2026 · 8w ago

Detecting Misbehaviors of Large Vision-Language Models by Evidential Uncertainty Quantification

Tao Huang, Rui Wang, Xiaofei Liu et al. · State Key Laboratory of Advanced Rail Autonomous Operation · Beijing Key Laboratory of Traffic Data Mining and Embodied Intelligence +2 more

Training-free uncertainty decomposition detects jailbreaks, adversarial inputs, hallucinations, and OOD failures in vision-language models

Input Manipulation Attack Prompt Injection visionnlpmultimodal
PDF Code
attack arXiv Aug 8, 2025 · Aug 2025

SAM Encoder Breach by Adversarial Simplicial Complex Triggers Downstream Model Failures

Yi Qin, Rui Wang, Tao Huang et al. · Beijing Jiaotong University · National Engineering Research Center of Rail Transportation Operation and Control System +2 more

Adversarial attack on SAM's encoder using simplicial complex geometry to craft highly transferable examples that break downstream vision models

Input Manipulation Attack vision
PDF