defense arXiv Nov 22, 2025
Junrui Zhang, Xinyu Zhao, Jie Peng et al. · University of Science & Technology of China · University of North Carolina at Chapel Hill +1 more
Adversarial training defense that quantifies per-modality vulnerability to selectively harden multimodal models against adversarial attacks
Input Manipulation Attack multimodal
Multimodal learning has shown significant advantages on various tasks by integrating multiple modalities. However, the interdependencies among modalities increase the susceptibility of multimodal models to adversarial attacks. Existing methods mainly focus on attacking specific modalities or indiscriminately attacking all modalities. In this paper, we find that these approaches ignore the differing contributions of modalities to final robustness, resulting in suboptimal robustness. To bridge this gap, we introduce Vulnerability-Aware Robust Multimodal Adversarial Training (VARMAT), a probe-in-training adversarial training method that improves multimodal robustness by identifying the vulnerability of each modality. Specifically, VARMAT first explicitly quantifies each modality's vulnerability, grounded in a first-order approximation of the attack objective (Probe). It then applies a targeted regularization term that penalizes modalities with high vulnerability, guiding robust learning while maintaining task accuracy (Training). We demonstrate the enhanced robustness of our method across multiple multimodal datasets involving diverse modalities, achieving robustness improvements of {12.73%, 22.21%, 11.19%} on three multimodal datasets and revealing a significant blind spot in multimodal adversarial training.
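The abstract's "Probe" step can be illustrated with a minimal sketch: under a first-order approximation, the gradient norm of the loss with respect to each modality's input bounds how much a small perturbation to that modality can raise the loss, so it serves as a vulnerability score. The function names, the finite-difference gradient (a stand-in for autograd), and the toy loss below are all illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def vulnerability_scores(loss_fn, inputs, eps=1e-4):
    """Hypothetical probe: score each modality by the norm of the loss
    gradient w.r.t. that modality's input (first-order approximation of
    the attack objective), estimated here by central finite differences."""
    scores = {}
    for m, x in inputs.items():
        grad = np.zeros_like(x)
        for i in range(x.size):
            d = np.zeros_like(x)
            d.flat[i] = eps
            grad.flat[i] = (loss_fn({**inputs, m: x + d})
                            - loss_fn({**inputs, m: x - d})) / (2 * eps)
        scores[m] = np.linalg.norm(grad)
    total = sum(scores.values())
    # Normalize so scores sum to 1; the high-score modality would receive
    # the stronger regularization penalty in the "Training" step.
    return {m: s / total for m, s in scores.items()}

# Toy loss where the "image" modality dominates: its per-element gradient
# is 10x that of "text", so it should be flagged as more vulnerable.
loss = lambda z: 10.0 * z["image"].sum() + 1.0 * z["text"].sum()
scores = vulnerability_scores(loss, {"image": np.zeros(3), "text": np.zeros(3)})
```

In a full training loop, these per-modality scores would weight adversarial regularization terms so the most vulnerable modality is hardened hardest.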
multimodal transformer University of Science & Technology of China · University of North Carolina at Chapel Hill · Hefei Comprehensive National Science Center
defense arXiv Oct 19, 2025
Pingzhi Li, Morris Yu-Chao Huang, Zhen Tan et al. · UNC-Chapel Hill · Arizona State University +4 more
Detects LLM knowledge distillation (model theft) by fingerprinting MoE expert routing patterns in both white-box and black-box settings
Model Theft nlp
Knowledge Distillation (KD) accelerates training of large language models (LLMs) but poses risks to intellectual property protection and LLM diversity. Existing KD detection methods based on self-identity or output similarity can be easily evaded through prompt engineering. We present a KD detection framework effective in both white-box and black-box settings by exploiting an overlooked signal: the transfer of MoE "structural habits", especially internal routing patterns. Our approach analyzes how different experts specialize and collaborate across various inputs, creating distinctive fingerprints that persist through the distillation process. To extend beyond the white-box setup and MoE architectures, we further propose Shadow-MoE, a black-box method that constructs proxy MoE representations via auxiliary distillation to compare these patterns between arbitrary model pairs. We establish a comprehensive, reproducible benchmark that offers diverse distilled checkpoints and an extensible framework to facilitate future research. Extensive experiments demonstrate >94% detection accuracy across various scenarios and strong robustness to prompt-based evasion, outperforming existing baselines while highlighting the transfer of structural habits in LLMs.
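The fingerprinting idea can be sketched minimally: summarize a model's routing behavior over a fixed probe set as a distribution over experts, then compare two models' distributions with a divergence measure. Everything below is an illustrative assumption, not the paper's method; Jensen-Shannon divergence stands in for whatever similarity metric the authors actually use, and the counts are toy data.

```python
import numpy as np

def routing_fingerprint(routing_counts):
    """Hypothetical fingerprint: normalize per-probe expert-selection
    counts into a distribution over experts."""
    counts = np.asarray(routing_counts, dtype=float)
    return counts / counts.sum()

def fingerprint_similarity(p, q, eps=1e-12):
    """Jensen-Shannon divergence between two routing fingerprints
    (lower = more similar routing habits; 0 for identical models)."""
    p, q = np.asarray(p) + eps, np.asarray(q) + eps
    p, q = p / p.sum(), q / q.sum()
    m = 0.5 * (p + q)
    kl = lambda a, b: np.sum(a * np.log(a / b))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Toy expert-selection counts over a shared probe set (4 experts).
teacher   = routing_fingerprint([40, 30, 20, 10])
student   = routing_fingerprint([38, 31, 21, 10])   # distilled: similar habits
unrelated = routing_fingerprint([10, 10, 10, 70])   # independently trained
```

A distilled student inherits the teacher's routing habits, so its fingerprint divergence from the teacher should be much smaller than an unrelated model's; in the black-box Shadow-MoE setting, these fingerprints would come from proxy MoE models built via auxiliary distillation rather than direct router access.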
llm transformer UNC-Chapel Hill · Arizona State University · Individual Contributor +3 more