Hanzhou Wu

h-index: 22 1,433 citations 117 papers (total)

Papers in Database (1)

attack arXiv Sep 23, 2025 · Sep 2025

Trigger Where It Hurts: Unveiling Hidden Backdoors through Sensitivity with Sensitron

Gejian Zhao, Hanzhou Wu, Xinpeng Zhang · Shanghai University

XAI-guided NLP backdoor attack using SHAP attribution to pinpoint vulnerable tokens and craft high-ASR triggers in language models

Model Poisoning nlp
PDF