Li Zheng

Papers in Database (1)

defense Neural Networks 193(2026) 1080... Aug 30, 2025 · Aug 2025

Backdoor Samples Detection Based on Perturbation Discrepancy Consistency in Pre-trained Language Models

Zuquan Peng, Jianming Fu, Lixin Zou et al. · Wuhan University

Detects NLP backdoor samples by measuring perturbation discrepancy curvature in log probability space, no poisoned model access needed

Model Poisoning nlp
PDF