Latest papers

2 papers
defense arXiv Dec 9, 2025 · Dec 2025

Improving the Sensitivity of Backdoor Detectors via Class Subspace Orthogonalization

Guangmingmei Yang, David J. Miller, George Kesidis · Penn State · Anomalee Inc.

Plug-and-play backdoor detector that suppresses intrinsic class features to isolate trigger signals, boosting sensitivity across existing detectors

Model Poisoning vision
PDF
defense arXiv Sep 19, 2025 · Sep 2025

Inverting Trojans in LLMs

Zhengxing Li, Guangmingmei Yang, Jayaram Raghuram et al. · Penn State · Anomalee Inc.

Defends LLMs against backdoor attacks by inverting triggers via discrete greedy search and implicit activation-space blacklisting

Model Poisoning nlp
PDF