Guangyu Shen

h-index: 21 · 1,319 citations · 57 papers (total)

Papers in Database (2)

defense arXiv Oct 5, 2025

From Poisoned to Aware: Fostering Backdoor Self-Awareness in LLMs

Guangyu Shen, Siyuan Cheng, Xiangzhe Xu et al. · Purdue University · Columbia University

Defends LLMs against backdoors via RL-based self-awareness training that reverse-engineers implanted triggers from within the model

Model Poisoning nlp

survey arXiv Feb 6, 2026

Trojans in Artificial Intelligence (TrojAI) Final Report

Kristopher W. Reese, Taylor Kulp-McDowall, Michael Majurski et al. · IARPA · NIST +13 more

Surveys findings from the multi-year IARPA TrojAI program on AI backdoor detection via weight analysis and trigger inversion

Model Poisoning vision nlp