Giovanni De Muri

h-index: 0 · 0 citations · 1 paper (total)

Papers in Database (1)

attack · arXiv · Oct 21, 2025

Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation

Giovanni De Muri, Mark Vero, Robin Staab et al. · ETH Zürich

Introduces T-MTB, a backdoor attack that survives LLM knowledge distillation by using frequent, composite trigger tokens.

Model Poisoning · Transfer Learning Attack · nlp