Yuting Tan

h-index: 1 1 citations 3 papers (total)

Papers in Database (1)

attack arXiv Nov 16, 2025 · Nov 2025

The 'Sure' Trap: Multi-Scale Poisoning Analysis of Stealthy Compliance-Only Backdoors in Fine-Tuned Large Language Models

Yuting Tan, Yi Huang, Zhuo Li · hydrox.ai

Introduces compliance-only LLM backdoor using 'Sure' labels that generalize to harmful outputs when triggered at inference

Model Poisoning Data Poisoning Attack Training Data Poisoning nlp
PDF