Yibo Yang

h-index: 2 · 8 citations · 6 papers (total)

Papers in Database (2)

attack · arXiv · Jan 11, 2026 · 12w ago

PDR: A Plug-and-Play Positional Decay Framework for LLM Pre-training Data Detection

Jinhan Liu, Yibo Yang, Ruiying Lu et al.

Positional decay reweighting boosts black-box membership inference on LLMs by amplifying high-entropy early token signals

Membership Inference Attack · nlp
defense · arXiv · Jan 12, 2026 · 12w ago

Safeguarding LLM Fine-tuning via Push-Pull Distributional Alignment

Haozhong Wang, Zhuo Li, Yibo Yang et al. · Jilin University

Defends LLM safety alignment during fine-tuning via Optimal Transport-based distributional reweighting away from harmful data

Transfer Learning Attack · Prompt Injection · nlp