Jung-Eun Kim

Papers in Database (2)

defense arXiv Mar 13, 2026 · 24d ago

Purifying Generative LLMs from Backdoors without Prior Knowledge or Clean Reference

Jianwei Li, Jung-Eun Kim · North Carolina State University

Backdoor removal for instruction-tuned LLMs using synthetic backdoor variants to identify shared malicious components without trigger knowledge

Model Poisoning nlp
PDF
defense arXiv Mar 13, 2026 · 24d ago

Learnability and Privacy Vulnerability are Entangled in a Few Critical Weights

Xingli Fang, Jung-Eun Kim · North Carolina State University

Defends against membership inference by identifying and rewinding only the small fraction of weights responsible for privacy leakage

Membership Inference Attack vision
PDF