Dung Nguyen

Papers in Database (1)

defense ACL 2026 (Findings) Apr 19, 2026 · 4w ago

Continual Safety Alignment via Gradient-Based Sample Selection

Thong Bach, Dung Nguyen, Thao Minh Le et al. · Deakin University · Pennsylvania State University

Gradient-based sample filtering during fine-tuning that preserves LLM safety alignment by removing high-gradient samples causing drift

Prompt Injection nlp
PDF