Johannes Bjerva

Papers in Database (1)

benchmark arXiv Mar 2, 2026 · 5w ago

Characterizing Memorization in Diffusion Language Models: Generalized Extraction and Sampling Effects

Xiaoyu Luo, Wenrui Yu, Qiongxiu Li et al. · Aalborg University

Characterizes training data memorization in diffusion LMs via a generalized extraction framework, proving sampling resolution controls verbatim PII leakage

Model Inversion Attack Sensitive Information Disclosure nlpgenerative
PDF