Paulina Seidl

h-index: 0 0 citations 1 papers (total)

Papers in Database (1)

benchmark arXiv Jan 24, 2026 · 10w ago

Unintended Memorization of Sensitive Information in Fine-Tuned Language Models

Marton Szep, Jorge Marin Ruiz, Georgios Kaissis et al. · Technical University of Munich · TUM University Hospital +1 more

Benchmarks PII extraction attacks and four defenses against unintended memorization in fine-tuned LLMs using black-box probes

Model Inversion Attack Sensitive Information Disclosure nlp
PDF Code