Latest papers

2 papers
benchmark · arXiv · Jan 21, 2026

Obscuring Data Contamination Through Translation: Evidence from Arabic Corpora

Chaymaa Abbas, Nour Shamaa, Mariette Awad · American University of Beirut

Shows Arabic translation conceals LLM benchmark contamination from standard detectors; proposes cross-lingual membership inference to expose it

Membership Inference Attack · nlp
benchmark · arXiv · Dec 15, 2025

On the Effectiveness of Membership Inference in Targeted Data Extraction from Large Language Models

Ali Al Sahili, Ali Chehab, Razane Tajeddine · American University of Beirut

Benchmarks membership inference attack (MIA) techniques integrated into LLM training-data extraction pipelines against the same techniques in standalone MIA evaluation settings

Membership Inference Attack · Model Inversion Attack · Sensitive Information Disclosure · nlp