Latest papers

211 papers
defense arXiv Apr 2, 2026 · 4d ago

Combating Data Laundering in LLM Training

Muxing Li, Zesheng Ye, Sharon Li et al. · University of Melbourne · University of Wisconsin-Madison

Detects unauthorized LLM training data use even when original data has been laundered through style transformations

Membership Inference Attack Sensitive Information Disclosure nlp
PDF
attack arXiv Apr 1, 2026 · 5d ago

G-Drift MIA: Membership Inference via Gradient-Induced Feature Drift in LLMs

Ravi Ranjan, Utkarsh Grover, Xiaomin Lin et al. · Florida International University · University of South Florida

White-box membership inference attack using gradient-induced feature drift, outperforming confidence-based and reference-based MIAs on LLMs

Membership Inference Attack nlp
PDF
attack arXiv Apr 1, 2026 · 5d ago

AutoMIA: Improved Baselines for Membership Inference Attack via Agentic Self-Exploration

Ruhao Liu, Weiqi Huang, Qi Li et al. · National University of Singapore

Agentic framework that automates membership inference attacks through self-exploration and strategy evolution, outperforming handcrafted baselines

Membership Inference Attack
PDF Code
attack arXiv Apr 1, 2026 · 5d ago

SERSEM: Selective Entropy-Weighted Scoring for Membership Inference in Code Language Models

Kıvanç Kuzey Dikici, Serdar Kara, Semih Çağlar et al. · Bilkent University

White-box membership inference attack on code LLMs using AST-weighted entropy scoring to detect memorized training data

Membership Inference Attack nlp
PDF
attack arXiv Mar 30, 2026 · 7d ago

\texttt{ReproMIA}: A Comprehensive Analysis of Model Reprogramming for Proactive Membership Inference Attacks

Chihan Huang, Huaijin Wang, Shuai Wang · HKUST

Novel membership inference attack using model reprogramming to amplify privacy leakage signals across LLMs, diffusion models, and classifiers

Membership Inference Attack nlpvisiongenerative
PDF
attack arXiv Mar 30, 2026 · 7d ago

Membership Inference Attacks against Large Audio Language Models

Jia-Kai Dong, Yu-Xiang Lin, Hung-Yi Lee · National Taiwan University · NTU Artificial Intelligence Center of Research Excellence

First systematic membership inference attack evaluation of audio language models, revealing cross-modal memorization from speaker-text binding

Membership Inference Attack audiomultimodalnlp
PDF
benchmark arXiv Mar 27, 2026 · 10d ago

SALMUBench: A Benchmark for Sensitive Association-Level Multimodal Unlearning

Cai Selvas-Sala, Lei Kang, Lluis Gomez · Computer Vision Center · Universitat Politècnica de Catalunya +1 more

Benchmark for evaluating multimodal unlearning methods with fine-grained metrics for forgetting efficacy and collateral damage on CLIP-like models

Membership Inference Attack multimodalvisionnlp
PDF
attack arXiv Mar 25, 2026 · 12d ago

Uncovering Memorization in Timeseries Imputation models: LBRM Membership Inference and its link to attribute Leakage

Faiz Taleb, Ivan Gazeau, Maryline Laurent · EDF · Télécom SudParis +1 more

Membership and attribute inference attacks on time-series imputation models, achieving 0.90 AUROC via reference-model comparison

Membership Inference Attack Model Inversion Attack timeseries
PDF
survey arXiv Mar 25, 2026 · 12d ago

AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective

Zhenyi Wang, Siyu Luan · University of Central Florida · University of Copenhagen

Unified taxonomy of ML security threats organizing attacks into data-to-data, data-to-model, model-to-data, and model-to-model categories

Input Manipulation Attack Data Poisoning Attack Model Inversion Attack Membership Inference Attack Model Theft Output Integrity Attack Model Poisoning Prompt Injection Sensitive Information Disclosure visionnlpmultimodal
PDF
survey arXiv Mar 24, 2026 · 13d ago

A Critical Review on the Effectiveness and Privacy Threats of Membership Inference Attacks

Najeeb Jebreel, David Sánchez, Josep Domingo-Ferrer · Universitat Rovira i Virgili

Critical analysis showing MIAs are weak privacy threats under realistic conditions, questioning the need for strong defenses like differential privacy

Membership Inference Attack visionnlp
PDF
benchmark arXiv Mar 19, 2026 · 18d ago

MIDST Challenge at SaTML 2025: Membership Inference over Diffusion-models-based Synthetic Tabular data

Masoumeh Shafieinejad, Xi He, Mahshid Alinoori et al. · Vector Institute · University of Waterloo +3 more

Competition evaluating membership inference attack resistance of diffusion models generating synthetic tabular data across white-box and black-box settings

Membership Inference Attack tabulargenerative
PDF Code
attack arXiv Mar 19, 2026 · 18d ago

Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents

Toan Tran, Olivera Kotevska, Li Xiong · Emory University · Oak Ridge National Laboratory

LLM-agent framework that automatically discovers novel membership inference attack strategies, achieving 0.18 AUC improvement over existing MIAs

Membership Inference Attack
PDF
attack arXiv Mar 15, 2026 · 22d ago

Membership Inference for Contrastive Pre-training Models with Text-only PII Queries

Ruoxi Cheng, Yizhong Ding, Hongyi Zhang et al. · Beijing Electronic Science and Technology Institute · Alibaba Group +2 more

Text-only membership inference attack on CLIP/CLAP models that detects PII memorization without exposing biometric data

Membership Inference Attack multimodalvisionaudionlp
PDF
defense arXiv Mar 13, 2026 · 24d ago

Learnability and Privacy Vulnerability are Entangled in a Few Critical Weights

Xingli Fang, Jung-Eun Kim · North Carolina State University

Defends against membership inference by identifying and rewinding only the small fraction of weights responsible for privacy leakage

Membership Inference Attack vision
PDF
attack arXiv Mar 12, 2026 · 25d ago

Exponential-Family Membership Inference: From LiRA and RMIA to BaVarIA

Rickard Brännvall · RISE Research Institutes of Sweden

Unifies LiRA, RMIA, and BASE under one framework, then proposes BaVarIA — a Bayesian variance MIA that outperforms both at low shadow-model budgets

Membership Inference Attack visiontabular
PDF
benchmark arXiv Mar 12, 2026 · 25d ago

Understanding Disclosure Risk in Differential Privacy with Applications to Noise Calibration and Auditing (Extended Version)

Patricia Guerra-Balboa, Annika Sauer, Héber H. Arcolezi et al. · Karlsruhe Institute of Technology · Inria Centre at the University Grenoble Alpes +1 more

Proposes reconstruction advantage metric unifying MIA, AIA, and DRA to tightly bound DP disclosure risk and improve auditing

Model Inversion Attack Membership Inference Attack tabular
PDF
attack arXiv Mar 11, 2026 · 26d ago

Quantifying Membership Disclosure Risk for Tabular Synthetic Data Using Kernel Density Estimators

Rajdeep Pathak, Sayantee Jana · Indian Institute of Technology Hyderabad

KDE-based membership inference attack on tabular synthetic data achieves higher F1 without costly shadow model training

Membership Inference Attack tabular
PDF Code
benchmark arXiv Mar 9, 2026 · 28d ago

Quantifying Memorization and Privacy Risks in Genomic Language Models

Alexander Nemecek, Wenbiao Li, Xiaoqian Jiang et al. · Case Western Reserve University · UTHealth +1 more

Multi-vector framework quantifying memorization, canary extraction, and membership inference risks across genomic language model architectures

Model Inversion Attack Membership Inference Attack nlp
PDF
benchmark arXiv Mar 8, 2026 · 29d ago

Revisiting the LiRA Membership Inference Attack Under Realistic Assumptions

Najeeb Jebreel, Mona Khalil, David Sánchez et al. · Universitat Rovira i Virgili

Re-evaluates LiRA membership inference attack under realistic conditions, showing it is far less effective than previously reported

Membership Inference Attack vision
PDF Code
attack arXiv Mar 5, 2026 · 4w ago

From Unfamiliar to Familiar: Detecting Pre-training Data via Gradient Deviations in Large Language Models

Ruiqi Zhang, Lingxiang Wang, Hainan Zhang et al. · Beihang University · Tsinghua University

Detects LLM pre-training data via gradient deviation scores capturing update magnitude, location, and concentration in FFN/Attention modules

Membership Inference Attack nlp
PDF
Loading more papers…