Training data membership inference via Gaussian process meta-modeling: a post-hoc analysis approach

Membership inference attacks (MIAs) test whether a data point was part of a model's training set, posing serious privacy risks. Existing methods often depend on shadow models or heavy query access, which limits their practicality. We propose GP-MIA, an efficient and interpretable approach based on Gaussian process (GP) meta-modeling. Using post-hoc metrics such as accuracy, entropy, dataset statistics, and optional sensitivity features (e.g. gradients, NTK measures) from a single trained model, GP-MIA trains a GP classifier to distinguish members from non-members while providing calibrated uncertainty estimates. Experiments on synthetic data, real-world fraud detection data, CIFAR-10, and WikiText-2 show that GP-MIA achieves high accuracy and generalizability, offering a practical alternative to existing MIAs.
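To make the "post-hoc metrics" concrete, the following is a minimal sketch of how features such as correctness, confidence, entropy, and loss could be computed from a single trained model's predicted probabilities. The function name `posthoc_features`, the per-sample granularity, and the exact feature set are illustrative assumptions; the summary above does not specify how GP-MIA computes or aggregates its metrics.

```python
import numpy as np

def posthoc_features(probs, labels):
    """Per-sample post-hoc features from a trained model's outputs.

    probs:  (n_samples, n_classes) predicted class probabilities.
    labels: (n_samples,) integer ground-truth labels.
    Returns an (n_samples, 4) array: [correct, confidence, entropy, loss].
    NOTE: hypothetical feature set, chosen for illustration only.
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=int)
    eps = 1e-12  # guards against log(0)

    correct = (probs.argmax(axis=1) == labels).astype(float)
    confidence = probs.max(axis=1)
    entropy = -np.sum(probs * np.log(probs + eps), axis=1)
    true_prob = probs[np.arange(len(labels)), labels]
    loss = -np.log(true_prob + eps)  # cross-entropy on the true label
    return np.column_stack([correct, confidence, entropy, loss])

# Toy inputs (made up): one confident correct prediction, one uniform guess.
probs = np.array([[0.98, 0.01, 0.01],
                  [1 / 3, 1 / 3, 1 / 3]])
labels = np.array([0, 2])
features = posthoc_features(probs, labels)
```

Training members often receive lower entropy and loss than unseen points, which is the kind of signal a meta-model can learn from; how strong that signal is depends on the target model and the data.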

Key Contributions

GP-MIA: a Gaussian process-based membership inference attack using post-hoc metrics (accuracy, entropy, dataset statistics, optional gradient/NTK features) from a single trained model
Eliminates the need for shadow models or heavy query access while providing calibrated uncertainty estimates
Demonstrated effectiveness across diverse settings: synthetic data, fraud detection, CIFAR-10, and WikiText-2
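The idea of a GP meta-model that outputs both a membership score and an uncertainty can be sketched as follows. This is not the paper's implementation: it substitutes a simpler technique, GP regression on ±1 membership labels with an RBF kernel and a probit link (sometimes called least-squares classification), for a full GP classifier. The function names, kernel choice, hyperparameters, and synthetic two-feature data are all assumptions made for illustration.

```python
import numpy as np
from math import erf, sqrt

def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential kernel between the rows of a and b."""
    sq_dists = (np.sum(a**2, axis=1)[:, None]
                + np.sum(b**2, axis=1)[None, :]
                - 2.0 * a @ b.T)
    return np.exp(-0.5 * np.maximum(sq_dists, 0.0) / length_scale**2)

def gp_membership_scores(x_train, y_train, x_test, length_scale=1.0, noise=0.1):
    """GP regression on +/-1 labels, squashed to a membership probability.

    Returns (prob_member, latent_std) for each row of x_test.
    NOTE: a simplified stand-in for a GP classifier, not the paper's method.
    """
    targets = 2.0 * np.asarray(y_train, dtype=float) - 1.0  # {0,1} -> {-1,+1}
    k_train = rbf_kernel(x_train, x_train, length_scale)
    k_train = k_train + noise * np.eye(len(x_train))
    k_cross = rbf_kernel(x_test, x_train, length_scale)

    # Solve (K + noise*I) alpha = targets via a Cholesky factorisation.
    chol = np.linalg.cholesky(k_train)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, targets))
    mean = k_cross @ alpha

    # Latent predictive variance; the RBF prior variance k(x, x) is 1.
    v = np.linalg.solve(chol, k_cross.T)
    var = np.maximum(1.0 - np.sum(v**2, axis=0), 1e-12)

    # Probit link averaged over the Gaussian latent: Phi(mean / sqrt(1 + var)).
    z = mean / np.sqrt(1.0 + var)
    prob = 0.5 * (1.0 + np.array([erf(t / sqrt(2.0)) for t in z]))
    return prob, np.sqrt(var)

# Synthetic meta-features (made up): members cluster at low loss/entropy.
rng = np.random.default_rng(0)
members = rng.normal([0.2, 0.1], 0.1, size=(20, 2))
non_members = rng.normal([1.0, 0.8], 0.1, size=(20, 2))
x_train = np.vstack([members, non_members])
y_train = np.array([1] * 20 + [0] * 20)

# Query points: member-like, non-member-like, and far from all training data.
x_query = np.array([[0.2, 0.1], [1.0, 0.8], [10.0, 10.0]])
prob, std = gp_membership_scores(x_train, y_train, x_query, length_scale=0.5)
```

In this sketch the far-away query point falls back to a probability near 0.5 with a large latent standard deviation, which illustrates how a GP can flag inputs it has no evidence about. Whether such scores are well calibrated in practice would need to be checked empirically.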

🛡️ Threat Analysis

Membership Inference Attack

GP-MIA is a membership inference attack: it determines whether specific data points were included in a model's training set, which is the core definition of ML04. It proposes a novel GP-based classifier that uses post-hoc metrics (accuracy, entropy, gradients, NTK measures) as an alternative to shadow-model and LiRA-style attacks.

Details

Domains

vision, nlp, tabular

Model Types

cnn, transformer, traditional_ml

Threat Tags

black_box, inference_time

Datasets

CIFAR-10, WikiText-2

Applications

2025 · 2 citations

Similar Papers

PAC-Private Responses with Adversarial Composition

Toward Efficient Inference Attacks: Shadow Model Sharing via Mixture-of-Experts

Membership Inference Attack with Partial Features

A Critical Review on the Effectiveness and Privacy Threats of Membership Inference Attacks

Exponential-Family Membership Inference: From LiRA and RMIA to BaVarIA

Imitative Membership Inference Attack

Membership Inference Attacks with False Discovery Rate Control

Dual-View Inference Attack: Machine Unlearning Amplifies Privacy Exposure