Decoupling Generalizability and Membership Privacy Risks in Neural Networks

A deep learning model usually has to sacrifice some utility when it acquires other abilities or characteristics, and privacy preservation is subject to the same trade-off. The loss disparity across defense approaches suggests that generalizability and privacy risks can be decoupled to maximize the privacy gain. In this paper, we identify that a model's generalization and privacy risks reside in different regions of deep neural network architectures. Based on these observations, we propose the Privacy-Preserving Training Principle (PPTP) to protect model components from privacy risks while minimizing the loss in generalizability. In extensive evaluations, our approach maintains model generalizability significantly better while enhancing privacy preservation.

Key Contributions

Identifies that generalization and membership privacy risks are localized in different architectural regions of deep neural networks
Proposes the Privacy-Preserving Training Principle (PPTP), which selectively protects high-risk model components to decouple privacy and utility
Demonstrates a significantly better utility-privacy trade-off than existing membership inference defenses

🛡️ Threat Analysis

Membership Inference Attack

The paper explicitly targets "membership privacy risks": the threat that an adversary determines whether a specific sample was in the training set. PPTP is a training-time defense against membership inference attacks that decouples this risk from generalizability.
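
To make the threat concrete, the following is a minimal sketch of a generic loss-threshold membership inference attack, which guesses that a sample was in the training set when the model's loss on it is low. It is not the attack or the evaluation protocol used in the paper. The function names, the threshold, and the loss values are all hypothetical and chosen only for illustration.

```python
def threshold_attack(losses, threshold):
    """Predict 'member' (True) for every sample whose loss is below the threshold."""
    return [loss < threshold for loss in losses]


def attack_accuracy(member_losses, nonmember_losses, threshold):
    """Balanced accuracy of the loss-threshold attack on known members and non-members."""
    member_preds = threshold_attack(member_losses, threshold)
    nonmember_preds = threshold_attack(nonmember_losses, threshold)
    # Fraction of true members correctly flagged as members.
    true_positive_rate = sum(member_preds) / len(member_preds)
    # Fraction of true non-members correctly flagged as non-members.
    true_negative_rate = sum(not p for p in nonmember_preds) / len(nonmember_preds)
    return 0.5 * (true_positive_rate + true_negative_rate)


# Hypothetical per-sample losses, invented for illustration only.
member_losses = [0.02, 0.05, 0.10, 0.40]
nonmember_losses = [0.30, 0.80, 1.20, 0.08]

accuracy = attack_accuracy(member_losses, nonmember_losses, threshold=0.25)
print(f"attack accuracy: {accuracy:.2f}")
```

A balanced accuracy near 0.5 means the attacker does no better than random guessing. A membership inference defense aims to push the attack toward that level while keeping test accuracy high.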

Details

Domains

vision

Model Types

cnn, transformer

Threat Tags

training_time

Applications

2025 3 cit.

Membership Inference Attack

73%

Similar Papers

Active Membership Inference Test (aMINT): Enhancing Model Auditability with Multi-Task Learning

Sequential Subspace Noise Injection Prevents Accuracy Collapse in Certified Unlearning

Toward Reliable Machine Unlearning: Theory, Algorithms, and Evaluation

Learnability and Privacy Vulnerability are Entangled in a Few Critical Weights

Statistical Roughness-Informed Machine Unlearning

Modeling Neural Networks with Privacy Using Neural Stochastic Differential Equations

Statistical MIA: Rethinking Membership Inference Attack for Reliable Unlearning Auditing

AdaMixup: A Dynamic Defense Framework for Membership Inference Attack Mitigation