defense 2026

Protecting Private Code in IDE Autocomplete using Differential Privacy

Evgeny Grigorenko 1, David Stanojević 1, David Ilić 1, Egor Bogomolov 1,2, Kostadin Cvejoski 1

0 citations · 44 references · arXiv


Published on arXiv (2601.22935)

Membership Inference Attack — OWASP ML Top 10: ML04

Sensitive Information Disclosure — OWASP LLM Top 10: LLM06

Key Finding

DP fine-tuning reduces membership inference attack AUC from 0.901 to 0.606 while maintaining utility comparable to a non-private model trained on 100x more data.

Novel technique introduced: DP-SGD fine-tuning (Mellum)

Modern Integrated Development Environments (IDEs) increasingly leverage Large Language Models (LLMs) to provide advanced features such as code autocomplete. While powerful, training these models on user-written code introduces significant privacy risks, making the models themselves a new type of data vulnerability. Malicious actors can exploit this by launching attacks to reconstruct sensitive training data or to infer whether a specific code snippet was used for training. This paper investigates Differential Privacy (DP) as a robust defense mechanism for training an LLM for Kotlin code completion. We fine-tune a Mellum model with DP and conduct a comprehensive evaluation of its privacy and utility. Our results demonstrate that DP provides a strong defense against Membership Inference Attacks (MIAs), reducing the attack's success rate to near-random (AUC from 0.901 to 0.606). Furthermore, we show that this privacy guarantee comes at minimal cost to model performance: the DP-trained model achieves utility scores comparable to its non-private counterpart even when trained on 100x less data. Our findings suggest that DP is a practical and effective solution for building private and trustworthy AI-powered IDE features.
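The defense rests on DP-SGD, whose core per-step mechanism is per-example gradient clipping followed by calibrated Gaussian noise; a privacy accountant then converts the clip norm and noise multiplier into an (ε, δ) guarantee. A minimal NumPy sketch of one such update — the function name, hyperparameter values, and plain-SGD setting are illustrative, not taken from the paper:

```python
import numpy as np

def dp_sgd_step(params, per_example_grads, clip_norm=1.0,
                noise_multiplier=1.1, lr=0.1, rng=None):
    """One DP-SGD update on a flat parameter vector.

    per_example_grads: array of shape (batch, dim), one gradient row per
    training example. Clipping bounds each example's influence; the Gaussian
    noise (scaled to noise_multiplier * clip_norm) is what a privacy
    accountant turns into a formal (epsilon, delta) guarantee.
    """
    if rng is None:
        rng = np.random.default_rng(0)
    # Scale each row down so its L2 norm is at most clip_norm.
    norms = np.linalg.norm(per_example_grads, axis=1, keepdims=True)
    factors = np.minimum(1.0, clip_norm / np.maximum(norms, 1e-12))
    clipped = per_example_grads * factors
    # Sum the clipped gradients, add isotropic Gaussian noise, then average.
    noise = rng.normal(0.0, noise_multiplier * clip_norm,
                       size=clipped.shape[1])
    noisy_mean = (clipped.sum(axis=0) + noise) / clipped.shape[0]
    return params - lr * noisy_mean

# Illustrative call: two examples, three parameters.
updated = dp_sgd_step(np.zeros(3),
                      np.array([[3.0, 4.0, 0.0], [0.1, 0.0, 0.0]]))
```

In practice a library such as Opacus handles the per-example gradient computation and the privacy accounting; the sketch only shows the clipping-plus-noise step that distinguishes DP-SGD from ordinary SGD.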


Key Contributions

  • First code completion LLM (Mellum fine-tune) trained with formal differential privacy guarantees for IDE integration
  • Empirical demonstration that DP reduces MIA success from AUC 0.901 to 0.606 (near random) with minimal utility loss
  • Shows that the DP-trained model achieves comparable utility scores even when trained on 100x less data than its non-private counterpart

🛡️ Threat Analysis

Membership Inference Attack

The paper's primary empirical evaluation targets Membership Inference Attacks: it quantitatively measures MIA success (AUC) before and after DP fine-tuning, showing a drop from 0.901 to 0.606 (near random). This is the core security contribution.
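A common black-box MIA of the kind scored by AUC is the loss-threshold attack: examples on which the model has unusually low loss are predicted to be training members, and the AUC of that score is the reported attack success. A minimal sketch, assuming per-example losses are already available (function name and values are illustrative, not the paper's attack implementation):

```python
import numpy as np

def mia_auc(member_losses, nonmember_losses):
    """AUC of a loss-threshold membership inference attack.

    Lower loss => more member-like, so the attack score is the negated loss.
    AUC is computed via the rank-sum (Mann-Whitney U) statistic: the
    probability that a random member scores higher than a random non-member.
    """
    scores = np.concatenate([-np.asarray(member_losses, dtype=float),
                             -np.asarray(nonmember_losses, dtype=float)])
    labels = np.concatenate([np.ones(len(member_losses)),
                             np.zeros(len(nonmember_losses))])
    # Rank all scores ascending (ties ignored in this sketch).
    ranks = np.empty(len(scores))
    ranks[scores.argsort()] = np.arange(1, len(scores) + 1)
    n_pos = labels.sum()
    n_neg = len(labels) - n_pos
    return (ranks[labels == 1].sum() - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)

# Clearly separated losses => the attack wins (AUC near 1).
leaky = mia_auc([0.1, 0.2], [0.8, 0.9])
# Overlapping losses => near-random attack (AUC near 0.5),
# which is the regime DP training is meant to force.
private = mia_auc([0.1, 0.9], [0.2, 0.8])
```

An AUC of 0.5 means the attacker's score carries no membership signal; the paper's 0.901 → 0.606 drop moves the attack most of the way toward that baseline.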


Details

Domains
nlp
Model Types
llm, transformer
Threat Tags
training_time, black_box
Datasets
Kotlin code repositories
Applications
code completion, ide autocomplete, kotlin code generation