Latest papers

2 papers
defense arXiv Jan 22, 2026 · 10w ago

NOIR: Privacy-Preserving Generation of Code with Open-Source LLMs

Khoa Nguyen, Khiem Ton, NhatHai Phan et al. · New Jersey Institute of Technology · Hamad Bin Khalifa University +2 more

Defends LLM code generation prompts from cloud reconstruction via embedding-level local differential privacy and a randomized tokenizer

Model Inversion Attack Sensitive Information Disclosure nlp
1 citations 1 influentialPDF
attack arXiv Oct 24, 2025 · Oct 2025

$δ$-STEAL: LLM Stealing Attack with Local Differential Privacy

Kieu Dang, Phung Lai, NhatHai Phan et al. · University at Albany · New Jersey Institute of Technology +2 more

LDP noise injection during fine-tuning steals LLM behavior from APIs while evading watermark detectors, achieving 96.95% attack success rate

Model Theft Output Integrity Attack Model Theft nlp
2 citations PDF Code