Yelong Shen

h-index: 2 27 citations 4 papers (total)

Papers in Database (1)

attack arXiv Oct 24, 2025 · Oct 2025

$δ$-STEAL: LLM Stealing Attack with Local Differential Privacy

Kieu Dang, Phung Lai, NhatHai Phan et al. · University at Albany · New Jersey Institute of Technology +2 more

LDP noise injection during fine-tuning steals LLM behavior from APIs while evading watermark detectors, achieving 96.95% attack success rate

Model Theft Output Integrity Attack Model Theft nlp
2 citations PDF Code