Michael Cunningham

h-index: 16 3,098 citations 63 papers (total)

Papers in Database (1)

defense arXiv Feb 18, 2026 ยท 6w ago

Privacy-Aware Split Inference with Speculative Decoding for Large Language Models over Wide-Area Networks

Michael Cunningham

Defends LLM inference input privacy by splitting transformers across trusted/untrusted GPUs, empirically quantifying inversion attack token recovery (59% vs 35%) across split depths

Model Inversion Attack Sensitive Information Disclosure nlp
PDF