Peng Cheng

h-index: 0 0 citations 3 papers (total)

Papers in Database (1)

attack arXiv Feb 8, 2026 · 8w ago

Rethinking Latency Denial-of-Service: Attacking the LLM Serving Framework, Not the Model

Tianyi Wang, Huawei Fan, Yuanchao Shu et al. · Zhejiang University

System-level DoS attack on LLM serving frameworks exploiting KV cache exhaustion and scheduler preemption for 20-280x latency amplification

Model Denial of Service nlp
PDF