DRAINCODE: Stealthy Energy Consumption Attacks on Retrieval-Augmented Code Generation via Context Poisoning
Yanlin Wang 1, Jiadong Wu 1, Tianyue Jiang 1, Mingwei Liu 1, Jiachi Chen 1, Chong Wang 2, Ensheng Shi 3, Xilin Liu 3, Yuchi Ma 3, Zibin Zheng 1
Published on arXiv
arXiv:2601.20615
Model Denial of Service
OWASP LLM Top 10 — LLM04
Prompt Injection
OWASP LLM Top 10 — LLM01
Key Finding
DrainCode achieves up to an 85% increase in GPU latency, a 49% increase in energy consumption, and more than a 3x increase in output length on RAG-based code generation LLMs compared to the baseline.
DrainCode
Novel technique introduced
Large language models (LLMs) have demonstrated impressive capabilities in code generation by leveraging retrieval-augmented generation (RAG) methods. However, the computational costs associated with LLM inference, particularly latency and energy consumption, have received limited attention in the security context. This paper introduces DrainCode, the first adversarial attack targeting the computational efficiency of RAG-based code generation systems. By strategically poisoning retrieval contexts through a mutation-based approach, DrainCode forces LLMs to produce significantly longer outputs, thereby increasing GPU latency and energy consumption. We evaluate the effectiveness of DrainCode across multiple models. Our experiments show that DrainCode achieves up to an 85% increase in latency, a 49% increase in energy consumption, and more than a 3x increase in output length compared to the baseline. Furthermore, we demonstrate the generalizability of the attack across different prompting strategies and its robustness against existing defenses. The results highlight DrainCode as a potential method for increasing the computational overhead of LLMs, making it useful for evaluating LLM security in resource-constrained environments. We provide code and data at https://github.com/DeepSoftwareAnalytics/DrainCode.
Key Contributions
- First adversarial attack targeting computational efficiency (latency and energy) of RAG-based code generation LLMs rather than output correctness
- Mutation-based context poisoning strategy that injects adversarially crafted code snippets into retrieval databases to force verbose LLM outputs
- Empirical evaluation across multiple LLMs showing up to 85% latency increase, 49% energy increase, and >3x output length increase, with analysis of defenses and prompting strategies
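The contributions above describe a mutation-based search over poisoned retrieval snippets, optimized to inflate output length (a proxy for latency and energy). The paper's exact mutation operators and scoring are in the linked repository; the sketch below is only a minimal illustration of the general hill-climbing idea, with a stub model and a hypothetical single-token-insertion mutation standing in for the real components.

```python
import random


def output_length(model, query, context):
    """Proxy objective: number of tokens the model emits given a poisoned context."""
    return len(model(query, context).split())


def mutate(snippet, vocab, rng):
    """Hypothetical mutation operator: insert one candidate token at a random position."""
    words = snippet.split()
    words.insert(rng.randrange(len(words) + 1), rng.choice(vocab))
    return " ".join(words)


def drain_search(model, query, seed_snippet, vocab, iterations=50, seed=0):
    """Greedy hill-climbing: keep any mutation that lengthens the model's output."""
    rng = random.Random(seed)
    best = seed_snippet
    best_len = output_length(model, query, best)
    for _ in range(iterations):
        candidate = mutate(best, vocab, rng)
        cand_len = output_length(model, query, candidate)
        if cand_len > best_len:  # accept only length-increasing mutations
            best, best_len = candidate, cand_len
    return best, best_len


def stub_model(query, context):
    """Toy stand-in for an LLM: emits one extra token per 'explain' trigger in the context."""
    n = 1 + context.count("explain")
    return " ".join(["token"] * n)


if __name__ == "__main__":
    seed_snippet = "def sort(xs): return sorted(xs)"
    poison, length = drain_search(
        stub_model, "sort a list", seed_snippet,
        vocab=["explain", "verbose", "comment"], iterations=40,
    )
    print(length)
```

In a real attack the stub would be replaced by queries against the target RAG pipeline, and the mutated snippet would be planted in the retrieval database so it surfaces for the victim's queries; the greedy accept rule guarantees the objective is monotonically non-decreasing over the search.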