Yong Wang

Papers in Database (1)

attack · arXiv · Mar 27, 2026

H-Node Attack and Defense in Large Language Models

Eric Yocam, Varghese Vaidyan, Yong Wang · California Polytechnic State University · Dakota State University +1 more

A mechanistic attack that amplifies hallucination nodes in LLM hidden states, paired with an adaptive defense that cancels excess activations at inference time

Input Manipulation Attack · Prompt Injection · nlp