Yong Wang

attack · arXiv Mar 27, 2026

Eric Yocam, Varghese Vaidyan, Yong Wang · California Polytechnic State University · Dakota State University +1 more

Mechanistic attack amplifying hallucination nodes in LLM hidden states, with adaptive defense canceling excess activations at inference

Input Manipulation Attack · Prompt Injection · nlp

Papers in Database (1)