Ruocheng Shan

h-index: 2 · 8 citations · 6 papers (total)

Papers in Database (1)

defense · arXiv · Nov 23, 2025

Semantics as a Shield: Label Disguise Defense (LDD) against Prompt Injection in LLM Sentiment Classification

Yanxi Li, Ruocheng Shan · George Washington University

Defends LLMs against class-directive prompt injection by disguising output labels with alias terms in few-shot prompts

Prompt Injection · NLP