Hin Chi Kwok

defense arXiv Nov 15, 2025 · Nov 2025

Shaowei Guan, Yu Zhai, Zhengyu Zhang et al. · The Hong Kong Polytechnic University

Defends LLMs against adversarial text perturbations using DeepSeek-Reasoner CoT prompts that purify inputs and explain each defense decision

Input Manipulation Attack Prompt Injection nlp

Papers in Database (1)