Tao Jiang

h-index: 6 219 citations 10 papers (total)

Papers in Database (1)

defense arXiv Sep 29, 2025 · Sep 2025

Sanitize Your Responses: Mitigating Privacy Leakage in Large Language Models

Wenjie Fu, Huandong Wang, Junyao Gao et al. · Huazhong University of Science and Technology · Tsinghua University +2 more

Token-level self-monitoring and in-place repair framework that prevents LLMs from leaking private information via adversarial prompts

Sensitive Information Disclosure Prompt Injection nlp
PDF Code