Latest papers

1 papers
defense arXiv Nov 8, 2025 · Nov 2025

DRAGON: Guard LLM Unlearning in Context via Negative Detection and Reasoning

Yaxuan Wang, Chris Yuhao Liu, Quan Liu et al. · University of California · Accenture

Training-free LLM unlearning framework using CoT-guided detection and refusal to block queries about forgotten private or harmful data

Sensitive Information Disclosure Prompt Injection nlp
2 citations PDF