Qinqin He

h-index: 1 4 citations 2 papers (total)

Papers in Database (1)

defense arXiv Sep 25, 2025 · Sep 2025

A Single Neuron Works: Precise Concept Erasure in Text-to-Image Diffusion Models

Qinqin He, Jiaqi Weng, Jialing Tao et al. · Alibaba Group

Defends text-to-image diffusion models against harmful content generation by suppressing a single SAE-identified neuron, with adversarial robustness

Input Manipulation Attack visionnlpgenerative
4 citations 1 influentialPDF