Yingkai Dong

Papers in Database (2)

defense arXiv Aug 28, 2025 · Aug 2025

Beyond the Safety Tax: Mitigating Unsafe Text-to-Image Generation via External Safety Rectification

Xiangtao Meng, Yingkai Dong, Ning Yu et al. · Shandong University · Netflix

Proposes SafePatch, an external safety module for T2I diffusion models that suppresses unsafe generation without degrading benign image quality

Prompt Injection visiongenerative
PDF
attack arXiv Sep 7, 2025 · Sep 2025

DCMI: A Differential Calibration Membership Inference Attack Against Retrieval-Augmented Generation

Xinyu Gao, Xiangtao Meng, Yingkai Dong et al. · Shandong University

Novel MIA on RAG knowledge bases using differential query perturbation to isolate member document contributions

Membership Inference Attack Sensitive Information Disclosure nlp
PDF Code