Xiaoyan Gu

h-index: 1 · 5 citations · 5 papers (total)

Papers in Database (2)

defense · arXiv · Nov 12, 2025

Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation

Xin Zhao, Xiaojun Chen, Bingshan Liu et al. · Chinese Academy of Sciences · State Key Laboratory of Cyberspace Security Defense +1 more

Defends text-to-image models against jailbreak prompts via LLM-driven zero-shot prompt rewriting with cultural and intent-aware safety checks.

Prompt Injection · multimodal · generative · nlp
1 citation · PDF

defense · arXiv · Dec 16, 2025

ComMark: Covert and Robust Black-Box Model Watermarking with Compressed Samples

Yunfei Yang, Xiaojun Chen, Zhendong Zhao et al. · Chinese Academy of Sciences · University of Chinese Academy of Sciences +1 more

Defends model IP by embedding frequency-domain compressed watermark samples into black-box models, resisting removal and forgery attacks.

Model Theft · vision · nlp · audio
PDF