Yue Liu

h-index: 3 16 citations 9 papers (total)

Papers in Database (2)

defense arXiv Sep 29, 2025 · Sep 2025

DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

Zherui Li, Zheng Nie, Zhenhong Zhou et al. · Beijing University of Posts and Telecommunications · National University of Singapore +5 more

Defends diffusion LLMs against jailbreaks by fixing greedy remasking bias and block-level autonomous safety repair

Prompt Injection nlp
3 citations 2 influentialPDF Code
defense arXiv Oct 3, 2025 · Oct 2025

DMark: Order-Agnostic Watermarking for Diffusion Large Language Models

Linyu Wu, Linhao Zhong, Wenjie Qu et al. · National University of Singapore · Zhejiang University

Watermarks diffusion LLM text outputs via order-agnostic predictive and bidirectional strategies, achieving 92–99.5% detection at 1% FPR

Output Integrity Attack nlp
PDF