Shengfang Zhai

h-index: 7 313 citations 18 papers (total)

Papers in Database (2)

defense arXiv Feb 7, 2026 · 8w ago

MemPot: Defending Against Memory Extraction Attack with Optimized Honeypots

Yuhao Wang, Shengfang Zhai, Guanghao Jin et al. · National University of Singapore · Southern University of Science and Technology +1 more

Defends LLM agent memory from adversarial data extraction by injecting optimized honeypot documents with SPRT-based sequential attacker detection

Sensitive Information Disclosure nlp
PDF
defense arXiv Oct 3, 2025 · Oct 2025

DMark: Order-Agnostic Watermarking for Diffusion Large Language Models

Linyu Wu, Linhao Zhong, Wenjie Qu et al. · National University of Singapore · Zhejiang University

Watermarks diffusion LLM text outputs via order-agnostic predictive and bidirectional strategies, achieving 92–99.5% detection at 1% FPR

Output Integrity Attack nlp
PDF