Weiming Zhang

defense arXiv Feb 27, 2026 · 11w ago

Yang Yang, Xinze Zou, Zehua Ma et al. · AnHui University · University of Science and Technology of China +1 more

Embeds robust watermarks into text-to-video diffusion outputs using shuffle-key sampling and differential attention for provenance tracking

Output Integrity Attack visiongenerative

tool arXiv Mar 9, 2026 · 10w ago

Chao Wang, Zijin Yang, Yaofei Wang et al. · University of Science and Technology of China · Hefei University of Technology

Few-shot, training-free video attribution tool traces generated videos to source models via sliding-window reconstruction loss signals

Output Integrity Attack visiongenerative

defense arXiv Apr 16, 2026 · 5w ago

Yuzhuo Chen, Zehua Ma, Han Fang et al. · University of Science and Technology of China

Proactive temporal forensics framework that embeds motion-aware watermarks in image-to-video generation for content authentication

Output Integrity Attack visiongenerative

attack arXiv Apr 17, 2026 · 4w ago

Ki Sen Hung, Xi Yang, Chang Liu et al. · The Hong Kong University of Science and Technology · University of Science and Technology of China

Context-based jailbreak attack achieving 93%+ success by exploiting safety-research framing to trigger broad defense relaxation across frontier LLMs

Prompt Injection nlp

defense arXiv Apr 7, 2026 · 6w ago

Peigui Qi, Kunsheng Tang, Yanpu Yu et al. · University of Science and Technology of China · Ant Group +1 more

Lightweight detector identifying malicious multimodal jailbreak prompts for VLMs via feature distribution analysis with CLIP-based aggregation

Input Manipulation Attack Prompt Injection multimodalvisionnlp

Papers in Database (5)