Weiming Zhang

Papers in Database (5)

defense arXiv Feb 27, 2026 · 11w ago

SKeDA: A Generative Watermarking Framework for Text-to-video Diffusion Models

Yang Yang, Xinze Zou, Zehua Ma et al. · AnHui University · University of Science and Technology of China +1 more

Embeds robust watermarks into text-to-video diffusion outputs using shuffle-key sampling and differential attention for provenance tracking

Output Integrity Attack visiongenerative
PDF
tool arXiv Mar 9, 2026 · 10w ago

SWIFT: Sliding Window Reconstruction for Few-Shot Training-Free Generated Video Attribution

Chao Wang, Zijin Yang, Yaofei Wang et al. · University of Science and Technology of China · Hefei University of Technology

Few-shot, training-free video attribution tool traces generated videos to source models via sliding-window reconstruction loss signals

Output Integrity Attack visiongenerative
PDF Code
defense arXiv Apr 16, 2026 · 5w ago

Flow of Truth: Proactive Temporal Forensics for Image-to-Video Generation

Yuzhuo Chen, Zehua Ma, Han Fang et al. · University of Science and Technology of China

Proactive temporal forensics framework that embeds motion-aware watermarks in image-to-video generation for content authentication

Output Integrity Attack visiongenerative
PDF
attack arXiv Apr 17, 2026 · 4w ago

Into the Gray Zone: Domain Contexts Can Blur LLM Safety Boundaries

Ki Sen Hung, Xi Yang, Chang Liu et al. · The Hong Kong University of Science and Technology · University of Science and Technology of China

Context-based jailbreak attack achieving 93%+ success by exploiting safety-research framing to trigger broad defense relaxation across frontier LLMs

Prompt Injection nlp
PDF Code
defense arXiv Apr 7, 2026 · 6w ago

VLMShield: Efficient and Robust Defense of Vision-Language Models against Malicious Prompts

Peigui Qi, Kunsheng Tang, Yanpu Yu et al. · University of Science and Technology of China · Ant Group +1 more

Lightweight detector identifying malicious multimodal jailbreak prompts for VLMs via feature distribution analysis with CLIP-based aggregation

Input Manipulation Attack Prompt Injection multimodalvisionnlp
PDF Code