Latest papers

5 papers
benchmark arXiv Jan 29, 2026 · 9w ago

WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models

Zijin Yang, Yu Sun, Kejiang Chen et al. · University of Science and Technology of China · Anhui Province Key Laboratory of Digital Security +1 more

Proposes a unified VLM-based benchmark for evaluating residual and semantic watermarks in diffusion model image outputs

Output Integrity Attack visiongenerative
PDF
defense arXiv Nov 26, 2025 · Nov 2025

GuardTrace-VL: Detecting Unsafe Multimodel Reasoning via Iterative Safety Supervision

Yuxiao Xiang, Junchi Chen, Zhenchao Jin et al. · University of Science and Technology of China · Anhui Province Key Laboratory of Digital Security +1 more

Defends VLMs against unsafe intermediate reasoning by auditing the full Question-Thinking-Answer pipeline with a vision-aware safety guard

Prompt Injection multimodalnlp
PDF
defense arXiv Oct 25, 2025 · Oct 2025

T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models

Jindong Yang, Han Fang, Weiming Zhang et al. · University of Science and Technology of China · Anhui Province Key Laboratory of Digital Security +1 more

Proposes Tail-Truncated Sampling watermarking for diffusion model outputs, balancing robustness and generation diversity

Output Integrity Attack visiongenerative
5 citations 2 influentialPDF Code
defense arXiv Oct 1, 2025 · Oct 2025

LAKAN: Landmark-assisted Adaptive Kolmogorov-Arnold Network for Face Forgery Detection

Jiayao Jiang, Bin Liu, Qi Chu et al. · University of Science and Technology of China · Anhui Province Key Laboratory of Digital Security

Novel KAN-based deepfake detector uses facial landmarks to adaptively generate spline activations for artifact detection

Output Integrity Attack vision
PDF
benchmark arXiv Sep 6, 2025 · Sep 2025

MFFI: Multi-Dimensional Face Forgery Image Dataset for Real-World Scenarios

Changtao Miao, Yi Zhang, Man Luo et al. · Ant Group · Anhui Province Key Laboratory of Digital Security +4 more

Proposes a 1024K-image deepfake benchmark dataset spanning 50 forgery methods and real-world degradation for face forgery detection evaluation

Output Integrity Attack visiongenerative
PDF Code