tool 2025

UniMark: Artificial Intelligence Generated Content Identification Toolkit

Meilin Li 1,2, Ji He 1,3, Yi Yu 1, Jia Xu 1, Shanzhe Lei 1, Yan Teng 1, Yingchun Wang 1, Xuhong Wang 1

0 citations · 14 references · arXiv

α

Published on arXiv

2512.12324

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

UniMark unifies fragmented AIGC identification tools into a single framework supporting both hidden watermarking and visible compliance marking across all major media modalities.

UniMark

Novel technique introduced


The rapid proliferation of Artificial Intelligence Generated Content has precipitated a crisis of trust and urgent regulatory demands. However, existing identification tools suffer from fragmentation and a lack of support for visible compliance marking. To address these gaps, we introduce the \textbf{UniMark}, an open-source, unified framework for multimodal content governance. Our system features a modular unified engine that abstracts complexities across text, image, audio, and video modalities. Crucially, we propose a novel dual-operation strategy, natively supporting both \emph{Hidden Watermarking} for copyright protection and \emph{Visible Marking} for regulatory compliance. Furthermore, we establish a standardized evaluation framework with three specialized benchmarks (Image/Video/Audio-Bench) to ensure rigorous performance assessment. This toolkit bridges the gap between advanced algorithms and engineering implementation, fostering a more transparent and secure digital ecosystem.


Key Contributions

  • Unified, modular open-source engine abstracting AIGC identification across text, image, audio, and video modalities
  • Dual-operation strategy natively supporting both hidden watermarking (copyright/provenance) and visible marking (regulatory compliance)
  • Standardized evaluation framework with three specialized benchmarks (Image-Bench, Video-Bench, Audio-Bench) for rigorous performance assessment

🛡️ Threat Analysis

Output Integrity Attack

Primary contribution is watermarking and identifying AI-generated content (text, image, audio, video) — hidden watermarking for copyright/provenance and visible marking for regulatory compliance are both output integrity concerns. Benchmarks assess AIGC detection performance across modalities.


Details

Domains
multimodalnlpvisionaudio
Model Types
multimodalgenerative
Threat Tags
inference_time
Datasets
Image-BenchVideo-BenchAudio-Bench
Applications
aigc content governancecontent provenance trackingregulatory compliance markingcopyright protection