defense 2026

Diversity Matters: Dataset Diversification and Dual-Branch Network for Generalized AI-Generated Image Detection

Nusrat Tasnim 1, Kutub Uddin 2, Khalid Malik 2

0 citations

α

Published on arXiv

2603.27800

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

Significantly improves cross-model and cross-dataset generalization performance compared to existing detection methods through data diversity and dual-domain feature extraction

Diversity Matters

Novel technique introduced


The rapid proliferation of AI-generated images, powered by generative adversarial networks (GANs), diffusion models, and other synthesis techniques, has raised serious concerns about misinformation, copyright violations, and digital security. However, detecting such images in a generalized and robust manner remains a major challenge due to the vast diversity of generative models and data distributions. In this work, we present \textbf{Diversity Matters}, a novel framework that emphasizes data diversity and feature domain complementarity for AI-generated image detection. The proposed method introduces a feature-domain similarity filtering mechanism that discards redundant or highly similar samples across both inter-class and intra-class distributions, ensuring a more diverse and representative training set. Furthermore, we propose a dual-branch network that combines CLIP features from the pixel domain and the frequency domain to jointly capture semantic and structural cues, leading to improved generalization against unseen generative models and adversarial conditions. Extensive experiments on benchmark datasets demonstrate that the proposed approach significantly improves cross-model and cross-dataset performance compared to existing methods. \textbf{Diversity Matters} highlights the critical role of data and feature diversity in building reliable and robust detectors against the rapidly evolving landscape of synthetic content.


Key Contributions

  • Feature-domain similarity filtering mechanism that ensures diverse training samples across inter-class and intra-class distributions
  • Dual-branch network combining CLIP pixel-domain and frequency-domain features for improved generalization
  • Demonstrates significant cross-model and cross-dataset performance against unseen generative models

🛡️ Threat Analysis

Output Integrity Attack

Detects AI-generated images (GANs, diffusion models) to verify content authenticity and distinguish synthetic from real images — core output integrity and content provenance problem.


Details

Domains
visiongenerative
Model Types
gandiffusionmultimodal
Threat Tags
inference_time
Applications
ai-generated image detectiondeepfake detectionsynthetic content verification