
HEDGE: Heterogeneous Ensemble for Detection of AI-GEnerated Images in the Wild

Fei Wu 1, Dagong Lu 2, Mufeng Yao 2, Xinlei Xu 2, Fengjun Guo 2


Published on arXiv (2604.03555)

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

Achieves 4th place in the NTIRE 2026 Robust AI-Generated Image Detection in the Wild Challenge, with state-of-the-art robustness across multiple AIGC benchmarks.

Novel technique introduced: HEDGE


Robust detection of AI-generated images in the wild remains challenging due to the rapid evolution of generative models and varied real-world distortions. We argue that relying on a single training regime, resolution, or backbone is insufficient to handle all conditions, and that structured heterogeneity across these dimensions is essential for robust detection. To this end, we propose HEDGE, a Heterogeneous Ensemble for Detection of AI-GEnerated images, that introduces complementary detection routes along three axes: diverse training data with strong augmentation, multi-scale feature extraction, and backbone heterogeneity. Specifically, Route~A progressively constructs DINOv3-based detectors through staged data expansion and augmentation escalation, Route~B incorporates a higher-resolution branch for fine-grained forensic cues, and Route~C adds a MetaCLIP2-based branch for backbone diversity. All outputs are fused via logit-space weighted averaging, refined by a lightweight dual-gating mechanism that handles branch-level outliers and majority-dominated fusion errors. HEDGE achieves 4th place in the NTIRE 2026 Robust AI-Generated Image Detection in the Wild Challenge and attains state-of-the-art performance with strong robustness on multiple AIGC image detection benchmarks.
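The abstract describes fusing the branch outputs by logit-space weighted averaging, refined by a lightweight dual-gating mechanism for branch-level outliers and majority-dominated fusion errors. The paper's exact gating rules are not given here, so the sketch below is a hypothetical illustration: function name, thresholds, and both gating heuristics are assumptions, not the authors' implementation.

```python
import numpy as np

def fuse_logits(branch_logits, weights, outlier_z=2.5, margin=0.5):
    """Illustrative HEDGE-style fusion sketch (not the paper's exact method).

    Weighted logit averaging with a dual gate:
      Gate 1 suppresses branch-level outliers.
      Gate 2 revisits low-margin fusions where a confident minority
      disagrees with a weak majority.
    """
    logits = np.asarray(branch_logits, dtype=float)
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()

    # Gate 1 (assumed rule): down-weight branches whose logits sit far
    # from the weighted consensus, then renormalize.
    consensus = float(np.dot(w, logits))
    spread = logits.std() + 1e-8
    z = np.abs(logits - consensus) / spread
    w = np.where(z > outlier_z, w * 0.25, w)
    w = w / w.sum()
    fused = float(np.dot(w, logits))

    # Gate 2 (assumed rule): if the fused logit is low-margin and the
    # most confident branch contradicts the majority vote, defer to
    # that branch instead of the majority-dominated average.
    majority_positive = (logits > 0).mean() > 0.5
    if abs(fused) < margin:
        strongest = logits[np.argmax(np.abs(logits))]
        if (strongest > 0) != majority_positive:
            fused = strongest
    return fused
```

In this reading, positive fused logits indicate "AI-generated"; the per-branch weights would be tuned on validation data rather than fixed.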


Key Contributions

  • Heterogeneous ensemble architecture combining DINOv3 and MetaCLIP2 backbones with multi-scale feature extraction
  • Three-route detection strategy with staged data expansion, multi-resolution branches, and backbone diversity
  • Dual-gating fusion mechanism to handle outliers and majority-dominated errors in ensemble outputs

🛡️ Threat Analysis

Output Integrity Attack

Detects AI-generated images to verify content authenticity and provenance — core output integrity problem. The paper builds a robust detection system for synthetic images in the wild.


Details

Domains
vision, generative
Model Types
diffusion, GAN, transformer
Threat Tags
inference_time
Datasets
NTIRE 2026 Challenge dataset
Applications
AI-generated image detection, deepfake detection, synthetic media forensics