defense 2025

Hierarchical Deep Fusion Framework for Multi-dimensional Facial Forgery Detection - The 2024 Global Deepfake Image Detection Challenge

Kohou Wang , Huan Hu , Xiang Liu , Zezhou Chen , Ping Chen , Zhaoxiang Liu , Shiguo Lian

0 citations

α

Published on arXiv

2509.13107

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

HDFF achieves a score of 0.96852 on the private leaderboard, ranking 20th out of 184 teams in the 2024 Global Deepfake Image Detection Challenge.

HDFF (Hierarchical Deep Fusion Framework)

Novel technique introduced


The proliferation of sophisticated deepfake technology poses significant challenges to digital security and authenticity. Detecting these forgeries, especially across a wide spectrum of manipulation techniques, requires robust and generalized models. This paper introduces the Hierarchical Deep Fusion Framework (HDFF), an ensemble-based deep learning architecture designed for high-performance facial forgery detection. Our framework integrates four diverse pre-trained sub-models, Swin-MLP, CoAtNet, EfficientNetV2, and DaViT, which are meticulously fine-tuned through a multi-stage process on the MultiFFDI dataset. By concatenating the feature representations from these specialized models and training a final classifier layer, HDFF effectively leverages their collective strengths. This approach achieved a final score of 0.96852 on the competition's private leaderboard, securing the 20th position out of 184 teams, demonstrating the efficacy of hierarchical fusion for complex image classification tasks.


Key Contributions

  • HDFF ensemble architecture integrating Swin-MLP, CoAtNet, EfficientNetV2, and DaViT via feature concatenation and a fused FC classifier for deepfake detection
  • Multi-stage training strategy: ImageNet-pretrained initialization → frozen-backbone head fine-tuning → full end-to-end fine-tuning
  • Competitive validation on MultiFFDI (50+ forgery methods), achieving 0.96852 score and 20th place out of 184 teams

🛡️ Threat Analysis

Output Integrity Attack

Proposes a detection architecture specifically targeting AI-generated/manipulated facial imagery (deepfakes) — classic output integrity and AI-generated content detection work.


Details

Domains
visiongenerative
Model Types
transformercnn
Threat Tags
inference_time
Datasets
MultiFFDIImageNet-1K
Applications
facial forgery detectiondeepfake detection