Morphology-optimized Multi-Scale Fusion: Combining Local Artifacts and Mesoscopic Semantics for Deepfake Detection and Localization
Chao Shuai 1, Gaojian Wang 1, Kun Pan 1, Tong Wu 1, Fanli Jin 1, Haohan Tan 1, Mengxiang Li 1, Zhenguang Liu 1,2, Feng Lin 1,2, Kui Ren 1,2
Published on arXiv
2509.13776
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
Morphological fusion of local and global prediction branches improves deepfake localization robustness by suppressing noise amplification inherent in naive output combination.
Morphology-optimized Multi-Scale Fusion (MMSF)
Novel technique introduced
While the pursuit of higher accuracy in deepfake detection remains a central goal, there is an increasing demand for precise localization of manipulated regions. Despite the remarkable progress made in classification-based detection, accurately localizing forged areas remains a significant challenge. A common strategy is to incorporate forged region annotations during model training alongside manipulated images. However, such approaches often neglect the complementary nature of local detail and global semantic context, resulting in suboptimal localization performance. Moreover, an often-overlooked aspect is the fusion strategy between local and global predictions. Naively combining the outputs from both branches can amplify noise and errors, thereby undermining the effectiveness of the localization. To address these issues, we propose a novel approach that independently predicts manipulated regions using both local and global perspectives. We employ morphological operations to fuse the outputs, effectively suppressing noise while enhancing spatial coherence. Extensive experiments reveal the effectiveness of each module in improving the accuracy and robustness of forgery localization.
Key Contributions
- Dual-branch architecture that independently predicts manipulated regions from both local artifact and global semantic perspectives
- Morphological operations as a principled fusion strategy to suppress noise and enforce spatial coherence when combining local and global branch outputs
- Demonstrates improved accuracy and robustness for forgery localization over naive branch-fusion baselines
🛡️ Threat Analysis
Core contribution is detecting and localizing AI-manipulated (deepfake) image regions — directly addresses output integrity and AI-generated content authenticity verification.