Morphology-optimized Multi-Scale Fusion: Combining Local Artifacts and Mesoscopic Semantics for Deepfake Detection and Localization

While the pursuit of higher accuracy in deepfake detection remains a central goal, there is an increasing demand for precise localization of manipulated regions. Despite the remarkable progress made in classification-based detection, accurately localizing forged areas remains a significant challenge. A common strategy is to incorporate forged region annotations during model training alongside manipulated images. However, such approaches often neglect the complementary nature of local detail and global semantic context, resulting in suboptimal localization performance. Moreover, an often-overlooked aspect is the fusion strategy between local and global predictions. Naively combining the outputs from both branches can amplify noise and errors, thereby undermining the effectiveness of the localization. To address these issues, we propose a novel approach that independently predicts manipulated regions using both local and global perspectives. We employ morphological operations to fuse the outputs, effectively suppressing noise while enhancing spatial coherence. Extensive experiments reveal the effectiveness of each module in improving the accuracy and robustness of forgery localization.

Key Contributions

Dual-branch architecture that independently predicts manipulated regions from both local artifact and global semantic perspectives
Morphological operations as a principled fusion strategy to suppress noise and enforce spatial coherence when combining local and global branch outputs
Demonstrates improved accuracy and robustness for forgery localization over naive branch-fusion baselines

🛡️ Threat Analysis

Output Integrity Attack

Core contribution is detecting and localizing AI-manipulated (deepfake) image regions — directly addresses output integrity and AI-generated content authenticity verification.

Details

Domains

vision

Model Types

cnntransformer

Threat Tags

inference_timedigital

Applications

2026 0 cit.

Output Integrity Attack

100%

Morphology-optimized Multi-Scale Fusion: Combining Local Artifacts and Mesoscopic Semantics for Deepfake Detection and Localization

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

TwoHead-SwinFPN: A Unified DL Architecture for Synthetic Manipulation, Detection and Localization in Identity Documents

ForensicFormer: Hierarchical Multi-Scale Reasoning for Cross-Domain Image Forgery Detection

ForensicFlow: A Tri-Modal Adaptive Network for Robust Deepfake Detection

Attack-Aware Deepfake Detection under Counter-Forensic Manipulations

A Novel Unified Approach to Deepfake Detection

Beyond Flicker: Detecting Kinematic Inconsistencies for Generalizable Deepfake Video Detection

Fairness-Aware Deepfake Detection: Leveraging Dual-Mechanism Optimization

StegaFFD: Privacy-Preserving Face Forgery Detection via Fine-Grained Steganographic Domain Lifting