When Deepfake Detection Meets Graph Neural Network: a Unified and Lightweight Learning Framework
Haoyu Liu 1, Chaoyu Gong 1, Mengke He 1, Jiate Li 2,1, Kai Han 3, Siqiang Luo 1
Published on arXiv (2508.05526)
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
SSTGNN achieves superior in-domain and cross-domain deepfake detection with up to 42× fewer parameters than state-of-the-art models
SSTGNN
Novel technique introduced
The proliferation of generative video models has made detecting AI-generated and manipulated videos an urgent challenge. Existing detection approaches often fail to generalize across diverse manipulation types because they rely on isolated spatial, temporal, or spectral information, and they typically require large models to perform well. This paper introduces SSTGNN, a lightweight Spatial-Spectral-Temporal Graph Neural Network framework that represents videos as structured graphs, enabling joint reasoning over spatial inconsistencies, temporal artifacts, and spectral distortions. SSTGNN incorporates learnable spectral filters and spatial-temporal differential modeling into a unified graph-based architecture, capturing subtle manipulation traces more effectively. Extensive experiments on diverse benchmark datasets demonstrate that SSTGNN not only achieves superior performance in both in-domain and cross-domain settings, but also offers strong computational efficiency. Remarkably, SSTGNN accomplishes these results with up to 42× fewer parameters than state-of-the-art models, making it highly lightweight and resource-friendly for real-world deployment.
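To make the "videos as structured graphs" idea concrete, the sketch below builds one plausible spatial-temporal graph over video patches with temporal differential features. This is an illustrative assumption, not the paper's code: the node definition (frame patches), the edge scheme (fully connected within a frame, same-patch links across adjacent frames), and the frame-difference features are all hypothetical choices consistent with the abstract's description.

```python
import numpy as np

# Hypothetical dimensions: T frames, P patches per frame, D-dim patch embeddings.
T, P, D = 4, 9, 16
rng = np.random.default_rng(0)
feats = rng.standard_normal((T, P, D))  # stand-in patch embeddings

# Spatial-temporal differential features (assumption): differences between
# consecutive frames, intended to surface temporal artifacts such as flicker.
diff = np.zeros_like(feats)
diff[1:] = feats[1:] - feats[:-1]
node_feats = np.concatenate([feats, diff], axis=-1).reshape(T * P, 2 * D)

# Adjacency: spatial edges among patches of the same frame, plus temporal
# edges linking the same patch index across adjacent frames.
N = T * P
A = np.zeros((N, N))
for t in range(T):
    idx = np.arange(t * P, (t + 1) * P)
    A[np.ix_(idx, idx)] = 1.0                # spatial edges (within frame)
for t in range(T - 1):
    for p in range(P):
        i, j = t * P + p, (t + 1) * P + p
        A[i, j] = A[j, i] = 1.0              # temporal edges (across frames)
np.fill_diagonal(A, 0.0)                     # no self-loops

print(node_feats.shape)        # (36, 32): T*P nodes, feature + differential
print(int(A.sum()) // 2)       # 171 undirected edges: 4*36 spatial + 27 temporal
```

A GNN operating on `(node_feats, A)` can then propagate evidence jointly across space and time, which is the kind of joint reasoning the abstract attributes to SSTGNN.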
Key Contributions
- SSTGNN: a unified graph-based architecture that jointly models spatial inconsistencies, temporal artifacts, and spectral distortions for deepfake video detection
- Learnable spectral filters and spatial-temporal differential modeling embedded within a GNN framework for capturing subtle manipulation traces
- Achieves up to 42× parameter reduction compared to SOTA models while maintaining superior in-domain and cross-domain detection performance
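The "learnable spectral filters" contribution can be sketched as a learnable polynomial of the normalized graph Laplacian, a common GNN spectral-filtering design. This is a minimal sketch under that assumption; the paper's exact parameterization (filter order, basis, normalization) may differ.

```python
import numpy as np

rng = np.random.default_rng(1)
N, D, K = 6, 8, 3                         # nodes, feature dim, filter order

# Random symmetric adjacency as a stand-in graph.
A = (rng.random((N, N)) < 0.4).astype(float)
A = np.triu(A, 1)
A = A + A.T

# Symmetric normalized Laplacian: L = I - D^{-1/2} A D^{-1/2}.
deg = A.sum(axis=1)
d_inv_sqrt = np.zeros_like(deg)
nz = deg > 0
d_inv_sqrt[nz] = deg[nz] ** -0.5
L = np.eye(N) - d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]

X = rng.standard_normal((N, D))           # node features
theta = rng.standard_normal(K)            # learnable filter coefficients

# Spectral filtering as a polynomial in L: H = sum_k theta_k * L^k X.
# In training, theta would be optimized by backprop; here it is random.
H = sum(theta[k] * np.linalg.matrix_power(L, k) @ X for k in range(K))
print(H.shape)                            # (6, 8)
```

Polynomial filters like this act on the graph's frequency spectrum without an explicit eigendecomposition, which keeps the parameter count small per layer, consistent with the lightweight design the card emphasizes.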
🛡️ Threat Analysis
Proposes a novel detection architecture for AI-generated and manipulated video content. Deepfake detection is explicitly listed under ML09 (output integrity / AI-generated content detection). The contribution is a new forensic method, not merely an application of existing detectors to a new domain.