From Spurious to Causal: Low-rank Orthogonal Subspace Intervention for Generalizable Face Forgery Detection

The generalization problem remains a critical challenge in face forgery detection. Some researches have discovered that ``a backdoor path" in the representations from forgery-irrelevant information to labels induces biased learning, thereby hindering the generalization. In this paper, these forgery-irrelevant information are collectively termed spurious correlations factors. Previous methods predominantly focused on identifying concrete, specific spurious correlation and designing corresponding solutions to address them. However, spurious correlations arise from unobservable confounding factors, making it impractical to identify and address each one individually. To address this, we propose an intervention paradigm for representation space. Instead of tracking and blocking various instance-level spurious correlation one by one, we uniformly model them as a low-rank subspace and intervene in them. Specifically, we decompose spurious correlation features into a low-rank subspace via orthogonal low-rank projection, subsequently removing this subspace from the original representation and training its orthogonal complement to capture forgery-related features. This low-rank projection removal effectively eliminates spurious correlation factors, ensuring that classification decision is based on authentic forgery cues. With only 0.43M trainable parameters, our method achieves state-of-the-art performance across several benchmarks, demonstrating excellent robustness and generalization.

Key Contributions

Proposes a low-rank orthogonal subspace intervention paradigm that uniformly models all spurious correlation factors as a low-rank subspace and removes them from representations, rather than addressing each spurious factor individually.
Decomposes forgery-irrelevant features via orthogonal low-rank projection and trains only the orthogonal complement to capture causal forgery cues.
Achieves state-of-the-art generalization across multiple face forgery detection benchmarks with only 0.43M trainable parameters.

🛡️ Threat Analysis

Output Integrity Attack

Proposes a novel face forgery detection architecture that addresses a core forensic challenge (spurious correlations degrading generalization). Deepfake/face forgery detection is explicitly listed under ML09 AI-generated content detection, and the paper's primary contribution is a new detection technique, not a domain application of existing methods.

Details

Domains

vision

Model Types

cnntransformer

Threat Tags

inference_time

Datasets

FaceForensics++Celeb-DFDFDC

Applications

2025 0 cit.

Output Integrity Attack

100%