defense 2026

Generalizable Face Forgery Detection via Separable Prompt Learning

Enrui Yang , Yuezun Li

0 citations

α

Published on arXiv

2604.17307

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

Achieves competitive and superior performance compared to existing methods under both cross-dataset and cross-method evaluation, demonstrating strong generalizability to unseen forgery techniques

SePL

Novel technique introduced


Detecting face forgeries using CLIP has recently emerged as a promising and increasingly popular research direction. Owing to its rich visual knowledge acquired through large-scale pretraining, most existing methods typically rely on the visual encoder of CLIP, while paying limited attention to the text modality. Given the instructive nature of the text modality, we posit that it can be leveraged to instruct Deepfake detection with meticulous design. Accordingly, we shift the focus from the visual modality to the text modality and propose a new Separable Prompt Learning strategy (SePL) that enables CLIP to serve as an effective face forgery detector. The core idea of SePL is to disentangle forgery-specific and forgery-irrelevant information in images via two types of prompt learning, with the former enhancing detection. To achieve this disentangle, we describe a cross-modality alignment strategy and a set of dedicated objectives. Extensive experiments demonstrate that, with this simple adaptation, our method achieves competitive and even superior performance compared to other methods under both cross-dataset and cross-method evaluation, highlighting its strong generalizability. The codes have been released at https://github.com/OUC-YER/SePL-DeepfakeDetection


Key Contributions

  • Separable Prompt Learning (SePL) strategy that disentangles forgery-specific from forgery-irrelevant information via dual prompt types
  • Cross-modality alignment strategy between text and visual encoders with dedicated objectives for prompt learning
  • Strong cross-dataset and cross-method generalization performance on deepfake detection benchmarks

🛡️ Threat Analysis

Output Integrity Attack

Paper focuses on detecting AI-generated face forgeries (deepfakes) — verifying authenticity of face images and detecting synthetic/manipulated content, which is output integrity and AI-generated content detection.


Details

Domains
visionmultimodal
Model Types
multimodaltransformer
Threat Tags
inference_time
Datasets
DFDCFaceForensics++
Applications
deepfake detectionface forgery detection