benchmark arXiv Nov 26, 2025 · Nov 2025
Lorenzo Pellegrini, Serafino Pandolfini, Davide Maltoni et al. · University of Bologna · IdentifAI
Benchmarks architecture-agnostic design choices for deepfake detectors, establishing best practices that achieve SOTA on AI-GenBench
Output Integrity Attack visiongenerative
The effectiveness of deepfake detection methods often depends less on their core design and more on implementation details such as data preprocessing, augmentation strategies, and optimization techniques. These factors make it difficult to fairly compare detectors and to understand which factors truly contribute to their performance. To address this, we systematically investigate how different design choices influence the accuracy and generalization capabilities of deepfake detection models, focusing on aspects related to training, inference, and incremental updates. By isolating the impact of individual factors, we aim to establish robust, architecture-agnostic best practices for the design and development of future deepfake detection systems. Our experiments identify a set of design choices that consistently improve deepfake detection and enable state-of-the-art performance on the AI-GenBench benchmark.
cnn transformer University of Bologna · IdentifAI
benchmark arXiv Dec 18, 2025 · Dec 2025
Serafino Pandolfini, Lorenzo Pellegrini, Matteo Ferrara et al. · University of Bologna
Benchmarks synthetic image detectors on localized inpainting deepfakes, revealing partial transferability and key failure modes
Output Integrity Attack visiongenerative
The rapid progress of generative AI has enabled highly realistic image manipulations, including inpainting and region-level editing. These approaches preserve most of the original visual context and are increasingly exploited in cybersecurity-relevant threat scenarios. While numerous detectors have been proposed for identifying fully synthetic images, their ability to generalize to localized manipulations remains insufficiently characterized. This work presents a systematic evaluation of state-of-the-art detectors, originally trained for the deepfake detection on fully synthetic images, when applied to a distinct challenge: localized inpainting detection. The study leverages multiple datasets spanning diverse generators, mask sizes, and inpainting techniques. Our experiments show that models trained on a large set of generators exhibit partial transferability to inpainting-based edits and can reliably detect medium- and large-area manipulations or regeneration-style inpainting, outperforming many existing ad hoc detection approaches.
cnn transformer diffusion University of Bologna