DeepFeatureX Net: Deep Features eXtractors based Network for discriminating synthetic from real images

Orazio Pontorno,Luca Guarnera,Sebastiano Battiato
2024-04-24
Abstract:Deepfakes, synthetic images generated by deep learning algorithms, represent one of the biggest challenges in the field of Digital Forensics. The scientific community is working to develop approaches that can discriminate the origin of digital images (real or AI-generated). However, these methodologies face the challenge of generalization, that is, the ability to discern the nature of an image even if it is generated by an architecture not seen during training. This usually leads to a drop in performance. In this context, we propose a novel approach based on three blocks called Base Models, each of which is responsible for extracting the discriminative features of a specific image class (Diffusion Model-generated, GAN-generated, or real) as it is trained by exploiting deliberately unbalanced datasets. The features extracted from each block are then concatenated and processed to discriminate the origin of the input image. Experimental results showed that this approach not only demonstrates good robust capabilities to JPEG compression but also outperforms state-of-the-art methods in several generalization tests. Code, models and dataset are available at
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the ability to distinguish between synthetic images and real images, especially the generalization ability when facing images created by generation architectures that have not been seen during the training process. With the development of deep - learning technologies, especially the progress of Generative Adversarial Networks (GANs) and Diffusion Models, the quality of synthetic images has reached a very high level, making these images very difficult to distinguish from real ones. Although this high - fidelity image - generation ability has brought great opportunities in fields such as art, entertainment, scientific research, and multimedia content production, it has also raised the possibility of abuse, such as creating fake news, fraud, and privacy - invasion problems. Therefore, developing effective detection methods to identify AI - generated content is crucial for maintaining the integrity of online information and combating the spread of Deepfakes. The paper proposes a new method based on three modules (called Base Models). Each module is responsible for extracting discriminative features from a specific type of image (generated by diffusion models, generated by GANs, or real images). These base models are trained by using deliberately unbalanced datasets to focus on extracting the unique features left by each generation architecture during the image - generation process. Then, the features extracted from each module are concatenated and further processed to determine the source of the input image. Experimental results show that this method is not only robust to JPEG compression but also outperforms existing state - of - the - art methods in multiple generalization tests. In conclusion, this paper aims to improve the ability to distinguish between real images and synthetic images in different contexts by proposing a new deep - learning architecture, especially enhancing the model's resistance to JPEG - compression attacks and its performance in generalization ability.