Abstract: Deepfakes, synthetic images generated by deep learning algorithms, represent one of the biggest challenges in the field of Digital Forensics. The scientific community is working to develop approaches that can discriminate the origin of digital images (real or AI-generated). However, these methodologies face the challenge of generalization, that is, the ability to discern the nature of an image even if it was generated by an architecture not seen during training; performance usually drops in that setting. In this context, we propose a novel approach based on three blocks called Base Models, each of which is responsible for extracting the discriminative features of a specific image class (Diffusion Model-generated, GAN-generated, or real) and is trained on deliberately unbalanced datasets so that it focuses on that class. The features extracted from each block are then concatenated and processed to discriminate the origin of the input image. Experimental results show that this approach is not only robust to JPEG compression but also outperforms state-of-the-art methods in several generalization tests. Code, models and dataset are available at

What problem does this paper attempt to address?

The main problem this paper addresses is distinguishing synthetic images from real ones, and in particular generalizing to images created by generation architectures that were not seen during training. With the development of deep-learning technologies, especially the progress of Generative Adversarial Networks (GANs) and Diffusion Models, synthetic images have reached a quality that makes them very difficult to distinguish from real ones. Although this high-fidelity image generation has brought great opportunities in fields such as art, entertainment, scientific research, and multimedia content production, it has also opened the door to abuse, such as fake news, fraud, and privacy invasion. Developing effective methods to identify AI-generated content is therefore crucial for maintaining the integrity of online information and combating the spread of Deepfakes.

The paper proposes a new method based on three modules (called Base Models). Each module is responsible for extracting discriminative features from a specific type of image (generated by diffusion models, generated by GANs, or real). The Base Models are trained on deliberately unbalanced datasets so that each one focuses on the distinctive traces that its image class carries, such as those left by a generation architecture during image generation. The features extracted by the three modules are then concatenated and further processed to determine the source of the input image. Experimental results show that this method is not only robust to JPEG compression but also outperforms existing state-of-the-art methods in multiple generalization tests.

In conclusion, this paper aims to improve the ability to distinguish real images from synthetic ones in different contexts by proposing a new deep-learning architecture, with a focus on robustness to JPEG compression and on generalization to unseen generators.
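
The three-module design described above can be illustrated with a minimal sketch. It is not the authors' implementation: the class names `BaseModel` and `FusionClassifier`, the helper `make_unbalanced_subset`, the `target_fraction` parameter, and all dimensions are hypothetical, and random linear maps stand in for trained CNN backbones. The summary does not state the imbalance ratio or its direction, so the sketch leaves it as a free parameter. It shows only the data flow: one unbalanced subset per Base Model, three feature vectors, concatenation, and a final three-way decision.

```python
import math
import random

CLASSES = ("diffusion", "gan", "real")  # the three image classes named in the text


def make_unbalanced_subset(samples, target_class, target_fraction, size, rng):
    """Draw `size` (item, label) pairs so that a `target_fraction` share of them
    belongs to `target_class`. The fraction is a hypothetical knob: the source
    does not state the actual ratio. Sampling is with replacement."""
    target = [s for s in samples if s[1] == target_class]
    others = [s for s in samples if s[1] != target_class]
    n_target = round(size * target_fraction)
    subset = rng.choices(target, k=n_target) + rng.choices(others, k=size - n_target)
    rng.shuffle(subset)
    return subset


class BaseModel:
    """Stand-in feature extractor: a fixed random linear map followed by ReLU.
    A real Base Model would be a trained network, not random weights."""

    def __init__(self, in_dim, feat_dim, rng):
        self.weights = [[rng.gauss(0, 1) for _ in range(in_dim)] for _ in range(feat_dim)]

    def extract(self, x):
        return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in self.weights]


class FusionClassifier:
    """Concatenates the Base Models' features and applies a linear head with softmax."""

    def __init__(self, base_models, n_classes, rng):
        self.base_models = base_models
        fused_dim = sum(len(m.weights) for m in base_models)
        self.head = [[rng.gauss(0, 0.1) for _ in range(fused_dim)] for _ in range(n_classes)]

    def fuse(self, x):
        fused = []
        for model in self.base_models:
            fused.extend(model.extract(x))  # concatenation of per-class features
        return fused

    def predict_proba(self, x):
        fused = self.fuse(x)
        logits = [sum(w * f for w, f in zip(row, fused)) for row in self.head]
        peak = max(logits)  # subtract the maximum for numerical stability
        exps = [math.exp(value - peak) for value in logits]
        total = sum(exps)
        return [e / total for e in exps]


rng = random.Random(0)

# Toy labelled pool and one unbalanced subset for the GAN-focused Base Model.
pool = [(i, CLASSES[i % 3]) for i in range(30)]
gan_subset = make_unbalanced_subset(pool, "gan", target_fraction=0.8, size=10, rng=rng)

# Toy "image" as a flat vector, passed through three extractors and the fusion head.
image = [rng.random() for _ in range(16)]
base_models = [BaseModel(in_dim=16, feat_dim=8, rng=rng) for _ in CLASSES]
classifier = FusionClassifier(base_models, n_classes=len(CLASSES), rng=rng)
probs = classifier.predict_proba(image)
```

Here `classifier.fuse(image)` returns a single vector of 3 × 8 = 24 features, and `probs` holds one probability per entry of `CLASSES`. In a trained system the head would be fitted on the concatenated features; in this sketch its weights are random, so the probabilities carry no meaning beyond demonstrating the shapes.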

DeepFeatureX Net: Deep Features eXtractors based Network for discriminating synthetic from real images
