Level Up the Deepfake Detection: a Method to Effectively Discriminate Images Generated by GAN Architectures and Diffusion Models

Luca Guarnera, Oliver Giudice, Sebastiano Battiato
2023-03-02
Abstract: The image deepfake detection task has been greatly addressed by the scientific community to discriminate real images from those generated by Artificial Intelligence (AI) models: a binary classification task. In this work, the deepfake detection and recognition task was investigated by collecting a dedicated dataset of pristine images and fake ones generated by 9 different Generative Adversarial Network (GAN) architectures and by 4 additional Diffusion Models (DM). A hierarchical multi-level approach was then introduced to solve three different deepfake detection and recognition tasks: (i) Real Vs AI generated; (ii) GANs Vs DMs; (iii) AI specific architecture recognition. Experimental results demonstrated, in each case, more than 97% classification accuracy, outperforming state-of-the-art methods.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses deepfake image detection and recognition. The authors collected a dataset of real images and fake images generated by 9 Generative Adversarial Network (GAN) architectures and 4 Diffusion Models (DMs), and proposed a hierarchical multi-level classification method that solves three tasks:

1. **Distinguishing real images from AI-generated images**: deciding whether an image is real or was produced by any of the AI models.
2. **Distinguishing GAN-generated images from DM-generated images**: further deciding whether the image was generated by a GAN or a DM.
3. **Identifying the specific architecture**: building on the first two steps, identifying which specific GAN or DM architecture generated the image.

Experimental results show more than 97% classification accuracy on each task, outperforming existing state-of-the-art methods. The hierarchical method can also support in-depth analysis of multimedia data to reconstruct its history, in a way analogous to forensic ballistics, an area that existing research has explored less.
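
The three-level routing described here can be illustrated with a minimal sketch. It is not the authors' implementation: the class `HierarchicalDetector`, the label strings, the use of two separate level-3 classifiers (one per branch), and the toy threshold classifiers are all assumptions made for illustration. In practice each level would be a trained image classifier.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Sequence

# Assumption: an "image" is a flat sequence of floats so the sketch stays
# dependency-free; a real system would pass pixel arrays to trained models.
Image = Sequence[float]
Classifier = Callable[[Image], str]


@dataclass
class HierarchicalDetector:
    """Hypothetical three-level cascade: real/AI, then GAN/DM, then architecture."""
    level1: Classifier      # returns "real" or "ai"
    level2: Classifier      # returns "gan" or "dm"
    level3_gan: Classifier  # returns a GAN architecture label
    level3_dm: Classifier   # returns a DM architecture label

    def predict(self, image: Image) -> Dict[str, Optional[str]]:
        result: Dict[str, Optional[str]] = {
            "level1": self.level1(image),
            "level2": None,
            "level3": None,
        }
        # Real images stop at level 1; deeper levels only apply to AI images.
        if result["level1"] == "real":
            return result
        result["level2"] = self.level2(image)
        # Route to the architecture classifier matching the level-2 decision.
        branch = self.level3_gan if result["level2"] == "gan" else self.level3_dm
        result["level3"] = branch(image)
        return result


def mean(image: Image) -> float:
    return sum(image) / len(image)


# Toy stand-ins for trained models (placeholder labels, not real architectures).
detector = HierarchicalDetector(
    level1=lambda img: "real" if mean(img) < 0.25 else "ai",
    level2=lambda img: "gan" if mean(img) < 0.6 else "dm",
    level3_gan=lambda img: "gan_arch_0",
    level3_dm=lambda img: "dm_arch_0",
)

print(detector.predict([0.1, 0.1]))
print(detector.predict([0.5, 0.5]))
print(detector.predict([0.9, 0.9]))
```

One consequence of this cascade design is that an error at an earlier level propagates: an AI-generated image misclassified as real at level 1 never reaches the architecture classifiers.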