On the detection of synthetic images generated by diffusion models

Riccardo Corvi, Davide Cozzolino, Giada Zingarini, Giovanni Poggi, Koki Nagano, Luisa Verdoliva
DOI: https://doi.org/10.48550/arXiv.2211.00680
2022-11-02
Abstract: Over the past decade, there has been tremendous progress in creating synthetic media, mainly thanks to the development of powerful methods based on generative adversarial networks (GAN). Very recently, methods based on diffusion models (DM) have been gaining the spotlight. In addition to providing an impressive level of photorealism, they enable the creation of text-based visual content, opening up new and exciting opportunities in many different application fields, from arts to video games. On the other hand, this property is an additional asset in the hands of malicious users, who can generate and distribute fake media perfectly adapted to their attacks, posing new challenges to the media forensic community. With this work, we seek to understand how difficult it is to distinguish synthetic images generated by diffusion models from pristine ones and whether current state-of-the-art detectors are suitable for the task. To this end, first we expose the forensics traces left by diffusion models, then study how current detectors, developed for GAN-generated images, perform on these new synthetic images, especially in challenging social-networks scenarios involving image compression and resizing. Datasets and code are available at http://github.com/grip-unina/DMimageDetection.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper investigates how to detect synthetic images generated by diffusion models (DMs) and evaluates how well current state-of-the-art detectors perform on this task. Specifically, the researchers address two core questions:

1. **Do images generated by diffusion models carry hidden artifacts similar to those of GAN-generated images?** The paper analyzes the fingerprints of images produced by different generative models, including both GANs and DMs. It finds that some diffusion models (such as GLIDE, Latent Diffusion, and Stable Diffusion) leave fingerprints similar to those of GAN-generated images, while others (such as ADM and DALL·E 2) show weaker artifacts.
2. **How effective are current state-of-the-art detectors on such images?** The researchers test several existing detection methods (such as Spec, PatchForensics, Wang2020, and Grag2021) on images produced by different generative models. Detection works relatively well on uncompressed images, but performance drops significantly once the images have been compressed and resized. Images generated by ADM and DALL·E 2 are particularly difficult to detect.

The paper also examines how well detectors generalize across generative models. Detectors trained only on GAN-generated images perform poorly on images generated by diffusion models. A detector trained on images from a diffusion model recognizes images from similar diffusion models more reliably, but its effectiveness on other types of diffusion models remains limited.

Overall, through detailed experimental analysis, the paper reveals the challenges of detecting images generated by diffusion models and provides a reference point for future research.
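
To make the idea of a model "fingerprint" more concrete, here is a minimal sketch of one common forensic recipe: average the noise residuals of many images from the same source and inspect the power spectrum of the result. This is not the authors' code. The box-filter "denoiser", the 64x64 random test images, the injected cosine pattern, and all function names (`noise_residual`, `average_power_spectrum`, `peak_offset`) are assumptions chosen only for illustration.

```python
import numpy as np


def noise_residual(img, k=3):
    """High-pass residual: image minus a k x k box-blurred copy.

    The box blur is a crude stand-in for a real denoising filter (assumption).
    """
    pad = k // 2
    padded = np.pad(img, pad, mode="reflect")
    blurred = np.zeros_like(img, dtype=np.float64)
    for dy in range(k):
        for dx in range(k):
            blurred += padded[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return img - blurred / (k * k)


def average_power_spectrum(images):
    """Average the residuals of all images, then return the centered
    power spectrum of that mean residual."""
    mean_res = np.mean(
        [noise_residual(im.astype(np.float64)) for im in images], axis=0
    )
    return np.abs(np.fft.fftshift(np.fft.fft2(mean_res))) ** 2


def peak_offset(spectrum):
    """Return (dy, dx) of the strongest spectral bin relative to the center."""
    cy, cx = spectrum.shape[0] // 2, spectrum.shape[1] // 2
    py, px = np.unravel_index(np.argmax(spectrum), spectrum.shape)
    return int(py - cy), int(px - cx)


# Toy data (hypothetical): random "pristine" images, and copies with a weak
# periodic pattern added to mimic a generator's upsampling artifact.
rng = np.random.default_rng(0)
size, period = 64, 8
yy, xx = np.mgrid[0:size, 0:size]
pattern = 0.5 * np.cos(2 * np.pi * xx / period)

clean = [rng.normal(0.0, 1.0, (size, size)) for _ in range(200)]
synthetic = [im + pattern for im in clean]

spec_clean = average_power_spectrum(clean)
spec_fake = average_power_spectrum(synthetic)

print("strongest bin offset (synthetic):", peak_offset(spec_fake))
```

Averaging suppresses image content and random noise, which differ from image to image, while a pattern shared by every image survives and shows up as an isolated spectral peak. In this toy setup the peak sits at a horizontal frequency of `size / period` bins from the center. With real images, a weaker or absent peak would correspond to the "weaker artifacts" case described above.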