Abstract:In recent years, the field of artificial intelligence has witnessed a remarkable surge in the generation of synthetic images, driven by advancements in deep learning techniques. These synthetic images, often created through complex algorithms, closely mimic real photographs, blurring the lines between reality and artificiality. This proliferation of synthetic visuals presents a pressing challenge: how to accurately and reliably distinguish between genuine and generated images. This article, in particular, explores the task of detecting images generated by text-to-image diffusion models, highlighting the challenges and peculiarities of this field. To evaluate this, we consider images generated from captions in the MSCOCO and Wikimedia datasets using two state-of-the-art models: Stable Diffusion and GLIDE. Our experiments show that it is possible to detect the generated images using simple multi-layer perceptrons (MLPs), starting from features extracted by CLIP or RoBERTa, or using traditional convolutional neural networks (CNNs). These latter models achieve remarkable performances in particular when pretrained on large datasets. We also observe that models trained on images generated by Stable Diffusion can occasionally detect images generated by GLIDE, but only on the MSCOCO dataset. However, the reverse is not true. Lastly, we find that incorporating the associated textual information with the images in some cases can lead to a better generalization capability, especially if textual features are closely related to visual ones. We also discovered that the type of subject depicted in the image can significantly impact performance. This work provides insights into the feasibility of detecting generated images and has implications for security and privacy concerns in real-world applications. The code to reproduce our results is available at: https://github.com/davide-coccomini/Detecting-Images-Generated-by-Diffusers.

DiffGuard: Text-Based Safety Checker for Diffusion Models

DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing

Red-Teaming the Stable Diffusion Safety Filter

SteerDiff: Steering towards Safe Text-to-Image Diffusion Models

Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion

Detecting images generated by diffusers

SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation

DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection

IMPRESS: Evaluating the Resilience of Imperceptible Perturbations Against Unauthorized Data Usage in Diffusion-Based Generative AI

StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model

FreezeAsGuard: Mitigating Illegal Adaptation of Diffusion Models via Selective Tensor Freezing

Toward effective protection against diffusion based mimicry through score distillation

Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion Models

EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models

When Image Generation Goes Wrong: A Safety Analysis of Stable Diffusion Models

To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now

DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models

Exposing the Fake: Effective Diffusion-Generated Images Detection

DiffDefense: Defending against Adversarial Attacks via Diffusion Models

Securing Federated Diffusion Model With Dynamic Quantization for Generative AI Services in Multiple-Access Artificial Intelligence of Things