Abstract: Generative models have reached an advanced stage where they can produce remarkably realistic images. However, this generative capability also introduces the risk of disseminating false or misleading information. Notably, existing detectors for generated images suffer from low accuracy and limited generalization. This paper addresses the issue by seeking a representation with strong generalization capabilities to enhance the detection of generated images. Our investigation has revealed that real and generated images display distinct latent Gaussian representations when subjected to an inverse diffusion process within a pre-trained diffusion model. Exploiting this disparity, we can amplify subtle artifacts in generated images. Building upon this insight, we introduce a novel image representation known as Diffusion Noise Feature (DNF). DNF is extracted from the estimated noise generated during the inverse diffusion process. A simple classifier, e.g., ResNet50, trained on DNF achieves high accuracy, robustness, and generalization capabilities for detecting generated images (even when the corresponding generator is built with datasets or structures not seen during the classifier's training). We conducted experiments using four training datasets and five test sets, achieving state-of-the-art detection performance.

What problem does this paper attempt to address?

### Problems the Paper Aims to Solve

The paper aims to address several key issues in generated image detection:

1. **Authenticity and Deceptiveness of Generated Images**:
   - Current generative models (such as diffusion models) can produce highly realistic images, which creates a risk of spreading false information. For example, malicious actors can use these models to create convincing fake images for illegal activities such as telecom fraud.
2. **Limitations of Existing Detection Methods**:
   - Existing detection methods have low accuracy and limited generalization on images generated by the latest models (such as DALL·E, Stable Diffusion, and Midjourney). These methods were usually designed around earlier generative models (such as GANs) and are no longer effective on images from newer models.
3. **Generalization Across Datasets and Generators**:
   - Detection methods need to generalize across datasets and generators, meaning they should maintain high accuracy on data outside the training dataset. Existing methods perform poorly in this regard, especially on unseen generators.

### Solution

To address the above issues, the paper proposes a new image representation called Diffusion Noise Feature (DNF). DNF is extracted through the following steps:

1. **Inverse Diffusion Process**:
   - Input the image to be detected into a pre-trained diffusion model and run the inverse diffusion process, which maps the image toward a latent Gaussian representation. Real and generated images end up with different latent Gaussian representations.
2. **Amplification of Artifacts via Estimated Noise**:
   - The noise estimated at each step of the inverse diffusion process carries information about the distribution of the original image. This estimated noise differs noticeably between real and generated images, which amplifies subtle artifacts.
3. **Fusion Strategy**:
   - Use an experimentally determined fusion strategy to merge the sequence of estimated noise produced during the inverse diffusion process into a single DNF representation. This representation is the input to a classifier that distinguishes real from generated images.

### Experimental Results

- **High Accuracy**: Experiments on 4 training datasets and 5 test sets show that the DNF-based classifier achieves state-of-the-art performance in generated image detection, with a reported accuracy of 100%.
- **Strong Robustness**: Even when images undergo common perturbations during transmission (such as Gaussian blur and JPEG compression), the DNF-based classifier still maintains an accuracy of over 99.2%.
- **Generalization Across Datasets and Generators**: The DNF-based classifier performs well in cross-dataset and cross-generator tests, significantly outperforming other detection methods.

In summary, by introducing DNF, the paper provides an efficient and accurate method for generated image detection, addressing the limitations of existing methods on images produced by the latest generative models.
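The extraction steps listed under Solution can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a deterministic DDIM-style inversion and a linear beta schedule, and it replaces the pre-trained noise-prediction network with a hypothetical stand-in (`toy_noise_predictor`). The fusion options in `fuse_noise` are placeholders as well, since the summary only says that the fusion strategy was chosen experimentally.

```python
import numpy as np


def make_alpha_bars(num_train_steps=1000, beta_start=1e-4, beta_end=2e-2):
    """Cumulative products of (1 - beta_t) for an assumed linear beta schedule."""
    betas = np.linspace(beta_start, beta_end, num_train_steps)
    return np.cumprod(1.0 - betas)


def toy_noise_predictor(x, t):
    """Hypothetical stand-in for a pre-trained network eps_theta(x_t, t).

    A real pipeline would call the diffusion model's noise predictor here.
    """
    return np.tanh(x) * (t + 1) / 1000.0


def ddim_invert_collect_noise(x0, eps_model, alpha_bars, num_steps=20):
    """Run DDIM-style inversion from an image toward the latent.

    Returns the estimated noise from every step (stacked) and the final latent.
    """
    # Evenly spaced, increasing timesteps over the training schedule.
    timesteps = np.linspace(0, len(alpha_bars) - 1, num_steps + 1).astype(int)
    x = np.asarray(x0, dtype=np.float64)
    noises = []
    for t_cur, t_next in zip(timesteps[:-1], timesteps[1:]):
        a_cur, a_next = alpha_bars[t_cur], alpha_bars[t_next]
        eps = eps_model(x, t_cur)  # estimated noise at this step
        # Clean-image estimate implied by the current sample and noise.
        x0_pred = (x - np.sqrt(1.0 - a_cur) * eps) / np.sqrt(a_cur)
        # Move one step toward higher noise, reusing the same noise estimate.
        x = np.sqrt(a_next) * x0_pred + np.sqrt(1.0 - a_next) * eps
        noises.append(eps)
    return np.stack(noises), x


def fuse_noise(noises, strategy="first"):
    """Merge the noise sequence into one feature map (illustrative options only)."""
    if strategy == "first":
        return noises[0]
    if strategy == "last":
        return noises[-1]
    if strategy == "mean":
        return noises.mean(axis=0)
    raise ValueError(f"unknown fusion strategy: {strategy}")


# Usage: a random array stands in for a normalized 3x8x8 image.
rng = np.random.default_rng(0)
image = rng.random((3, 8, 8))
noises, latent = ddim_invert_collect_noise(
    image, toy_noise_predictor, make_alpha_bars(), num_steps=20
)
dnf = fuse_noise(noises, strategy="mean")  # same shape as the input image
```

The fused array has the same shape as the input image, so it can be passed to an ordinary image classifier such as the ResNet50 mentioned in the abstract.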

Diffusion Noise Feature: Accurate and Fast Generated Image Detection

Exposing the Fake: Effective Diffusion-Generated Images Detection

Time Step Generating: A Universal Synthesized Deepfake Image Detector

DIRE for Diffusion-Generated Image Detection

Learning on Less: Constraining Pre-trained Model Learning for Generalizable Diffusion-Generated Image Detection

AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model

Out-of-distribution Detection with Diffusion-based Neighborhood

Detecting images generated by diffusers

Detecting Generated Images by Real Images Only

Intriguing Properties of Diffusion Models: An Empirical Study of the Natural Attack Capability in Text-to-Image Generative Models

Generative Edge Detection with Stable Diffusion

Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation

Edge-preserving noise for diffusion models

Diffusion Model for Generative Image Denoising

Diffusion-GAN: Training GANs with Diffusion

Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance

Masked Diffusion Models Are Fast Distribution Learners

Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models

Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers?

Diffusion-Generated Fake Face Detection by Exploring Wavelet Domain Forgery Clues