Diffusion Noise Feature: Accurate and Fast Generated Image Detection

Yichi Zhang,Xiaogang Xu
2024-03-07
Abstract:Generative models have reached an advanced stage where they can produce remarkably realistic images. However, this remarkable generative capability also introduces the risk of disseminating false or misleading information. Notably, existing image detectors for generated images encounter challenges such as low accuracy and limited generalization. This paper seeks to address this issue by seeking a representation with strong generalization capabilities to enhance the detection of generated images. Our investigation has revealed that real and generated images display distinct latent Gaussian representations when subjected to an inverse diffusion process within a pre-trained diffusion model. Exploiting this disparity, we can amplify subtle artifacts in generated images. Building upon this insight, we introduce a novel image representation known as Diffusion Noise Feature (DNF). DNF is extracted from the estimated noise generated during the inverse diffusion process. A simple classifier, e.g., ResNet50, trained on DNF achieves high accuracy, robustness, and generalization capabilities for detecting generated images (even the corresponding generator is built with datasets/structures that are not seen during the classifier's training). We conducted experiments using four training datasets and five testsets, achieving state-of-the-art detection performance.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve The paper aims to address several key issues in generative image detection: 1. **Authenticity and Deceptiveness of Generated Images**: - Current generative models (such as diffusion models) are capable of producing highly realistic images, which poses a risk of spreading false information. For example, malicious actors can use these generative models to create convincing fake images for illegal activities like telecom fraud. 2. **Limitations of Existing Detection Methods**: - Existing generative image detection methods have low accuracy and limited generalization capabilities when dealing with images generated by the latest models (such as DALLĀ·E, Stable Diffusion, and Midjourney). These methods are usually designed based on earlier generative models (like GANs) and are no longer effective for images generated by new models. 3. **Generalization Across Datasets and Generators**: - Detection methods need to have the ability to generalize across datasets and generators, meaning they should maintain high accuracy on data outside the training dataset. Existing methods perform poorly in this regard, especially when facing unseen generators. ### Solution To address the above issues, the paper proposes a new image representation method called Diffusion Noise Feature (DNF). DNF is extracted through the following steps: 1. **Reverse Diffusion Process**: - Input the image to be detected into a pre-trained diffusion model and perform the reverse diffusion process. During this process, real and generated images exhibit different latent Gaussian representations. 2. **Amplification of Estimated Noise**: - The estimated noise generated through the reverse diffusion process contains a wealth of information about the original image distribution. This noise shows significant differences between real and generated images, thereby amplifying subtle artifacts. 3. **Fusion Strategy**: - Use an experimentally determined fusion strategy to merge the sequence of estimated noise generated during the reverse diffusion process into a single DNF representation. This representation is used as the input to a classifier to distinguish between real and generated images. ### Experimental Results - **High Accuracy**: Experiments on 4 training datasets and 5 test sets show that the classifier based on DNF achieves state-of-the-art performance in generative image detection, with an accuracy of 100%. - **Strong Robustness**: Even when images undergo common perturbations during transmission (such as Gaussian blur and JPEG compression), the classifier based on DNF can still maintain an accuracy of over 99.2%. - **Generalization Across Datasets and Generators**: The DNF classifier performs excellently in tests across datasets and generators, significantly outperforming other detection methods. In summary, by introducing DNF, the paper provides an efficient and accurate method for generative image detection, addressing the limitations of existing methods in detecting images generated by the latest generative models.