Abstract: The rapid development of image generative models has lowered the threshold for image creation but has also raised security concerns about the spread of false information, making detection technologies for AI-generated images an urgent need. Text-to-image generation is currently the predominant approach to image generation, and the appearance of a generated image depends on two primary factors: the text prompt and the inherent characteristics of the model. Because semantically varied text prompts yield diverse generated images, existing detection methods that learn solely from image features face significant challenges, particularly when few samples are available. To tackle these challenges, this paper presents a novel perspective on the AI-generated image detection task, advocating for detection under semantic-decoupling conditions. Building on this insight, we propose SemGIR, a semantic-guided image regeneration based method for AI-generated image detection. SemGIR first regenerates an image through an image-to-text step followed by a text-to-image generation step, and then uses the resulting pairs of original and regenerated images to derive discriminative features. This regeneration process decouples the semantic features, allowing detection to concentrate on the inherent characteristics of the generative model. The same scheme can also be applied to attribution. Experimental results show that in realistic scenarios with limited samples, SemGIR achieves an average detection accuracy 15.76% higher than state-of-the-art (SOTA) methods. Furthermore, in attribution experiments on the SDv2.1 model, SemGIR attains an accuracy exceeding 98%, affirming the effectiveness and practical utility of the proposed method.
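The regeneration idea described in the abstract can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the paper: `caption_fn` stands in for any image-to-text model, `generate_fn` for any text-to-image model, images are toy flat lists of pixel values, and the per-pixel absolute difference is an assumed placeholder for whatever discriminative features SemGIR actually derives from the image pair.

```python
from typing import Callable, List

# Toy stand-in for an image: a flat list of pixel intensities in [0, 1].
Image = List[float]


def regenerate(image: Image,
               caption_fn: Callable[[Image], str],
               generate_fn: Callable[[str], Image]) -> Image:
    """Image-to-text followed by text-to-image (both models are hypothetical)."""
    prompt = caption_fn(image)
    return generate_fn(prompt)


def pair_features(image: Image, regenerated: Image) -> List[float]:
    """Assumed pair feature: per-pixel absolute difference.

    The abstract does not specify the feature; this is only a placeholder.
    """
    if len(image) != len(regenerated):
        raise ValueError("images must have the same number of pixels")
    return [abs(a - b) for a, b in zip(image, regenerated)]


def regeneration_gap(image: Image,
                     caption_fn: Callable[[Image], str],
                     generate_fn: Callable[[str], Image]) -> float:
    """Mean of the pair features: how far the image is from its regeneration."""
    feats = pair_features(image, regenerate(image, caption_fn, generate_fn))
    return sum(feats) / len(feats)


# Toy stubs so the sketch runs without any real captioning or generation model.
def toy_caption(image: Image) -> str:
    return "bright" if sum(image) / len(image) > 0.5 else "dark"


def toy_generate(prompt: str) -> Image:
    return [0.9] * 4 if prompt == "bright" else [0.1] * 4


# An image that the toy generator itself would produce, and one it would not.
generated_like = [0.9, 0.9, 0.9, 0.9]
real_like = [0.6, 1.0, 0.7, 0.8]

gap_generated = regeneration_gap(generated_like, toy_caption, toy_generate)
gap_real = regeneration_gap(real_like, toy_caption, toy_generate)
```

In this toy setup both images receive the same caption, so the semantic content is held fixed and any remaining gap comes from the generator's own output characteristics. In practice a trained classifier over learned pair features would replace the single mean-difference score used here.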

Semantic Draw Engineering for Text-to-Image Creation

Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge.

SIMGAN: Photo-Realistic Semantic Image Manipulation Using Generative Adversarial Networks.

Creating Word Paintings Jointly Considering Semantics, Attention, and Aesthetics.

SmartPaint: a Co-Creative Drawing System Based on Generative Adversarial Networks

Semantics Disentangling for Text-to-Image Generation

GAN-based AI Drawing Board for Image Generation and Colorization

Text-to-image Generation Based on Spatial-Channel Attention and Semantic Redescription

DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation

Semantic Style Transfer and Turning Two-Bit Doodles into Fine Artworks

Disentangling for Text-to-Image Generation

Creative and Diverse Artwork Generation Using Adversarial Networks

Investigation related to application of Generative Adversarial Networks in text-to-image synthesis

Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes

AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation

R-GAN: Exploring Human-like Way for Reasonable Text-to-Image Synthesis via Generative Adversarial Networks

Linear Semantics in Generative Adversarial Networks

SemGIR: Semantic-Guided Image Regeneration Based Method for AI-generated Image Detection and Attribution

Intelligent Typography: Artistic Text Style Transfer for Complex Texture and Structure

Learning to Draw Text in Natural Images with Conditional Adversarial Networks

DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis