Abstract:The proliferation of AI techniques for image generation, coupled with their increasing accessibility, has raised significant concerns about the potential misuse of these images to spread misinformation. Recent AI-generated image detection (AGID) methods include CNNDetection, NPR, DM Image Detection, Fake Image Detection, DIRE, LASTED, GAN Image Detection, AIDE, SSP, DRCT, RINE, OCC-CLIP, De-Fake, and Deep Fake Detection. However, we argue that the current state-of-the-art AGID techniques are inadequate for effectively detecting contemporary AI-generated images and advocate for a comprehensive reevaluation of these methods. We introduce the Visual Counter Turing Test (VCT^2), a benchmark comprising ~130K images generated by contemporary text-to-image models (Stable Diffusion 2.1, Stable Diffusion XL, Stable Diffusion 3, DALL-E 3, and Midjourney 6). VCT^2 includes two sets of prompts sourced from tweets by the New York Times Twitter account and captions from the MS COCO dataset. We also evaluate the performance of the aforementioned AGID techniques on the VCT$^2$ benchmark, highlighting their ineffectiveness in detecting AI-generated images. As image-generative AI models continue to evolve, the need for a quantifiable framework to evaluate these models becomes increasingly critical. To meet this need, we propose the Visual AI Index (V_AI), which assesses generated images from various visual perspectives, including texture complexity and object coherence, setting a new standard for evaluating image-generative AI models. To foster research in this domain, we make our <a class="link-external link-https" href="https://huggingface.co/datasets/anonymous1233/COCO_AI" rel="external noopener nofollow">this https URL</a> and <a class="link-external link-https" href="https://huggingface.co/datasets/anonymous1233/twitter_AI" rel="external noopener nofollow">this https URL</a> datasets publicly available.

A Multimodal Approach for Detecting AI Generated Content using BERT and CNN

Detection of AI-Generated Synthetic Images with a Lightweight CNN

Visual Veracity: Advancing AI-Generated Image Detection with Convolutional Neural Networks

AI-Generated Video Content Detection Using Vision Language Models

Enhancing Text Authenticity: A Novel Hybrid Approach for AI-Generated Text Detection

Deep Learning Detection Method for Large Language Models-Generated Scientific Content

AI vs. AI: Can AI Detect AI-Generated Images?

Detecting the Undetectable: Combining Kolmogorov-Arnold Networks and MLP for AI-Generated Image Detection

Detecting AI Generated Text Based on NLP and Machine Learning Approaches

Development of a Dual-Input Neural Model for Detecting AI-Generated Imagery

Transfer Learning-Based Models for Comparative Evaluation for the Detection of AI-Generated Images

Deep Learning Multimodal Methods to Detect Fake News

On the Possibilities of AI-Generated Text Detection

ConvNLP: Image-based AI Text Detection

Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI)

MiRAGeNews: Multimodal Realistic AI-Generated News Detection

A Multimodal Framework for Deepfake Detection

Investigating the Evolving Landscape of Deepfake Technology: Generative AI's Role in it's Generation and Detection

Ensemble Techniques for Robust Fake News Detection: Integrating Transformers, Natural Language Processing, and Machine Learning

Deepfake Detection System Using Deep Neural Networks

AI-Generated Image Detection using a Cross-Attention Enhanced Dual-Stream Network