Abstract:The proliferation of AI techniques for image generation, coupled with their increasing accessibility, has raised significant concerns about the potential misuse of these images to spread misinformation. Recent AI-generated image detection (AGID) methods include CNNDetection, NPR, DM Image Detection, Fake Image Detection, DIRE, LASTED, GAN Image Detection, AIDE, SSP, DRCT, RINE, OCC-CLIP, De-Fake, and Deep Fake Detection. However, we argue that the current state-of-the-art AGID techniques are inadequate for effectively detecting contemporary AI-generated images and advocate for a comprehensive reevaluation of these methods. We introduce the Visual Counter Turing Test (VCT^2), a benchmark comprising ~130K images generated by contemporary text-to-image models (Stable Diffusion 2.1, Stable Diffusion XL, Stable Diffusion 3, DALL-E 3, and Midjourney 6). VCT^2 includes two sets of prompts sourced from tweets by the New York Times Twitter account and captions from the MS COCO dataset. We also evaluate the performance of the aforementioned AGID techniques on the VCT$^2$ benchmark, highlighting their ineffectiveness in detecting AI-generated images. As image-generative AI models continue to evolve, the need for a quantifiable framework to evaluate these models becomes increasingly critical. To meet this need, we propose the Visual AI Index (V_AI), which assesses generated images from various visual perspectives, including texture complexity and object coherence, setting a new standard for evaluating image-generative AI models. To foster research in this domain, we make our <a class="link-external link-https" href="https://huggingface.co/datasets/anonymous1233/COCO_AI" rel="external noopener nofollow">this https URL</a> and <a class="link-external link-https" href="https://huggingface.co/datasets/anonymous1233/twitter_AI" rel="external noopener nofollow">this https URL</a> datasets publicly available.

AI-Generated Video Content Detection Using Vision Language Models

Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features

Turns Out I'm Not Real: Towards Robust Detection of AI-Generated Videos

A Multimodal Approach for Detecting AI Generated Content using BERT and CNN

Visual Veracity: Advancing AI-Generated Image Detection with Convolutional Neural Networks

What Matters in Detecting AI-Generated Videos like Sora?

AI-Generated Video Detection via Spatio-Temporal Anomaly Learning

Beyond Deepfake Images: Detecting AI-Generated Videos

Image and Video Generation Using Artificial Intelligence and its Detection

A Survey of Defenses against AI-generated Visual Media: Detection, Disruption, and Authentication

A Survey of AI-Generated Video Evaluation

Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method

Detection of AI-Generated Synthetic Faces

Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI)

Understanding and Improving Training-Free AI-Generated Image Detections with Vision Foundation Models

AI vs. AI: Can AI Detect AI-Generated Images?

Transfer Learning-Based Models for Comparative Evaluation for the Detection of AI-Generated Images

Detecting Multimedia Generated by Large AI Models: A Survey

Harnessing Machine Learning for Discerning AI-Generated Synthetic Images

Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities