Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis

Memoona Aziz,Umair Rehman,Syed Ali Safi,Amir Zaib Abbasi

2024-09-02

Abstract:The rapid advancements in AI technologies have revolutionized the production of graphical content across various sectors, including entertainment, advertising, and e-commerce. These developments have spurred the need for robust evaluation methods to assess the quality and realism of AI-generated images. To address this, we conducted three studies. First, we introduced and validated a questionnaire called Visual Verity, which measures photorealism, image quality, and text-image alignment. Second, we applied this questionnaire to assess images from AI models (DALL-E2, DALL-E3, GLIDE, Stable Diffusion) and camera-generated images, revealing that camera-generated images excelled in photorealism and text-image alignment, while AI models led in image quality. We also analyzed statistical properties, finding that camera-generated images scored lower in hue, saturation, and brightness. Third, we evaluated computational metrics' alignment with human judgments, identifying MS-SSIM and CLIP as the most consistent with human assessments. Additionally, we proposed the Neural Feature Similarity Score (NFSS) for assessing image quality. Our findings highlight the need for refining computational metrics to better capture human visual perception, thereby enhancing AI-generated content evaluation.

Human-Computer Interaction,Artificial Intelligence

What problem does this paper attempt to address?

The paper aims to address the following key issues: 1. **Development and Validation of Evaluation Metrics**: The paper designs three subjective questionnaires to assess the key dimensions of AI-generated images, including photorealism, image quality, and text-image alignment. These questionnaires are statistically validated to ensure reliability and validity. 2. **Comparison of Pixel-Level and Model-Level Metrics**: The study compares pixel-level metrics (such as SSIM, PSNR) and model-level metrics (such as FID, LPIPS, CLIP, etc.), and analyzes the consistency of these metrics with human perception. 3. **Design of Neural Feature Similarity Score (NFSS)**: A new metric, NFSS, is proposed to evaluate image quality, aiming to better align with human judgment. 4. **Expert Evaluation Strategy**: The Interpolative Binning Scale (IBS) is introduced to fairly evaluate the results of human and metric outputs. 5. **Performance Comparison of Different AI Models**: By comparing the quality of images generated by models such as DALL-E2, DALL-E3, GLIDE, and Stable Diffusion, the study examines the performance differences of these models across different dimensions. In summary, the main goal of this paper is to provide a comprehensive and statistically validated approach to evaluate the photorealism, text-image alignment, and image quality of AI-generated images, ensuring that these evaluation methods better reflect human perception.

Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis

A Comprehensive Survey on Computational Aesthetic Evaluation of Visual Art Images: Metrics and Challenges

Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities

Image Visual Realism: from Human Perception to Machine Computation.

Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI)

Visual Veracity: Advancing AI-Generated Image Detection with Convolutional Neural Networks

Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images

A study of the evaluation metrics for generative images containing combinational creativity

How Do You Perceive Differently from an AI - A Database for Semantic Distortion Measurement.

AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence

A Survey on Quality Metrics for Text-to-Image Models

Crafting Synthetic Realities: Examining Visual Realism and Misinformation Potential of Photorealistic AI-Generated Images

A Perceptual Quality Assessment Exploration for AIGC Images

The Development of Three Image Quality Evaluation Metrics Based on a Comprehensive Dataset

AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images

Is the development of objective image quality assessment methods keeping pace with technological developments?

Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images

Reinforcing Visual Content Integrity through Image Restoration and AI Recognition

Appeal and quality assessment for AI-generated images

What You See Is What Matters: A Novel Visual and Physics-Based Metric for Evaluating Video Generation Quality

Subjective and Objective Quality Assessment of Image: A Survey