Abstract:The advent of AI has influenced many aspects of human life, from self-driving cars and intelligent chatbots to text-based image and video generation models capable of creating realistic images and videos based on user prompts (text-to-image, image-to-image, and image-to-video). AI-based methods for image and video super resolution, video frame interpolation, denoising, and compression have already gathered significant attention and interest in the industry and some solutions are already being implemented in real-world products and services. However, to achieve widespread integration and acceptance, AI-generated and enhanced content must be visually accurate, adhere to intended use, and maintain high visual quality to avoid degrading the end user's quality of experience (QoE). One way to monitor and control the visual "quality" of AI-generated and -enhanced content is by deploying Image Quality Assessment (IQA) and Video Quality Assessment (VQA) models. However, most existing IQA and VQA models measure visual fidelity in terms of "reconstruction" quality against a pristine reference content and were not designed to assess the quality of "generative" artifacts. To address this, newer metrics and models have recently been proposed, but their performance evaluation and overall efficacy have been limited by datasets that were too small or otherwise lack representative content and/or distortion capacity; and by performance measures that can accurately report the success of an IQA/VQA model for "GenAI". This paper examines the current shortcomings and possibilities presented by AI-generated and enhanced image and video content, with a particular focus on end-user perceived quality. Finally, we discuss open questions and make recommendations for future work on the "GenAI" quality assessment problems, towards further progressing on this interesting and relevant field of research.

MoE-AGIQA: Mixture-of-Experts Boosted Visual Perception-Driven and Semantic-Aware Quality Assessment for AI-Generated Images

Large Multi-modality Model Assisted AI-Generated Image Quality Assessment

AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity

A Perceptual Quality Assessment Exploration for AIGC Images

Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment

CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP

AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment

Bringing Textual Prompt to AI-Generated Image Quality Assessment

McmIQA: Multi-Module Collaborative Model for No-Reference Image Quality Assessment

Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities

TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment

AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment

Re-IQA: Unsupervised Learning for Image Quality Assessment in the Wild

Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images

PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

AIGC-VQA: A Holistic Perception Metric for AIGC Video Quality Assessment

GIQA: Generated Image Quality Assessment

PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images

Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap