Abstract: In recent years, artificial intelligence-generated content (AIGC) enabled by foundation models has received increasing attention and developed rapidly. Text prompts can now be converted into high-quality, photo-realistic images. This capability, however, imposes extremely high bandwidth requirements for compressing and transmitting the vast number of AI-generated images (AIGI) that such AIGC services produce. Despite this challenge, research on compression methods for AIGI remains conspicuously lacking. This work addresses the gap by introducing a pioneering AIGI dataset, PKU-AIGI-500K, comprising more than 105k diverse prompts and more than 528k images derived from five major foundation models. Using this dataset, we analyze the essential characteristics of AIGIs and show empirically that existing data-driven lossy compression methods achieve sub-optimal rate-distortion performance without fine-tuning, primarily due to a domain shift between AIGIs and natural images. We comprehensively benchmark the rate-distortion performance and runtime complexity of openly available conventional and learned image coding solutions, uncovering new insights for emerging studies in AIGI compression. Moreover, to fully exploit the redundant information between an AIGI and its corresponding text, we propose an AIGI compression model (Cross-Attention Transformer Codec, CATC) trained on this dataset as a strong baseline. Experimental results demonstrate that the proposed model achieves up to 30.09% bitrate reduction compared to the state-of-the-art (SOTA) H.266/VVC codec and outperforms the SOTA learned codec, paving the way for future research in AIGI compression.

AIGC Image Quality Assessment Via Image-Prompt Correspondence

CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP

Quality Assessment of AI-Generated Image Based on Cross-modal Correlation

A Perceptual Quality Assessment Exploration for AIGC Images

PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition

AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity

AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment

Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap

Bringing Textual Prompt to AI-Generated Image Quality Assessment

TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment

Advancing Video Quality Assessment for AIGC

AI-generated Image Quality Assessment in Visual Communication

PSCR: Patches Sampling-based Contrastive Regression for AIGC Image Quality Assessment

AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence

AIGC-VQA: A Holistic Perception Metric for AIGC Video Quality Assessment

PKU-AIGI-500K: A Neural Compression Benchmark and Model for AI-Generated Images

Large Multi-modality Model Assisted AI-Generated Image Quality Assessment

SF-IQA: Quality and Similarity Integration for AI Generated Image Quality Assessment

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

Generalized Visual Quality Assessment of GAN-Generated Face Images