PrefIQA: Human Preference Learning for AI-generated Image Quality Assessment

Hengjian Gao,Kaiwei Zhang,Wei Sun,Chunyi Li,Huiyu Duan,Xiaohong Liu,Xiongkuo Min,Guangtao Zhai
DOI: https://doi.org/10.1109/iscas58744.2024.10558022
2024-01-01
Abstract:Despite recent advancements in generative models, the variation in image quality remains a significant concern. To tackle this issue, we propose PrefIQA, an effective human preference learning metric, which can better evaluate the quality of AI-generated images. PrefIQA consists of two units, namely Feature Extraction Unit and Feature Fusion Unit. In Feature Extraction Unit, we introduce a prompt-segmentation module to divide prompts into multiple phrases, enabling a more detailed evaluation of the alignment between images and texts. In Feature Fusion Unit, we introduce a modality-fusion module, which effectively mixes text features and image features to improve the overall performance. In the experiment part, extensive experiments are conducted, demonstrating that PrefIQA surpasses existing text-to-image alignment metrics. We believe that PrefIQA’s proposal would facilitate researches on AI-generated image quality assessment, and make a valuable contribution to the field of text-to-image generation.
What problem does this paper attempt to address?