Blind image quality assessment with semi-supervised learning

Xiwen Li,Zhihua Wang,Binwei Xu
DOI: https://doi.org/10.1016/j.jvcir.2024.104100
IF: 2.887
2024-02-01
Journal of Visual Communication and Image Representation
Abstract:Blind image quality assessment (BIQA) aims to automatically predict the perceptual quality of an image without requiring access to its pristine reference counterpart. BIQA models are typically developed through supervised learning, optimizing and testing them by comparing their predictions to human ratings, usually expressed as mean opinion scores (MOS), which can be labor-intensive to collect. The performance of these BIQA models is significantly reliant on the amount of labeled training data. When there is a shortage of human-rated data, these BIQA models may perform inadequately. In this study, we investigate the potential of incorporating unlabeled data to mitigate this issue and enhance the performance of BIQA models. To achieve this, we propose a deep ensemble-based BIQA model (referred to as the “target model”) with two heads: one for quality estimation and the other for pseudo-label generation. Initially, we train it on a small set of human-rated images where the supervisory signals are binary labels indicating the pairwise ranking of perceptual quality for image pairs. Then, the head responsible for pseudo-label generation assigns pseudo-binary labels to unlabeled pairs. Subsequently, we re-train the target model using a combination of labeled and pseudo-labeled datasets. This process can be iterated, allowing for the progressive improvement of the target model’s performance. We conduct comprehensive case studies to illustrate the advantages of utilizing unlabeled data for BIQA, particularly in terms of model generalization and identifying cases of model failure.
computer science, information systems, software engineering
What problem does this paper attempt to address?