Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics

Zhangkai Ni, Yue Liu, Keyan Ding, Wenhan Yang, Hanli Wang, Shiqi Wang
2024-05-29
Abstract: Deep learning-based methods have significantly influenced the blind image quality assessment (BIQA) field; however, these methods often require training on large amounts of human rating data. In contrast, traditional knowledge-based methods are cost-effective to train but struggle to extract features that align with human visual perception. To bridge these gaps, we propose integrating deep features from pre-trained visual models with a statistical analysis model into a Multi-scale Deep Feature Statistics (MDFS) model for achieving opinion-unaware BIQA (OU-BIQA), thereby eliminating the reliance on human rating data and significantly improving training efficiency. Specifically, we extract patch-wise multi-scale features from pre-trained vision models and fit a multivariate Gaussian (MVG) model to them. The final quality score is determined by quantifying the distance between the MVG model derived from the test image and the benchmark MVG model derived from the high-quality image set. A comprehensive series of experiments conducted on various datasets shows that our proposed model exhibits superior consistency with human visual perception compared to state-of-the-art BIQA models. Furthermore, it shows improved generalizability across diverse target-specific BIQA tasks. Our code is available at: https://github.com/eezkni/MDFS
Computer Vision and Pattern Recognition, Multimedia, Image and Video Processing
What problem does this paper attempt to address?
The paper addresses blind image quality assessment (BIQA): predicting the quality score of a distorted image when no reference image is available. It proposes a Multi-scale Deep Feature Statistics (MDFS) model for opinion-unaware BIQA (OU-BIQA), meaning that the model is built without human rating data. The approach extracts patch-wise multi-scale features with pre-trained vision models and fits a multivariate Gaussian (MVG) model to them. The quality score is the distance between the MVG model of the test image and a benchmark MVG model derived from a set of high-quality images. Because it does not depend on large amounts of manually annotated data, the model is more efficient to train than deep learning-based methods that learn from human ratings. Experiments on multiple datasets show that its predictions are more consistent with human visual perception than those of state-of-the-art BIQA models, and that it generalizes better across diverse target-specific BIQA tasks.
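The scoring step described above (fit an MVG to features, then measure its distance to a benchmark MVG) can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the feature arrays are random stand-ins for patch-wise deep features, the function names `fit_mvg` and `mvg_distance` are hypothetical, and the distance is a NIQE-style Mahalanobis-like form with a pooled covariance, which the abstract does not specify.

```python
import numpy as np


def fit_mvg(features):
    """Fit a multivariate Gaussian to an (n_patches, dim) feature array.

    Returns the mean vector and the covariance matrix.
    """
    mu = features.mean(axis=0)
    sigma = np.cov(features, rowvar=False)
    return mu, sigma


def mvg_distance(mu_a, sigma_a, mu_b, sigma_b):
    """NIQE-style distance between two MVG models (an assumed choice).

    Uses the average of the two covariances and a pseudo-inverse so that
    a singular pooled covariance does not raise an error.
    """
    diff = mu_a - mu_b
    pooled = (sigma_a + sigma_b) / 2.0
    squared = diff @ np.linalg.pinv(pooled) @ diff
    return float(np.sqrt(max(squared, 0.0)))


# Synthetic stand-ins for patch-wise deep features (assumption: 8 dimensions).
rng = np.random.default_rng(0)
dim = 8
pristine_features = rng.normal(0.0, 1.0, size=(5000, dim))    # high-quality image set
clean_test_features = rng.normal(0.0, 1.0, size=(400, dim))   # test image resembling the set
distorted_test_features = rng.normal(0.8, 1.5, size=(400, dim))  # test image with shifted statistics

# Benchmark MVG is fitted once on the high-quality set and then reused.
bench_mu, bench_sigma = fit_mvg(pristine_features)

score_clean = mvg_distance(*fit_mvg(clean_test_features), bench_mu, bench_sigma)
score_distorted = mvg_distance(*fit_mvg(distorted_test_features), bench_mu, bench_sigma)

print(f"clean test image score:     {score_clean:.3f}")
print(f"distorted test image score: {score_distorted:.3f}")
```

In this sketch a larger score means the test image's feature statistics lie farther from the benchmark, so it would be read as lower quality. No human ratings enter at any point, which is what makes the setup opinion-unaware: the only "training" is estimating the benchmark mean and covariance.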