Multidimensional Similarity Fusion for Speech Quality Assessment

Fan Huang,Xiongkuo Min,Yuqin Cao,Xiao-Ping Zhang,Guangtao Zhai
DOI: https://doi.org/10.1109/iscas58744.2024.10558180
2024-01-01
Abstract:Since the perceptual quality of audio signals is easily to be affected by compression, transmission, noise adding, etc, it is of great significance to develop an effective audio quality assessment (AQA) method to measure end-user’s quality of experience. In this paper, we propose a full reference AQA model named Multidimensional Similarity Fusion for Audio Quality Assessment (MSF-AQA). We generalize the similarity-based image quality assessment methods for audio, then extract audio similarity features from multiple dimensions, and finally regress the multidimensional similarity features into the final quality score. The experimental results across three databases indicate that our MSF-AQA model outperforms the state-of-the-art AQA methods.
What problem does this paper attempt to address?