Objective assessment of speech quality by combining Bark- and Mel-scale frequency

Honghong Chen,Weiqiang Zhang,Jia Liu,Qingsheng Yuan
DOI: https://doi.org/10.1109/ICOSP.2010.5655032
IF: 4.729
2010-01-01
Signal Processing
Abstract:Perceptual evaluation of speech quality (PESQ, ITU-T P.862) is a well known objective method for speech quality assessment. PESQ applies Bark-scale frequency to estimate the mean opinion score (MOS) for end-to-end speech quality assessment of narrow-band telephone networks and speech codec. This paper discusses a new objective estimation method by combining Bark-scale and Mel-scale frequency to improve the accuracy of PESQ. The objective assessment based on Mel-scale frequency is presented by following the PESQ framework and then they are combined together through score fusion. Experiment results shows that the objective score of the estimation method using Mel-scale frequency alone has good correlation with the subjective score. Comparative results show improvement in cases where Bark-scale frequency is combined with Mel-scale frequency.
What problem does this paper attempt to address?