Predicting intelligibility of noise-suppressed speech

MA Jianfen, ZHANG Xueying
DOI: https://doi.org/10.3778/j.issn.1002-8331.1111-0092
2012-01-01
Abstract: This study proposes an objective measure for predicting the intelligibility of noise-suppressed speech that correlates more highly with subjective scores. The traditional frequency-weighted segmental SNR (fSNRseg) measure does not correlate highly with subjective scores because it does not account separately for the spectral attenuation and spectral amplification distortions introduced by speech enhancement algorithms. The proposed approach decomposes the fSNRseg measure into three regions, corresponding to attenuation distortion only, amplification distortion of up to 6.02 dB, and amplification distortion of 6.02 dB or greater, and calculates fSNRseg in each region separately. Multiple-regression analysis is then run on the three decomposed measures to maximize the correlation with subjective scores. The proposed objective measure achieves a high correlation (0.91) with sentence recognition scores.
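The decomposition described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the band-weight vector, the [-15, 15] dB clamp on per-cell SNR, the assignment of cells with exactly 0 dB gain to the attenuation region, and the choice to average only over frames that contain cells from a region are all assumptions made for illustration. The one fixed point is that 6.02 dB is 20 log10(2), i.e. the enhanced magnitude is twice the clean magnitude.

```python
import numpy as np

def region_masks(clean_mag, enh_mag):
    """Split time-frequency cells into three distortion regions.

    Inputs are magnitude spectra of shape (bands, frames).
    A gain of 6.02 dB corresponds to enh_mag == 2 * clean_mag.
    """
    attenuation = enh_mag <= clean_mag                        # attenuation only (0 dB included: assumption)
    mild_amp = (enh_mag > clean_mag) & (enh_mag < 2 * clean_mag)  # amplification below 6.02 dB
    strong_amp = enh_mag >= 2 * clean_mag                     # amplification of 6.02 dB or more
    return attenuation, mild_amp, strong_amp

def fsnrseg_region(clean_mag, enh_mag, weights, mask, snr_range=(-15.0, 15.0)):
    """Frequency-weighted segmental SNR restricted to the cells selected by mask.

    weights has shape (bands,). The clamp range is an assumed convention.
    """
    eps = 1e-12  # guards against division by zero and log of zero
    snr_db = 10 * np.log10((clean_mag ** 2 + eps) / ((clean_mag - enh_mag) ** 2 + eps))
    snr_db = np.clip(snr_db, *snr_range)
    w = weights[:, None] * mask          # zero weight outside the region
    num = (w * snr_db).sum(axis=0)       # weighted SNR per frame
    den = w.sum(axis=0)                  # total weight per frame
    valid = den > 0                      # frames with at least one cell in the region
    if not valid.any():
        return 0.0                       # assumed fallback when the region is empty
    return float((num[valid] / den[valid]).mean())

def fit_regression(features, scores):
    """Least-squares fit: scores ~ b0 + b1*R1 + b2*R2 + b3*R3."""
    design = np.column_stack([np.ones(len(features)), features])
    coef, *_ = np.linalg.lstsq(design, scores, rcond=None)
    return coef
```

In use, each processed sentence or condition would yield one three-element feature vector (one fSNRseg value per region), and `fit_regression` would be applied to the stacked vectors and the corresponding subjective scores. The fitted coefficients then weight the three regional measures into a single predicted score.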