Speech enhancement based on soft audible noise masking and noise power estimation

Rongshan Yu
DOI: https://doi.org/10.1016/j.specom.2013.05.006
IF: 2.723
2013-01-01
Speech Communication
Abstract:This paper presents a perceptual model based speech enhancement algorithm. The proposed algorithm measures the amount of the audible noise in the input noisy speech based on estimation of short-time spectral power of noise signal, and masking threshold calculated from the estimated spectrum of clean speech. An appropriate amount of noise reduction is chosen based on the result to achieve good noise suppression without introducing significant distortion to the clean speech. To mitigate the problem of ''musical noise'', the amount of noise reduction is linked directly to the estimation of short-term noise spectral amplitude instead of noise variance so that the spectral peaks of noise can be better suppressed. Good performance of the proposed speech enhancement system is confirmed through objective and subjective tests.
What problem does this paper attempt to address?