Probability Decision-Driven Speech Enhancement Algorithm Based on Human Acoustic Perception

Lu Zhang,Mingjiang Wang,Muhammad Idrees,Qiquan Zhang
DOI: https://doi.org/10.1049/iet-spr.2020.0056
IF: 1.819
2020-01-01
IET Signal Processing
Abstract:In this study, a novel human acoustic perception motivated Wiener filter speech enhancement system is presented to cope with real-world interfering background noises. Guiding by the speech presence probability, two alternative methods are proposed to reduce the noise by adopting the audible sound pressure level (SPL) and the masking characteristic of the human auditory system to achieve better listening comfort level. More specifically, when the probability of speech presence in the noisy signal is less than the decision threshold, a new SPL compressed method effectively reduces the noise. When the speech presence probability is more than the decision threshold, an improved acoustical mask threshold constrained Wiener filter approach enhances the noisy speech. Moreover, in order to evaluate the performance of the new system, the proposed algorithm is compared with the classic prior signal-to-noise ratio-based Wiener filter and three acoustic perception related algorithms. The experimental results show that the proposed algorithm significantly outperforms the four comparing algorithms in terms of speech quality and intelligibility either in stationary or moderate non-stationary noisy environments. Thus, the intended approach can be employed as the front-end module for various speech-related applications.
What problem does this paper attempt to address?