Research on Denoising Method Based on Improved Short - Time Spectrum Estimation

Jian Kang,Hongbo Wang
DOI: https://doi.org/10.1109/iccsnt.2016.8070263
2016-01-01
Abstract:With the popularity of mobile terminal equipment, voice communication is becoming more and more frequent, the application of speech recognition scene is increasing. All these put forward higher requirements on the accuracy of speech recognition, therefore, how to enhance the speech as effectively as possible is becoming more and more important. At present, a lot of research has been done on the preprocessing of speech, especially on how to reduce the noise. Various noise reduction algorithms have been raised, such as wavelet transform denoising, spectral subtraction and other solutions for different scenarios. In the traditional speech enhancement algorithm, only the current frame and the previous frame are used to estimate the speech spectrum of the current frame, which results in variable level noise and music noise problems. In this paper, we propose a speech enhancement method based on Speex open-source library for improved short-time spectral estimation. This method eliminates the background noise and avoids the white noise, and achieves an ideal balance between the recognition rate and the human audition effect for four different scenes. At the same time, the algorithm can set the threshold freely according to the setting of noise reduction intensity or the human voice enhancement, which enhances the flexibility of preprocessing. Compared with the traditional speech noise reduction method, the accuracy of speech recognition has improved more than 15 %.
What problem does this paper attempt to address?