Analysis of noise robustness of auditory features in speech recognition

李银国,欧阳希子,郑方
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2013.08.015
2013-01-01
Abstract:A particular difficulty of automatic speech recognition in real applications involves significant performance degradation in noisy environment. Based on the research on gammatone-based auditory features (GFCCs) proposed by other researchers, an additional comparative study on the GFCC and the MFCC was presented for various noise conditions. Particularly, the behavior of GFCC/MFCC features with noise in different frequency bands was analyzed by mixing the test speech with sine noises to show that the GFCC is more robust against low-frequency noises than the MFCC while more sensitive to noises at middle and high frequencies. This property is desirable for speech recognition since most of the information of human speech resides in the low frequency band of 300-700 Hz. Experimental results demonstrate that the GFCC exhibits significant advantages over the MFCC for various noise conditions, especially when the SNR is low.
What problem does this paper attempt to address?