Harmonic Intensity Feature for Robust Speech Recognition

许超,曹志刚
DOI: https://doi.org/10.3321/j.issn:1000-0054.2004.01.006
2004-01-01
Abstract:Automatic speech recognition (ASR) in noisy environments is a challenging problem. The performance of traditional Mel-frequency cepstral coefficient (MFCC) feature based ASR systems is dramatically degraded by additive noise. The harmonic intensity (H) feature was used to develop a robust ASR to replace the zero-order cepstral coefficient (C_0) or frame energy (E) feature in the MFCCs. A C_0-based ASR system, an E-based ASR system, and an H-based ASR system were tested with noise corrupted speech. The results show that the H-based ASR system has higher recognition accuracy and better robustness than the other systems.
What problem does this paper attempt to address?