Robust Speech Recognition Based on Spectral Adjusting and Warping

R Zhao,Z Wang
DOI: https://doi.org/10.1109/icassp.2005.1415173
2005-01-01
Abstract:In this paper, we first propose a new channel adaptation method named spectral adjusting (SA) which adjusts the amplitude spectrum of the channel distorted speech with an adjusting function to reduce the channel distortion. Then, we combine vocal tract length normalization (VTLN), which warps the frequency scale of the speech spectrum to do speaker normalization, with SA to adjust and warp the speech spectrum. So the channel and speaker variations can be compensated for together. We call the combined method spectral adjusting and warping (SAW). In the SA method, the adjusting function is approximated by a piece-wise linear function, and the parameters of the piece-wise linear function are estimated by a gradient projection algorithm with short adaptation utterances based on the ML rule. The evaluating experiments were carried out on telephone speech recognition in a duration distribution based HMM (DDBHMM) system. Experimental results showed that SA yielded a relative error rate reduction of 10.44% over the baseline, and SAW led to a greater reduction of 14.6%.
What problem does this paper attempt to address?