Spatial Correlation Transformation for Speech Recognition

SU Tengrong, WU Ji, WANG Zuoying
DOI: https://doi.org/10.3321/j.issn:1000-0054.2009.10.020
2009-01-01
Abstract: The traditional hidden Markov model (HMM) for speech recognition ignores the relationships between speech signals. This paper presents a linear feature transformation, the Spatial Correlation Transformation, which exploits the correlation between different acoustic units of the same speaker (spatial correlation) to obtain new features with better discrimination. The key step is computing the optimum transformation matrix, which is determined by the Minimum Covariance criterion. In the Viterbi search, the recognition system uses the new features and the corresponding model parameters in place of the original features. Experiments show that this approach outperforms adaptation approaches on a speaker-independent recognition system, and that combining it with adaptation approaches improves system performance further.
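The abstract does not give the form of the transformation or of the Minimum Covariance criterion, so the following is only a generic sketch of what using "new features and the corresponding model parameters" can mean for a linear transform. It assumes Gaussian state densities (an assumption, not stated in the abstract): if y = A x and x ~ N(mean, cov), then y ~ N(A mean, A cov A^T). The matrix `A`, the helper names, and the numbers are hypothetical placeholders, not values from the paper.

```python
def mat_vec(a, x):
    """Multiply matrix a (a list of rows) by vector x."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in a]


def mat_mul(a, b):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(p * q for p, q in zip(row, col)) for col in cols] for row in a]


def transpose(a):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*a)]


def transform_gaussian(a, mean, cov):
    """Parameters of y = A x when x ~ N(mean, cov): (A mean, A cov A^T)."""
    new_mean = mat_vec(a, mean)
    new_cov = mat_mul(mat_mul(a, cov), transpose(a))
    return new_mean, new_cov


# Hypothetical 2-D example. A is an arbitrary placeholder; the paper's
# optimum matrix comes from the Minimum Covariance criterion, which is
# not implemented here.
A = [[1.0, 0.5], [0.0, 1.0]]
mean = [1.0, 2.0]
cov = [[2.0, 0.0], [0.0, 1.0]]

new_mean, new_cov = transform_gaussian(A, mean, cov)
```

Transforming the model parameters together with the features keeps the likelihoods used during decoding consistent with the new feature space; how the paper actually does this is not described in the abstract.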