Automatic speech segmentation for Chinese speech database based on HMM

Jianhua Tao,Horst Udo Hain
DOI: https://doi.org/10.1109/TENCON.2002.1181318
2002-01-01
Abstract:The paper offers an optimized method for speech segmentation of a Mandarin speech database by using a hidden Markov model (HMM). The method takes the syllable boundaries into account. Testing shows that the accuracy of results is improved to 95% from 88% compared to the normal method. In particular, most of the boundaries between two vowels can also be well detected with the new method. The paper also analyzes the influence of the amount of HMM states and the amount of the training corpus.
What problem does this paper attempt to address?