Research on Context-Dependent Acoustical Unit (Triphone) for Mandarin Continuous Speech Recognition
Qingwei Zhao, Zuoying Wang, Dajin Lu
DOI: https://doi.org/10.3321/j.issn:0372-2112.1999.06.017
1999-01-01
Tien Tzu Hsueh Pao/Acta Electronica Sinica
Abstract: This paper discusses how to build context-dependent models for continuous Mandarin speech recognition in order to mitigate the impact of coarticulation. Drawing on information theory, it first examines the distance metrics used by the traditional clustering algorithm: the divergence between model distributions and the change in entropy that results from merging or splitting models. It then presents a clustering algorithm based on decision trees, which makes full use of phonological rules. The resulting models generalize easily, and the method performs especially well when many triphones occur that are not covered by the training material. The clustering and training procedure is also discussed. Finally, a speaker-independent large-vocabulary continuous speech recognition experiment shows that, when the recognition material differs from the training material, the models obtained with the decision-tree-based clustering algorithm reduce the error rate by 7.95%, whereas the models obtained with the traditional merging algorithm reduce it by only 2.63%.
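The entropy-based merging distance mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so the following rests on assumptions that are not taken from the paper: each model state is a single diagonal-covariance Gaussian, each state carries an occupancy count, and the cost of a merge is the increase in occupancy-weighted entropy. The function names `gaussian_entropy`, `merge` and `merge_cost` are hypothetical.

```python
import math


def gaussian_entropy(variances):
    """Differential entropy (in nats) of a diagonal-covariance Gaussian."""
    return 0.5 * sum(math.log(2.0 * math.pi * math.e * v) for v in variances)


def merge(n1, mean1, var1, n2, mean2, var2):
    """Pool two diagonal Gaussians, weighting each by its occupancy count."""
    n = n1 + n2
    mean = [(n1 * a + n2 * b) / n for a, b in zip(mean1, mean2)]
    # Pooled variance: weighted second moment minus the squared pooled mean.
    var = [
        (n1 * (v1 + a * a) + n2 * (v2 + b * b)) / n - m * m
        for a, v1, b, v2, m in zip(mean1, var1, mean2, var2, mean)
    ]
    return n, mean, var


def merge_cost(n1, mean1, var1, n2, mean2, var2):
    """Increase in occupancy-weighted entropy caused by merging two states.

    Assumed form (not from the paper): n*H(merged) - n1*H(1) - n2*H(2).
    A small cost suggests the two states are acoustically similar.
    """
    n, _, var = merge(n1, mean1, var1, n2, mean2, var2)
    return (
        n * gaussian_entropy(var)
        - n1 * gaussian_entropy(var1)
        - n2 * gaussian_entropy(var2)
    )


# Two one-dimensional states with equal variance but different means.
cost_far = merge_cost(1, [0.0], [1.0], 1, [2.0], [1.0])
# Two identical states: merging them should cost (almost) nothing.
cost_same = merge_cost(2, [0.0], [1.0], 3, [0.0], [1.0])
```

Under these assumptions, a bottom-up merging procedure would repeatedly merge the pair of states with the smallest cost, while a decision-tree procedure would instead choose, at each node, the phonological question whose split yields the largest entropy reduction.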