A Smoothing Algorithm For The Task Adaptation Chinese Trigram Model

Mh Jiang,Bz Yuan,Bq Lin,Xf Tang
DOI: https://doi.org/10.1109/ICOSP.1998.770317
1998-01-01
Abstract:The paper main solve above two problems. In the paper a Chinese Trigram model of task adaptation ability are set up. A zerogram to trigram probability statistics information base of 1994 "People Daily" are built, it made use of the success experience of HMM in speech recognition, anti adopted Baum-Welch algorithm for optimum of the weights. Each weigh stands for correlation statistic reliability of these models. The probability statistics matrix smooth algorithm of the parameter space was carried on, in order to offset the matrix sparse data of statistic probability. The "People Daily" corpus statistic results are regard as the preliminary statistic results. When the changing of application domain, then the recognition accuracy rate of the preliminary statistic results are declined, we adopted "PC World" as the corpus of the changing domain and carried on successive training, then second smooth of the preliminary statistic results and successive statistic results was look on as finally results. A trigram model of task adaptation is gotten. The experiment results show, this language model the workload of successive training befits, it can effectively reduce the perplexity of language model in task changing domain. it has a high language adaptation ability in the task changing domain.
What problem does this paper attempt to address?