Grammar-Based Semi-Supervised Incremental Learning in Automatic Speech Recognition and Labeling

Haifeng Li,Tian Zhang,Rongfa Qiu,Lin Ma
DOI: https://doi.org/10.1016/j.egypro.2012.02.321
2012-01-01
Energy Procedia
Abstract:The currently common methods in automatic speech recognition and labeling are usually supervised which need manually labeled transcriptions. Considering the high cost and time-consuming especially in the acoustic model training stage, a new data selection method named Grammar-based Semi-Supervised Incremental Learning is proposed requiring only a small number of manually labeled data to initialize the acoustic model. The initial model is loaded to recognize a great number of unlabeled transcriptions to receive the N-best results which are analyzed and further synthesized and used to choose the optimal results for iterately system retraining according to the proposed method. The experimental results show that this method can significantly improve the performance of the acoustic model in LVCSR, and moreover the whole labeling system. Besides, a syllable-covering explanation about the improvement is given.
What problem does this paper attempt to address?