Selection of acoustic modeling unit for Tibetan speech recognition based on deep learning

Baojia Gong,Rangzhuoma Cai,Zhijie Cai,Yuntao Ding,Maozhaxi Peng
DOI: https://doi.org/10.1051/matecconf/202133606014
2021-01-01
MATEC Web of Conferences
Abstract:The selection of the speech recognition modeling unit is the primary problem of acoustic modeling in speech recognition, and different acoustic modeling units will directly affect the overall performance of speech recognition. This paper designs the Tibetan character segmentation and labeling model and algorithm flow for the purpose of solving the problem of selecting the acoustic modeling unit in Tibetan speech recognition by studying and analyzing the deficiencies of the existing acoustic modeling units in Tibetan speech recognition. After experimental verification, the Tibetan character segmentation and labeling model and algorithm achieved good performance of character segmentation and labeling, and the accuracy of Tibetan character segmentation and labeling reached 99.98%, respectively.
English Else
What problem does this paper attempt to address?