Confidence measure based on forced-alignment for out-of-vocabulary term detection

Haiyang Li,Tieran Zheng,Guibin Zheng,Jiqing Han
DOI: https://doi.org/10.12733/jcis7812
2013-01-01
Journal of Computational Information Systems
Abstract:In this paper, we propose a method of confidence measure (CM) to improve the performance of out-ofvocabulary (OOV) term detection. For hypothesized OOV terms, the proposed method firstly obtains the acoustic likelihood by forced-alignment, and then computes the final CM for verification. The forcedalignment provides the mapping relation between the frame observations and the states of hidden Markov models. The CM can be calculated by averaging phone-level confidence or classifying confidence features of syllable with support vector machine. The confidence features take advantage of the merit of Chinese syllable structure and describe the confidences in every sub-syllable level. The experiments conducted on the Hub-4NE Mandarin database show that the proposed method of confidence measure can achieve improvements over the current lattice-based method for OOV detection. Copyright © 2013 Binary Information Press.
What problem does this paper attempt to address?