Lipreading HLM and Text Flow Analysis

WANG Dan, YAO Hong-xun, WAN Yu-qi, HONG Xiao-peng
DOI: https://doi.org/10.3969/j.issn.1002-137X.2008.12.045
2008-01-01
Computer Science
Abstract: Because the mapping from lip-movement sequences to language sequences is one-to-many, an HMM alone is far from sufficient for lipreading recognition. This paper proposes a novel recognition model, HLM (HMM and Bigram Language Model), which is based on the HMM and incorporates prior knowledge of the language. In contrast to the traditional framework, which relies purely on the acoustic HMM posterior probability for recognition, HLM tightly couples linguistic background knowledge with the HMM. The language model supplies the statistics that capture this background knowledge, and in the recognition stage the acoustic posterior probability and the linguistic prior probability are fused to make the decision. Experimental results show that HLM increases syllable accuracy by 7.3% and sentence accuracy by 19.5%. In addition, the language model is used for text flow analysis in place of blind text matching. With a single video channel, accuracy can reach up to 70.5%.
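The fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's actual fusion formula, so everything here is an assumption made for illustration: the function name fuse_decode, the lm_weight and floor parameters, the log-linear combination of scores, and the toy probabilities. Each position holds candidate syllables with an HMM log score, and a bigram table supplies the linguistic prior; a Viterbi-style search picks the sequence with the highest combined score.

```python
import math


def fuse_decode(candidates, bigram, lm_weight=1.0, floor=1e-6):
    """Pick the syllable sequence maximizing HMM score plus weighted bigram prior.

    candidates: list (one entry per position) of {syllable: hmm_log_score}.
    bigram: {(previous, current): probability}; "<s>" marks the sentence start.
    lm_weight and floor are hypothetical tuning knobs, not taken from the paper.
    """
    def lm(prev, cur):
        # Unseen bigrams get a small floor probability instead of zero.
        return math.log(bigram.get((prev, cur), floor))

    # best[s] = (score, path) of the best partial sequence ending in syllable s
    best = {
        s: (score + lm_weight * lm("<s>", s), [s])
        for s, score in candidates[0].items()
    }
    for position in candidates[1:]:
        new_best = {}
        for cur, score in position.items():
            new_best[cur] = max(
                (
                    (prev_score + score + lm_weight * lm(prev, cur), path + [cur])
                    for prev, (prev_score, path) in best.items()
                ),
                key=lambda item: item[0],
            )
        best = new_best
    return max(best.values(), key=lambda item: item[0])[1]


# Toy data (invented): "ma", "ba" and "pa" look alike on the lips,
# so the HMM scores alone cannot separate them reliably.
candidates = [
    {"ma": math.log(0.4), "ba": math.log(0.6)},
    {"ma": math.log(0.5), "pa": math.log(0.5)},
]
bigram = {
    ("<s>", "ma"): 0.5, ("<s>", "ba"): 0.5,
    ("ma", "ma"): 0.6, ("ba", "ma"): 0.05,
    ("ma", "pa"): 0.05, ("ba", "pa"): 0.1,
}

print(fuse_decode(candidates, bigram))
```

With these toy numbers the HMM scores alone favor "ba" in the first position, while the fused decision selects "ma" followed by "ma" because that bigram is far more probable. Setting lm_weight=0.0 recovers the HMM-only decision, which makes the contribution of the language prior easy to inspect.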