Lip Temporal Pattern Analysis For Automatic Visual Speech Recognition

Lei Xie,Xiuli Cai,Zhonghua Fu,Dongmei Jiang,Rongchun Zhao
DOI: https://doi.org/10.1109/icosp.2004.1452760
2004-01-01
Abstract:This paper presents a novel approach to processing temporal lip motion information for dynamic visual feature extraction in visual speech recognition. The long-time Lip TenipoRA1 Patterns (LipTRAPs) of visual phonemes are introduced to analyze the nature of lip shape changes when uttering speech. A dynamic visual feature is also proposed based on the LipTRAPs. Visual speech recognition experiments on a connected-digits task show that the LipTRAP feature can yield significant WRR improvments than conventional delta features.
What problem does this paper attempt to address?