Effectivity from single visual channel lipreading system

CHEN Rong,YAO Hong-xun,HONG Xiao-peng,WAN Yu-qi
DOI: https://doi.org/10.3321/j.issn:1002-8331.2007.20.009
2007-01-01
Abstract:To build a large vocabulary lipreading system based on single visual channel,an unitary U-LDCT-KL two-level feature extraction method is presented in this paper.It is based on lip region partition DCT coefficients to be gotten rid off the overlap of those local coefficients by KL.This method,on one hand extractes the most efficient low features for lipreading,on the other hand,selectes features reasonably to improve their distinguishability.With 42-dimensional two-level visual features can get 77.8% rate of lip movement contents recognition for speaker-dependent cases.Experiments also prove that the features of blocks DCT coefficients in lip region are efficacious to visual single channel lipreading system.
What problem does this paper attempt to address?