Speaker Recognition Technology Based on Lip Movement

Liu Qinghui,Yao Hongxun
DOI: https://doi.org/10.3321/j.issn:1002-8331.2006.12.026
2006-01-01
Abstract:For most of speaker recognition systems based on acoustic signals,a novel approach of speaker recognition technology based on lip movement is presented in this paper.By Discrete Cosine Transform,visual features is extracted from the talking image sequences,which represent both the physical characteristics of the speaker mouth and his lip movement behaviour trait.Based on these feautures,the static-dynmic models are constructed for the speakers,in which the dynmic model is based on SCHMM.We implement both speaker identification system and speaker verification system on a small visual database,and the accuracy of the text-dependent and the text-independent got to 100% and 99.7% for identification,respectively,and the ERR of both of them are 0.09% and 0.33% for speaker verification,separately.
What problem does this paper attempt to address?