Lip motion recognition of speaker based on SIFT

Xinjun MA,Chenchen WU,Qianyuan ZHONG,Yuanyuan LI
DOI: https://doi.org/10.11772/j.issn.1001-9081.2017.09.2694
2017-01-01
Abstract:Aiming at the problem that the lip feature dimension is too high and sensitive to the scale space,a technique based on the Scale-Invariant Feature Transform (SIFT) algorithm was proposed to carry out the speaker authentication.Firstly,a simple video frame image neat algorithm was proposed to adjust the length of the lip video to the same length,and the representative lip motion pictures were extracted.Then,a new algorithm based on key points of SIFT was proposed to extract the texture and motion features.After the integration of Principal Component Analysis (PCA) algorithm,the typical lip motion features were obtained for authentication.Finally,a simple classification algorithm was presented according to the obtained features.The experimental results show that compared to the common Local Binary Pattern (LBP) feature and the Histogram of Oriental Gradient (HOG) feature,the False Acceptance Rate (FAR) and False Rejection Rate (FRR) of the proposed feature extraction algorithm are better,which proves that the whole speaker lip motion recognition algorithm is effective and can get the ideal results.
What problem does this paper attempt to address?