Artificial Intelligence: A Survey on Lip-Reading Techniques

Apurva H. Kulkarni, Dnyaneshwar Kirange
DOI: https://doi.org/10.1109/icccnt45670.2019.8944628
2019-07-01
Abstract: Lip reading is a visual way of “listening” to someone: the observer watches the speaker's face and follows their speech patterns in order to recognize what is being said. Lip-reading technology mainly comprises face detection, lip localization, feature extraction, training a classifier on a corpus, and finally recognition of the word or sentence from lip movement. An intelligent system is trained on sequences of frames showing the user's lip movement and identifies the spoken word using either visual information alone or both audio and visual information. Deep learning is an emerging branch of artificial intelligence loosely modeled on the human brain; its models contain multiple layers that process fine details, much as neurons do in the brain. This paper surveys lip-reading techniques and datasets in different languages in the era of deep learning. Various automatic lip-reading techniques are discussed and summarized.
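The stages listed in the abstract (face detection, lip localization, feature extraction, classifier training, recognition) can be sketched as a toy pipeline. This is only an illustrative sketch and not the method of any surveyed system: every function name is hypothetical, the face detector is a placeholder that assumes the face fills the frame, the lip region is assumed to be the lower third of the face, the feature is a plain mean intensity, and a nearest-centroid rule stands in for the deep models the survey covers.

```python
from typing import Dict, List, Sequence, Tuple

Frame = List[List[float]]          # grayscale image as rows of pixel intensities
Box = Tuple[int, int, int, int]    # (top, left, height, width)


def detect_face(frame: Frame) -> Box:
    """Placeholder detector (assumption): the face fills the whole frame."""
    return (0, 0, len(frame), len(frame[0]))


def localize_lips(face: Box) -> Box:
    """Heuristic (assumption): the lips lie in the lower third of the face box."""
    top, left, height, width = face
    lip_height = max(1, height // 3)
    return (top + height - lip_height, left, lip_height, width)


def extract_features(frame: Frame, lips: Box) -> float:
    """Toy feature: mean pixel intensity inside the lip region."""
    top, left, height, width = lips
    pixels = [frame[r][c]
              for r in range(top, top + height)
              for c in range(left, left + width)]
    return sum(pixels) / len(pixels)


def sequence_features(frames: Sequence[Frame]) -> List[float]:
    """Run detection, localization, and feature extraction on each frame."""
    return [extract_features(f, localize_lips(detect_face(f))) for f in frames]


def train(corpus: Dict[str, List[List[float]]]) -> Dict[str, List[float]]:
    """Nearest-centroid stand-in for a classifier: one mean sequence per word.

    Assumes all feature sequences for a word have the same length.
    """
    model: Dict[str, List[float]] = {}
    for word, seqs in corpus.items():
        length = len(seqs[0])
        model[word] = [sum(s[i] for s in seqs) / len(seqs) for i in range(length)]
    return model


def recognize(model: Dict[str, List[float]], features: Sequence[float]) -> str:
    """Return the word whose centroid is closest in squared Euclidean distance."""
    def dist(a: Sequence[float], b: Sequence[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(model, key=lambda word: dist(model[word], features))


def make_frame(lip_value: float) -> Frame:
    """Synthetic 3x3 frame whose bottom row plays the role of the lip region."""
    return [[0.0] * 3, [0.0] * 3, [lip_value] * 3]


# Hypothetical two-word corpus of per-frame feature sequences.
model = train({"open": [[1.0, 5.0], [1.0, 7.0]], "closed": [[1.0, 1.0]]})
clip = [make_frame(1.0), make_frame(5.0)]
predicted = recognize(model, sequence_features(clip))
```

In a real system each stage would be replaced by a proper component, for example a trained face detector, a landmark-based lip localizer, and a learned sequence model; the sketch only shows how the stages feed into one another.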