Lip Movement Detection Using 3D Convolution and Resnet

O Obulesu,Teneti. Sanjana,V Rupa Sree,Saahithya D V,B Srija Reddy
DOI: https://doi.org/10.22214/ijraset.2023.54180
2023-06-30
International Journal for Research in Applied Science and Engineering Technology
Abstract:Abstract: Recognition of Lip movements has become one of the most challenging tasks and has crucial applicationsin the contemporary scenario. Being able to see speech helps people communicate better, especially in challenging listening environments like when there is a background noise and video surveillance when there is no audio. Lip reading is a technique primarily used by deaf people or those who have some form of hearing impairment. It’s a way of understanding and communicating with others who might not be familiar with another form of inclusive communication, such as sign language. Lip-reading technology mainly includes face detection, lip localization, feature extraction, training the classifier and finally recognising the word or sentence through lip movement. Many developments have taken place in this growing field using various deep learning-based techniques. An intelligent system will be trained by giving users lip-movement frames sequences as input and will identify lip movement and the said word using 3D convolution and ResNet . This project does analysis over various deep learning models and other datasets. This study also aims to find out the optimal architecture suitable for building a new model with high accuracy for lip movement detection.
What problem does this paper attempt to address?