Decimal Digits Recognition from Lip Movement Using GoogleNet network

Kwakib Saadun Naif1, Kadhim Mahdi Hashim2
DOI: https://doi.org/10.32792/jeps.v12i2.195
2023-02-14
Abstract: Lip reading is a visual means of communication based on the movement of the lips. It is especially useful for the hearing impaired and for people in noisy environments such as stadiums and airports. Lip reading is not easy: recording a video of a speaker introduces many difficulties, including lighting, rotation, the person’s position, and differing skin colors. Researchers are therefore constantly looking for new lip-reading techniques. The main objective of this paper is to design and implement an effective system for identifying decimal digits from lip movement. Our proposed system consists of two stages. The first is preprocessing, in which the face and mouth area are detected using the Viola-Jones algorithm, and the lips are located and stored in a temporary folder. The second stage feeds the lip frames into a GoogleNet neural network, where features are extracted in the convolutional layers and then classified. The results were convincing: we obtained an accuracy of 87% using a database of 35 videos, containing seven males and two females, with a total of 21,501 lip images.
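The preprocessing stage described above can be illustrated with a minimal sketch of the cropping step. It assumes a face bounding box is already available from a Viola-Jones detector (for example, OpenCV's `cv2.CascadeClassifier.detectMultiScale` with a frontal-face Haar cascade). The lower-third heuristic and the names `mouth_roi` and `crop` are illustrative assumptions, not the authors' method, which the abstract does not specify in detail.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


def mouth_roi(face: Box) -> Box:
    """Estimate a mouth box from a face box.

    Assumption (not from the paper): the mouth lies in the lower third
    of the face and in the central 60% of its width.
    """
    x, y, w, h = face
    margin = w // 5            # trim 20% from each side
    top = (2 * h) // 3         # skip the upper two thirds of the face
    return (x + margin, y + top, w - 2 * margin, h - top)


def crop(frame: List[List[int]], box: Box) -> List[List[int]]:
    """Cut a box out of a frame stored as a list of pixel rows."""
    x, y, w, h = box
    return [row[x:x + w] for row in frame[y:y + h]]


# Hypothetical face box, as a Viola-Jones detector might return it.
face_box = (100, 50, 90, 120)
lip_box = mouth_roi(face_box)
```

Each cropped lip image would then be resized to the classifier's input size and passed to the network; that step depends on the chosen deep-learning framework and is omitted here.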