Learning Face Expression Features from Video Using Spatio-Temporal Feature Extractor and CNN-LSTM

Vandana S. Bhat,Anil B. Gavade,Priyanka A. Gavade
DOI: https://doi.org/10.1109/ICIIP61524.2023.10537794
2023-11-22
Abstract:The study of recognizing facial expressions (FER) has posed a significant challenge within the realm of computer vision over the past three decades. However, it continues to garner attention from researchers because of persistent challenges such as-blurring, illumination variation, and pose variation. Existing studies have predominantly focused on static image-based FER, neglecting temporal information. This research paper presents a systematic approach to identify and classify face expressions from videos by leveraging both spatial and temporal information. Datasets containing movie clips having spontaneous expression shot in the wild and the laboratory trained dataset, shot in a controlled environment are considered for evaluation, namely, RAVDESS, SAVEE, CK+ and AFEW. Performance metrics, such as Accuracy, Precision, and Recall, are employed to quantitatively assess the quality of the proposed Facial Expression Recognition (FER) system. Collectively, the findings validate the effectiveness of the proposed approach, solidifying its status as a significant advancement in the field of automatic facial expression recognition.
Computer Science
What problem does this paper attempt to address?