Attention on Emotions: A Vision Transformer Approach to Advancing Facial Expression Recognition

Sakshi Bajpai, Anoop Kumar, Natasha Sharma, Dhuriya Ankit Subhash, Yuvjeet Arora, Rishav Raj
DOI: https://doi.org/10.1109/ICITEICS61368.2024.10625569
2024-06-28
Abstract: “Progress in facial emotion recognition (FER) is essential for improving interactions in affective computing, social robots, and human-computer interface systems. Although the foundation for FER was established by conventional Convolutional Neural Networks (CNNs), recent developments in computer vision have led to new opportunities using Transformer-based models, which have demonstrated exceptional performance in tasks such as picture classification and semantic analysis. In the framework of an extensive FER dataset, we test the performance of many Vision Transformer (ViT) models in addition to well-known CNNs like MobileNet and DenseNet. With the help of the FER2013 dataset, which is well-known for its varied portrayal of face expressions in grayscale pictures, we used several data augmentation methods to replicate real-world diversity. By using careful training with optimized parameters for both CNN and ViT models, our work provides a comparison analysis that shows the advantages and differences between each model type for FER tasks. This investigation shows that CNNs are still useful for effectively processing face characteristics and supports the viability of ViTs in capturing complex emotional expressions. Our results provide important new understandings of the features of the model that provide precise emotion identification, paving the way for further advancements in emotion-aware systems in a variety of fields.”
Computer Science