Abstract: Recent years have seen significant advances in robotic machines designed to assist people in their daily lives. Socially assistive robots can now perform a number of tasks autonomously, without human supervision. However, if these robots are to be accepted by human users, attention must be paid to the forms of human–robot interaction that such users find acceptable. In this paper, we extend our previous work, originally presented in Ruiz-Garcia et al. (in: Engineering applications of neural networks: 17th international conference, EANN 2016, Aberdeen, UK, September 2–5, 2016, proceedings, pp 79–93, 2016. 10.1007/978-3-319-44188-7_6), to provide emotion recognition from human facial expressions for real-time use on a robot. We expand on that work by presenting a new hybrid deep learning emotion recognition model, together with preliminary results from using this model for real-time emotion recognition on our humanoid robot. The hybrid emotion recognition model combines a Deep Convolutional Neural Network (CNN) for self-learnt feature extraction and a Support Vector Machine (SVM) for emotion classification. Compared to more complex approaches that use more layers in the convolutional model, this hybrid deep learning model achieves a state-of-the-art classification rate of 96.26% when tested on the Karolinska Directed Emotional Faces dataset (Lundqvist et al. in The Karolinska Directed Emotional Faces - KDEF, 1998), and offers similar performance on unseen data when tested on the Extended Cohn–Kanade dataset (Lucey et al. in: Proceedings of the third international workshop on CVPR for human communicative behaviour analysis (CVPR4HB 2010), San Francisco, USA, pp 94–101, 2010). The architecture also takes advantage of batch normalisation (Ioffe and Szegedy in Batch normalization: accelerating deep network training by reducing internal covariate shift. http://arxiv.org/abs/1502.03167, 2015) for fast learning from a smaller number of training samples. A comparison between Gabor filters and CNN for feature extraction, and between SVM and multilayer perceptron for classification, is also provided.
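The hybrid design described in the abstract, convolutional features feeding a separate SVM classifier, can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the two hand-set 3x3 kernels stand in for learnt CNN filters, the toy 6x6 bar images stand in for face images, the task is binary rather than multi-class emotion classification, and the linear SVM is trained with a simple hinge-loss subgradient loop. All function names, kernels and hyperparameters below are assumptions made for illustration only.

```python
def conv2d_valid(image, kernel):
    """'Valid' 2-D cross-correlation of a list-of-lists image with a kernel."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + a][j + b] * kernel[a][b]
                for a in range(kh) for b in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def extract_features(image, kernels):
    """Convolve, apply ReLU, then global max-pool: one feature per kernel."""
    feats = []
    for kernel in kernels:
        fmap = conv2d_valid(image, kernel)
        feats.append(max(max(0.0, float(v)) for row in fmap for v in row))
    return feats


def train_linear_svm(features, labels, lam=0.01, lr=0.1, epochs=50):
    """Subgradient descent on the L2-regularised hinge loss (labels in {-1, +1})."""
    w = [0.0] * len(features[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(features, labels):
            margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
            if margin < 1:
                # Sample violates the margin: regulariser plus hinge term.
                w = [wj - lr * (lam * wj - y * xj) for wj, xj in zip(w, x)]
                b += lr * y
            else:
                # Margin satisfied: only the regulariser shrinks the weights.
                w = [wj - lr * lam * wj for wj in w]
    return w, b


def predict(w, b, x):
    """Return +1 or -1 according to the sign of the decision function."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


# Hypothetical hand-set kernels standing in for learnt CNN filters.
KERNELS = [
    [[-1, 2, -1], [-1, 2, -1], [-1, 2, -1]],   # responds to vertical bars
    [[-1, -1, -1], [2, 2, 2], [-1, -1, -1]],   # responds to horizontal bars
]


def vertical_bar(col, size=6):
    """Toy image: a single vertical bar of ones in column `col`."""
    return [[1 if j == col else 0 for j in range(size)] for _ in range(size)]


def horizontal_bar(row, size=6):
    """Toy image: a single horizontal bar of ones in row `row`."""
    return [[1 if i == row else 0 for _ in range(size)] for i in range(size)]


# Stage 1: feature extraction. Stage 2: SVM trained on those features.
train_images = [vertical_bar(c) for c in (1, 2, 3)] + \
               [horizontal_bar(r) for r in (1, 2, 3)]
train_labels = [1, 1, 1, -1, -1, -1]
train_features = [extract_features(img, KERNELS) for img in train_images]
w, b = train_linear_svm(train_features, train_labels)

# Classify a bar position that was not seen during training.
held_out = extract_features(vertical_bar(4), KERNELS)
print(predict(w, b, held_out))
```

The point of the sketch is the separation of concerns: the feature extractor and the classifier are trained and used as two distinct stages, so the classifier can be swapped (for example, for a multilayer perceptron) without touching the feature extractor.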

Deep facial expression detection using Viola-Jones algorithm, CNN-MLP and CNN-SVM

Human emotion detection and classification using modified viola-jones and convolution neural network

Emotion Recognition System Based on Facial Expressions Using SVM

Human Emotion Recognition Based on Spatio-Temporal Facial Features Using HOG-HOF and VGG-LSTM

Generalisation and Robustness Investigation for Facial and Speech Emotion Recognition Using Bio-Inspired Spiking Neural Networks

Facial Emotions Recognition Using Deep Learning Technology

A real time face emotion classification and recognition using deep learning model

Robust Human Face Emotion Classification Using Triplet-Loss-Based Deep CNN Features and SVM

Improvement of emotion recognition from facial images using deep learning and early stopping cross validation

Deep learning based MobileNet and multi-head attention model for facial expression recognition

Deep-Emotion: Facial Expression Recognition Using Attentional Convolutional Network

Breaking New Ground in Affective Computing: Enhanced Facial Expression Classification via CNN-SVM Integration

Emotional Facial Expression Detection using YOLOv8

A hybrid deep learning neural approach for emotion recognition from facial expressions for socially assistive robots

Human Behavior Understanding in Big Multimedia Data Using CNN based Facial Expression Recognition

Expert System for Smart Virtual Facial Emotion Detection Using Convolutional Neural Network

Optimal Facial Feature Based Emotional Recognition Using Deep Learning Algorithm

Distinguishing Posed and Spontaneous Smiles by Facial Dynamics

A deep-learning-based facial expression recognition method using textural features

Facial emotion recognition and music recommendation system using CNN-based deep learning techniques