Abstract:We have recently seen significant advancements in the development of robotic machines that are designed to assist people with their daily lives. Socially assistive robots are now able to perform a number of tasks autonomously and without human supervision. However, if these robots are to be accepted by human users, there is a need to focus on the form of human–robot interaction that is seen as acceptable by such users. In this paper, we extend our previous work, originally presented in Ruiz-Garcia et al. (in: Engineering applications of neural networks: 17th international conference, EANN 2016, Aberdeen, UK, September 2–5, 2016, proceedings, pp 79–93, 2016. 10.1007/978-3-319-44188-7_6), to provide emotion recognition from human facial expressions for application on a real-time robot. We expand on previous work by presenting a new hybrid deep learning emotion recognition model and preliminary results using this model on real-time emotion recognition performed by our humanoid robot. The hybrid emotion recognition model combines a Deep Convolutional Neural Network (CNN) for self-learnt feature extraction and a Support Vector Machine (SVM) for emotion classification. Compared to more complex approaches that use more layers in the convolutional model, this hybrid deep learning model produces state-of-the-art classification rate of 96.26%\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$96.26\%$$\end{document}, when tested on the Karolinska Directed Emotional Faces dataset (Lundqvist et al. in The Karolinska Directed Emotional Faces—KDEF, 1998), and offers similar performance on unseen data when tested on the Extended Cohn–Kanade dataset (Lucey et al. in: Proceedings of the third international workshop on CVPR for human communicative behaviour analysis (CVPR4HB 2010), San Francisco, USA, pp 94–101, 2010). This architecture also takes advantage of batch normalisation (Ioffe and Szegedy in Batch normalization: accelerating deep network training by reducing internal covariate shift. http://arxiv.org/abs/1502.03167, 2015) for fast learning from a smaller number of training samples. A comparison between Gabor filters and CNN for feature extraction, and between SVM and multilayer perceptron for classification is also provided.

Video-based person-dependent and person-independent facial emotion recognition

Generalisation and Robustness Investigation for Facial and Speech Emotion Recognition Using Bio-Inspired Spiking Neural Networks

Emotion Recognition for Challenged People Facial Appearance in Social using Neural Network

Real-time Facial Expression Recognition "In The Wild'' by Disentangling 3D Expression from Identity

Facial expression recognition based on adaptation of the classifier to videos of the user

Human Emotion Recognition Based on Spatio-Temporal Facial Features Using HOG-HOF and VGG-LSTM

A VGG16 Based Hybrid Deep Convolutional Neural Network Based Real-Time Video Frame Emotion Detection System for Affective Human Computer Interaction

Classifying Emotions and Engagement in Online Learning Based on a Single Facial Expression Recognition Neural Network

Video-based Emotion Recognition Using Multi-dichotomy RNN-DNN

Human Face Emotion in 3D Using Machine Learning

Facial Emotion Recognition Under Mask Coverage Using a Data Augmentation Technique

Modelling an efficient hybridized approach for facial emotion recognition using unconstraint videos and deep learning approaches

Spatio-Temporal Facial Expression Recognition Using Convolutional Neural Networks and Conditional Random Fields

Semantic-based visual emotion recognition in videos-a transfer learning approach

Facial emotion recognition using geometrical features based deep learning techniques

Image-based facial emotion recognition using convolutional neural network on emognition dataset

Comparative analysis of facial emotion recognition

Facial emotion recognition and music recommendation system using CNN-based deep learning techniques

A hybrid deep learning neural approach for emotion recognition from facial expressions for socially assistive robots

Collaborative expression representation using peak expression and intra class variation face images for practical subject-independent emotion recognition in videos

An Ensemble Approach for Facial Expression Analysis in Video