Abstract: Faces play a significant role in human-robot interaction, as they do in our daily life. The human mind has an inherent ability to recognize a person despite challenges such as bad illumination, occlusion, and pose variation. For humanoid robots, however, identifying a human face is a very complex task. The recent literature on face biometric recognition is extremely rich in applications that solve the human identification problem in structured environments. But the application of face biometrics to mobile robotics is limited by its inability to produce accurate identification under uneven conditions. We tackle this face recognition problem with our proposed component-based fragmented face recognition framework. The proposed framework uses only a subset of the full face, such as the eyes, nose, and mouth, to recognize a person. Its lower search cost, encouraging accuracy, and ability to handle various challenges of face recognition make it applicable to humanoid robots. The second problem in face recognition is face spoofing, in which a face recognition system is not able to distinguish between a genuine user and an imposter (a photo or video of the genuine user). The problem becomes more detrimental when robots are used as authenticators. Our research investigates a depth analysis method that tests liveness in order to discriminate imposters from legitimate users. The techniques developed above have been applied to criminal identification with the NAO robot. An eyewitness can interact with NAO through a user interface. NAO asks several questions about the suspect, such as age, height, and facial shape and size, and then makes a guess about the suspect's face.
