Abstract:Hand gesture recognition (HGR) is a vital component in enhancing the human-computer interaction experience, particularly in multimedia applications, such as virtual reality, gaming, smart home automation systems, etc. Users can control and navigate through these applications seamlessly by accurately detecting and recognizing gestures. However, in a real-time scenario, the performance of the gesture recognition system is sometimes affected due to the presence of complex background, low-light illumination, occlusion problems, etc. Another issue is building a fast and robust gesture-controlled human-computer interface (HCI) in the real-time scenario. The overall objective of this paper is to develop an efficient hand gesture detection and classification model using a channel-pruned YOLOv5-small model and utilize the model to build a gesture-controlled HCI with a quick response time (in ms) and higher detection speed (in fps). First, the YOLOv5s model is chosen for the gesture detection task. Next, the model is simplified by using a channel-pruned algorithm. After that, the pruned model is further fine-tuned to ensure detection efficiency. We have compared our suggested scheme with other state-of-the-art works, and it is observed that our model has shown superior results in terms of mAP (mean average precision), precision (\%), recall (\%), and F1-score (\%), fast inference time (in ms), and detection speed (in fps). Our proposed method paves the way for deploying a pruned YOLOv5s model for a real-time gesture-command-based HCI to control some applications, such as the VLC media player, Spotify player, etc., using correctly classified gesture commands in real-time scenarios. The average detection speed of our proposed system has reached more than 60 frames per second (fps) in real-time, which meets the perfect requirement in real-time application control.

Multimodal Gesture Recognition with Spatio-Temporal Features Fusion Based on YOLOv5 and MediaPipe

Dynamic hand gesture recognition using hidden Markov models

A real-time hand gesture recognition method

Interaction and control with the auxiliary of hand gesture

Multimodal Gesture Recognition Based On Choquet Integral

Novel Human Machine Interface via Robust Hand Gesture Recognition System using Channel Pruned YOLOv5s Model

WristCam: A Wearable Sensor for Hand Trajectory Gesture Recognition and Intelligent Human–Robot Interaction

CAPG-MYO - A Muscle-Computer Interface Supporting User-defined Gesture Recognition.

Mixed Hand Gesture Recognition System And Its Application

Multimodal Gesture Recognition for Mascot Robot System Based on Choquet Integral Using Camera and 3D Accelerometers Fusion

Gesture Detection and Recognition Based on Object Detection in Complex Background

Hand Gesture Control for Human–Computer Interaction with Deep Learning

Computer Interactive Gesture Recognition Model Based on Improved YOLOv5 Algorithm

Gesture Recognition with a 3-D Accelerometer

HGR-FYOLO: a robust hand gesture recognition system for the normal and physically impaired person using frozen YOLOv5

Multimodal Gesture Recognition Based on Attention Slow-Fast Fusion Networks

Gesture recognition using combination of acceleration sensor and images for casual communication between robots and humans

Next-Gen Dynamic Hand Gesture Recognition: MediaPipe, Inception-v3 and LSTM-Based Enhanced Deep Learning Model

An interaction system using mixed hand gestures.

Multimodal Spatiotemporal Feature Map for Dynamic Gesture Recognition

Hand Gesture Recognition using Deep Feature Fusion Network based on Wearable Sensors