Dynamic Hand Gesture-Featured Human Motor Adaptation in Tool Delivery using Voice Recognition

Haolin Fei,Stefano Tedeschi,Yanpei Huang,Andrew Kennedy,Ziwei Wang
2023-09-20
Abstract:Human-robot collaboration has benefited users with higher efficiency towards interactive tasks. Nevertheless, most collaborative schemes rely on complicated human-machine interfaces, which might lack the requisite intuitiveness compared with natural limb control. We also expect to understand human intent with low training data requirements. In response to these challenges, this paper introduces an innovative human-robot collaborative framework that seamlessly integrates hand gesture and dynamic movement recognition, voice recognition, and a switchable control adaptation strategy. These modules provide a user-friendly approach that enables the robot to deliver the tools as per user need, especially when the user is working with both hands. Therefore, users can focus on their task execution without additional training in the use of human-machine interfaces, while the robot interprets their intuitive gestures. The proposed multimodal interaction framework is executed in the UR5e robot platform equipped with a RealSense D435i camera, and the effectiveness is assessed through a soldering circuit board task. The experiment results have demonstrated superior performance in hand gesture recognition, where the static hand gesture recognition module achieves an accuracy of 94.3\%, while the dynamic motion recognition module reaches 97.6\% accuracy. Compared with human solo manipulation, the proposed approach facilitates higher efficiency tool delivery, without significantly distracting from human intents.
Robotics,Artificial Intelligence
What problem does this paper attempt to address?
The problem this paper attempts to address is how to achieve more natural, intuitive, and efficient interaction methods in Human-Robot Collaboration (HRC), especially in situations where the user's hands are occupied. Specifically, it aims to enable robots to understand the user's intentions and promptly provide the necessary tools. Existing gesture recognition systems often rely on complex interfaces and large amounts of training data, which not only increase the user's burden but also limit the system's flexibility and adaptability. Additionally, these systems usually require users to make exaggerated gestures, which can lead to fatigue over prolonged use and may fail to convey detailed instructions or preferences. To tackle these challenges, the paper proposes an innovative Human-Robot Collaboration framework that integrates gesture recognition, dynamic action recognition, voice recognition, and switchable control adaptation strategies. This framework aims to understand the user's intentions by analyzing the form and dynamics of gestures, as well as voice commands. Consequently, it allows the robot to flexibly provide the required tools or objects based on the user's natural gestures and verbal instructions without the need for additional training. Experimental results show that the system performs excellently in gesture recognition, with an accuracy rate of 94.3% for the static gesture recognition module and 97.6% for the dynamic action recognition module. Compared to manual operation alone, this method can improve the efficiency of tool delivery without significantly distracting the user.