Motion-Pose Recurrent Neural Network With Instantaneous Kinematic Descriptor For Skeleton Based Gesture Detection And Recognition

Zhi Zhang,Yonghong Song,Yuanlin Zhang
DOI: https://doi.org/10.1109/ACPR.2017.14
2017-01-01
Abstract:Skeleton based human gesture detection and recognition have been attracting increasing attention in the field of human action understanding. A number of approaches have been proposed to only explore skeleton inherent pose features by deep learning method. In this paper, we propose a novel instantaneous kinematic descriptor and a motion-pose recurrent neural network (RNN) for skeleton based gesture detection and recognition. Instead of the raw skeleton joint position as input, we propose instantaneous kinematic descriptor to represent not only the skeleton inherent pose but also the instantaneous movement at a current frame. Meanwhile, the proposed network is capable of transforming the gesture detection to the problem of frame labeling, which can model both the frame-wise dynamic motion and long-term temporal context of gestures. The proposed method is evaluated on the Chalearn LAP gesture dataset, and the result demonstrates that our method achieves the state-of-art performance in the task of skeleton-based gesture detection and recognition.
What problem does this paper attempt to address?