Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network

吴小俊,宋晓宁,苏江毅,於东军
DOI: https://doi.org/10.3778/j.issn.1673-9418.2008051
2021-01-01
Abstract:Compared with the traditional RGB-based methods, the skeleton-based action recognition methods have become the main research direction in the field of computer vision in recent years because they are less affected by many factors such as illumination, viewing angle and background complexity. However, the current skeleton-based methods still have some problems such as large parameters, long time-consuming and high computational complexity, which makes it complicated and difficult to meet the requirements of efficiency and accuracy simultaneously. To address these issues, a lightweight graph convolution network using multi-modal data fusion is proposed. Firstly, the multi-modal information flow data are fused by multi-modal data fusion method. Secondly, the spatial and temporal information of human joints are extracted using spatial and temporal modules respectively. Finally, the classification results are obtained through the fully connected layer. Experimental results conducted on the two commonly used datasets including NTU60 RGB+D and NTU120 RGB+D demonstrate that the proposed network outperforms some mainstream methods in the last two years in both recognition accuracy and efficiency, thus verifying that the network has excellent performance in terms of accuracy, while considering time efficiency and computational cost.
What problem does this paper attempt to address?