Effective skeleton topology and semantics-guided adaptive graph convolution network for action recognition

Zhong-Xiang Qiu,Hong-Bo Zhang,Wei-Mo Deng,Ji-Xiang Du,Qing Lei,Guo-Liang Zhang,Zhong-Xiang Qiu,Hong-Bo Zhang,Wei-Mo Deng,Ji-Xiang Du,Qing Lei,Guo-Liang Zhang
DOI: https://doi.org/10.1007/s00371-022-02473-7
IF: 2.835
2022-04-17
The Visual Computer
Abstract:Most of the existing graph convolutional network-based action recognition methods use an adaptive mechanism to learn action features from a skeleton sequence. Although this mechanism improves the recognition accuracy to some extent, its performance is still limited by the initial skeleton topology, which uses a natural human connection approach to connect skeletal joints. In addition, the semantic information of skeletal joints is naturally informative and discriminative for action recognition tasks, but its inclusion has rarely been investigated in the existing methods. To solve these problems, in this work, we propose a novel multistream-based effective skeleton topology and semantically guided adaptive graph convolution network for action recognition. By comparing several different topological graphs, we design an elbow- and knee-centric topology structure that forms the input to the adaptive graph convolutional network. Moreover, we explicitly embed the high-level semantic skeletal information into this network to enhance the feature representation capabilities. Finally, we study the positional relationships between different joints and the center of gravity in the same frame to generate relative position data. They are combined with the joint data, bone data and their corresponding motion information by a multistream network to further improve the action recognition accuracy. Extensive experiments show that the proposed method achieves state-of-the-art performance levels.
What problem does this paper attempt to address?