Accommodating Self-attentional Heterophily Topology into High- and Low-pass Graph Convolutional Network for Skeleton-based Action Recognition

Chao Wei,Zhidong Deng
DOI: https://doi.org/10.1109/IJCNN54540.2023.10191560
2023-01-01
Abstract:In the scene of human skeleton-based action recognition, graph convolutional network (GCN) is widely applied to model human action dynamics and achieves extraordinary results, where graph topology is in charge of feature extraction and plays a crucial role in learning representation in GCNs. This paper presents a novel high- and low-pass graph convolutional network (HLP-GCN), aiming to effectively aggregate joint features based on learned homophily and heterophily topologies using self-attention modules for skeleton-based action recognition tasks. Specifically, such an HLP-GCN framework first learns homophily and heterophily topologies dynamically so as to capture different correlations between input samples and then combine similar information arising from homophily topologies with dissimilar ones from heterophily topologies to get the final embeddings. Furthermore, a temporal modeling module is added to this HLPGCN framework, and a new pooling strategy called global node (GNode) is cascaded subsequently, in order to have the extraction of more robust spatiotemporal features for skeleton action recognition. Finally, the experimental results demonstrate that our method outperforms previous state-of-the-art methods by 0.2%, 0.4%, and 0.3% on three large-scale action recognition datasets including NTU RGB+D X-Set, NTU RGB+D 120 X-Sub, and NW-UCLA, respectively.
What problem does this paper attempt to address?