Self-Constructing Temporal Excitation Graph for Skeleton-Based Action Recognition

Jianan Li,Zhifu Zhao,Jiawen Yang,Hua Chu,Qingshan Li
DOI: https://doi.org/10.1109/jsen.2023.3306819
IF: 4.3
2023-01-01
IEEE Sensors Journal
Abstract:Graph convolutional network (GCN)-based methods have obtained remarkable performance and gained widespread attention for skeleton-based human action recognition. These methods typically apply 1-D local convolutions to model temporal correlations and simply utilize multilayer stacking to capture long-range temporal dynamics. However, the 1-D local convolution focuses on the relations between the adjacent time steps. Also, with the repeat of a lot of local convolutions, the key temporal relation with nonadjacent temporal distance may be ignored due to the information dilution. Therefore, it remains unclear how to fully explore the temporal dynamics of skeleton sequences. In this article, we propose a temporal excitation GCN (TE-GCN) to exploit a self-constructing temporal relation graph for capturing complex temporal dynamics. Specifically, the constructed temporal relation graph explicitly establishes the connections between semantically related temporal features to adaptively capture the temporal relations between the skeleton sequences. Meanwhile, to further explore the sufficient temporal dynamics concurrently, a multihead mechanism is designed to investigate multikinds of temporal relations. Extensive experiments are performed on two widely used large-scale datasets, NTU-60 RGB+D and NTU-120 RGB+D. Also, experimental results show that the proposed model obtains significant improvements by making contribution to temporal modeling for action recognition.
What problem does this paper attempt to address?