Abstract: The authors introduce a novel Temporal Channel Reconfiguration Multi‐Graph Convolution Network (TRMGCN) that addresses limitations of existing methods for skeleton‐based action recognition. It combines a Temporal Channel Fusion with Guidance (TCFG) module, which captures crucial temporal information, with Top‐Down Attention Multi‐graph Independent Convolution (TD‐MIG), which learns topological graph features. TRMGCN demonstrates advanced performance on the large‐scale datasets NTU‐RGB+D 60 and 120 and exhibits strong generalisation capabilities on the smaller dataset NW‐UCLA.

Skeleton‐based action recognition has received much attention and made remarkable progress in the field of human action recognition. For time‐series action prediction at different scales, existing methods mainly rely on attention mechanisms to enhance modelling capability in the spatial dimension. However, this approach depends strongly on the local information of a single input feature and fails to facilitate the flow of information between channels. To address these issues, the authors propose a novel Temporal Channel Reconfiguration Multi‐Graph Convolution Network (TRMGCN). In the temporal convolution part, the authors design a module called Temporal Channel Fusion with Guidance (TCFG) that captures important temporal information within channels at different scales, so that cross‐spatio‐temporal dependencies among joints are not ignored. In the graph convolution part, the authors propose Top‐Down Attention Multi‐graph Independent Convolution (TD‐MIG), which uses multi‐graph independent convolution to learn topological graph features for time series of different lengths. Top‐down attention is introduced for spatial and channel modulation to facilitate information flow in channels that do not establish topological relationships. Experimental results on the large‐scale datasets NTU‐RGB+D 60 and 120, as well as UAV‐Human, demonstrate that TRMGCN achieves advanced performance. Furthermore, experiments on the smaller dataset NW‐UCLA indicate that the authors' model possesses strong generalisation abilities.
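
To make the idea of convolving skeleton features over several graphs independently more concrete, the following is a minimal NumPy sketch. It is not the authors' TD‐MIG module: the abstract does not specify its layers, so the function names (`normalize_adjacency`, `multi_graph_conv`), the toy 5‐joint chain skeleton, the one‐hop and two‐hop graphs, and the choice to sum the branches are all assumptions made purely for illustration. Attention and temporal convolution are omitted.

```python
import numpy as np

def normalize_adjacency(adj):
    """Add self-loops and row-normalise so each joint averages over its neighbours."""
    adj = adj + np.eye(adj.shape[0])
    return adj / adj.sum(axis=1, keepdims=True)

def multi_graph_conv(x, graphs, weights):
    """Apply one graph convolution per graph, each with its own weights, and sum.

    x       : (T, V, C_in) features for T frames, V joints, C_in channels
    graphs  : list of (V, V) normalised adjacency matrices
    weights : list of (C_in, C_out) matrices, one per graph (hypothetical parameters)
    """
    out = 0.0
    for adj, w in zip(graphs, weights):
        # Aggregate neighbour features per frame: (V, V) x (T, V, C) -> (T, V, C)
        aggregated = np.einsum("uv,tvc->tuc", adj, x)
        # Project channels with this graph's independent weight matrix
        out = out + aggregated @ w
    return out

# Toy skeleton (an assumption): 5 joints connected in a chain
T, V, C_in, C_out = 4, 5, 3, 8
chain = np.zeros((V, V))
for i in range(V - 1):
    chain[i, i + 1] = chain[i + 1, i] = 1.0

# Second graph (an assumption): joints reachable in exactly two hops
two_hop = ((chain @ chain) > 0).astype(float)
np.fill_diagonal(two_hop, 0.0)

graphs = [normalize_adjacency(chain), normalize_adjacency(two_hop)]
rng = np.random.default_rng(0)
weights = [rng.standard_normal((C_in, C_out)) for _ in graphs]
x = rng.standard_normal((T, V, C_in))

out = multi_graph_conv(x, graphs, weights)
print(out.shape)
```

Each graph here has its own weight matrix, so the one‐hop and two‐hop topologies are learned separately before being combined. How TRMGCN actually builds and fuses its graphs is described in the paper itself, not in this sketch.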

Motion Complement and Temporal Multifocusing for Skeleton-Based Action Recognition

An Attentional Spatial Temporal Graph Convolutional Network with Co-Occurrence Feature Learning for Action Recognition

SpatioTemporal Focus for Skeleton-based Action Recognition

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

Multi-Scale Adaptive Graph Convolution Network for Skeleton-Based Action Recognition

Spatial Temporal Graph Attention Network for Skeleton-Based Action Recognition

A Tri-Attention Enhanced Graph Convolutional Network for Skeleton-Based Action Recognition

Temporal channel reconfiguration multi‐graph convolution network for skeleton‐based action recognition

Temporal Enhanced Multi-Stream Graph Convolutional Neural Networks For Skeleton-Based Action Recognition

Densely Connected and Multiple Temporal Graph Convolution Networks for Skeleton-based Action Recognition

Multi-Modality Adaptive Feature Fusion Graph Convolutional Network for Skeleton-Based Action Recognition

Dual-Excitation Spatial–Temporal Graph Convolution Network for Skeleton-Based Action Recognition

An improved spatial temporal graph convolutional network for robust skeleton-based action recognition

Attention-Based Multilevel Co-Occurrence Graph Convolutional LSTM for 3-D Action Recognition

A motion-aware and temporal-enhanced Spatial–Temporal Graph Convolutional Network for skeleton-based human action segmentation

Multiple temporal scale aggregation graph convolutional network for skeleton-based action recognition

Multi‐temporal scale aggregation refinement graph convolutional network for skeleton‐based action recognition

Spatiotemporal Progressive Inward-Outward Aggregation Network for skeleton-based action recognition

Combining channel-wise joint attention and temporal attention in graph convolutional networks for skeleton-based action recognition

Channel attention and multi-scale graph neural networks for skeleton-based action recognition

Temporal segment graph convolutional networks for skeleton-based action recognition