Abstract:To balance accuracy and computation cost well, this paper proposes a simple and efficient attention‐enhanced semantic‐guided graph convolutional network (AeS‐GCN) for skeleton‐based action recognition. Firstly we fuse semantics of joint type and frame index together as representation of skeleton. Then we use spatial attention block (SAB) to explore important features in spatial structure and temporal attention block (TAB) to extract latent temporal information. The model proposed is lightweight and achieves the state‐of‐the‐art performance on mainstream datasets with less parameters and less computational complexity. Skeleton‐based action recognition has been extensively studied in recent years and applied in virtual reality, detection systems and other cases with strong requirements for low cost as well as high accuracy, but most of the existing methods mainly focus on complex architecture of deep neural networks without considering computation efficiency. To balance accuracy and computation cost well, this paper proposes a simple and efficient attention‐enhanced semantic‐guided graph convolutional network (AeS‐GCN) for skeleton‐based action recognition. Firstly, we fuse semantics of joint type and frame index and dynamics together as representation of skeleton. Then, we use spatial attention block (SAB) to explore important features in spatial structure, in which adaptive GCN layer is adopted to adaptively model skeleton topology structure. Next, we use temporal attention block (TAB) to extract latent temporal information. The model proposed is a lightweight network and achieves the state‐of‐the‐art performance on mainstream datasets with less parameters and less computational complexity.

Effective skeleton topology and semantics-guided adaptive graph convolution network for action recognition

Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks

Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition.

Overcoming Topology Agnosticism: Enhancing Skeleton-Based Action Recognition through Redefined Skeletal Topology Awareness

An efficient self-attention network for skeleton-based action recognition

Part-Wise Adaptive Topology Graph Convolutional Network for Skeleton-Based Action Recognition

Actional-Structural Graph Convolutional Networks for Skeleton-based Action Recognition

MFGCN: an efficient graph convolutional network based on multi-order feature information for human skeleton action recognition

Skeleton-Based Action Recognition With Directed Graph Neural Networks

Graph Edge Convolutional Neural Networks for Skeleton Based Action Recognition

Multiple temporal scale aggregation graph convolutional network for skeleton-based action recognition

Multi-Stage Attention-Enhanced Sparse Graph Convolutional Network for Skeleton-Based Action Recognition

Temporal Enhanced Multi-Stream Graph Convolutional Nerual Networks For Skeleton-Based Action Recognition

An improved spatial temporal graph convolutional network for robust skeleton-based action recognition

Skeleton-based action recognition by part-aware graph convolutional networks

Graph Edge Convolutional Neural Networks for Skeleton-Based Action Recognition

Generalized Graph Convolutional Networks for Skeleton-based Action Recognition

AeS‐GCN: Attention‐enhanced semantic‐guided graph convolutional networks for skeleton‐based action recognition

Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections

Richly Activated Graph Convolutional Network for Robust Skeleton-Based Action Recognition

Multi-Scale Adaptive Graph Convolution Network for Skeleton-Based Action Recognition