Abstract:Video action recognition is an important research direction in the field of computer vision and pattern recognition, with extensive applications in intelligent video surveillance, human-computer interaction, and sports analysis. The development of data storage and computing hardware over the past decade has driven a shift from traditional feature extraction and machine learning algorithms to deep learning-based approaches. This paper reviews the current state of development, problems, and future research directions of video action recognition techniques. Traditional methods are gradually being replaced by deep learning methods such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), and long-short-term memory networks (LSTMs). These methods automatically extract features and handle time-dependency, significantly improving the accuracy and robustness of action recognition. In particular, models based on the attention mechanism further enhance action recognition performance by dynamically adjusting the focus of attention, a current hot spot in research. Despite many advances, video action recognition still faces several challenges, including high computational resource requirements, complex model training, dataset bias issues, and variations in real-world application scenarios such as viewpoint changes, lighting changes, and occlusion. Future research can explore multi-modal fusion, lightweight models, self-supervised learning, and cross-domain transfer learning to improve the accuracy, robustness, and generalization of action recognition. The review provided aims to offer researchers a comprehensive perspective on the current state of development and future research directions of video action recognition technology.

Video action recognition: A survey

A Review of Deep Learning Based Video Action Recognition Techniques

A Method of Simultaneously Action Recognition and Video Segmentation of Video Streams.

A Comprehensive Study of Deep Video Action Recognition

Human Action Recognition Using Deep Learning Methods.

View-invariant action recognition:a survey

A Comprehensive Survey of Vision-Based Human Action Recognition Methods

Human Behavior Analysis: A Survey on Action Recognition

[LIVER DISORDERS IN INFLUENZA IN CHILDHOOD].

A Survey on Backbones for Deep Video Action Recognition

Study of human action representation in video sequences

Recent Progress in Appearance-based Action Recognition

A review of video action recognition based on 3D convolution

Action recognition in compressed domains: A survey

Current Advances on Deep Learning-based Human Action Recognition from Videos: a Survey

A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications

Action Recognition In Rgb-D Egocentric Videos

Spatio-temporal Action Recognition: A Survey

How to Improve Video Analytics with Action Recognition: A Survey

Handcrafted Vs. Learned Representations for Human Action Recognition

Action Recognition By Learning Deep Multi-Granular Spatio-Temporal Video Representation