Deep historical long short-term memory network for action recognition

Jiaxin Cai,Junlin Hu,Xin Tang,Tzu-Yi Hung,Yap-Peng Tan
DOI: https://doi.org/10.1016/j.neucom.2020.03.111
IF: 6
2020-09-01
Neurocomputing
Abstract:<p>Human action recognition technology has received increasing interest recently. It is very useful in sports game analysis. Most of the action recognition methods in sports mainly focus on recognizing which sport is being performed. However, recognition of the specific action in videos is important for the analysis of tennis matches. Hence, in this paper, we proposed a deep historical long short-term memory network for video-based tennis action recognition and general action recognition. First, the spatial representations were extracted from each frame using a pre-trained convolutional neural networks (CNNs). To describe the temporal information, a stacked multi-layer long short-term memory networks (LSTMs) was used. The historical information of the past frames is important for modeling the action. So we proposed a historical information layer that was added on the top of the multi-layered LSTM network. A historical feature of each video was generated by hybridizing the hidden state of LSTMs at time t and the historical updated feature at time t-1 with an updating scheme and utilized for classification. Experiments on the benchmarks demonstrate that our method that used only simple raw RGB video outperforms state-of-the-art baselines for both general action recognition and tennis action recognition.</p>
computer science, artificial intelligence
What problem does this paper attempt to address?