A Novel Motion Accumulation and Selection Network for Rgb-Based Action Recognition

huafeng wang,Hanlin Li,Wanquan Liu,Xianfeng Gu
DOI: https://doi.org/10.2139/ssrn.4170497
2022-01-01
SSRN Electronic Journal
Abstract:A large number of existing methods have demonstrated the significance of motion information for action recognition in video. In the literature, most of the previous methods rely heavily on the temporal differences of features extracted by convolutional networks (CNN) to represent the motion. However, this type of motion representation approach may have two potential drawbacks: 1) The difference operation may cause the incompleteness of the extracted moving target contour; 2) Treating all the extracted motion features equally may lead to situations where some motion features will not contribute to the classification or even produce negative incentives. Inspired by the fact that the human eye is often a cumulative and selective process of visual attention when observing a video sequence, we propose a motion accumulation and selection network (MAS-Net) based on a novel embedded 2D CNN. According to the experimental results on typical video datasets such as Something-Something V1&V2 and Kinetics-400, it is shown that MAS-Net has achieved the state-of-the-arts on Something-Something V1&V2 and the competitive results on Kinetics-400, while the computational load is kept at a relatively low level.
What problem does this paper attempt to address?