Action Recognition With Motion Diversification and Dynamic Selection

Peiqin Zhuang, Yu Guo, Zhipeng Yu, Luping Zhou, Lei Bai, Ding Liang, Zhiyong Wang, Yali Wang, Wanli Ouyang
DOI: https://doi.org/10.1109/TIP.2022.3189811
IF: 10.6
2022-01-01
IEEE Transactions on Image Processing
Abstract: Motion modeling is crucial in modern action recognition methods. Because motion dynamics such as movement tempo and action amplitude can vary widely across video clips, adaptively capturing the appropriate motion information is a major challenge. To address this issue, we introduce a Motion Diversification and Selection (MoDS) module that generates diversified spatio-temporal motion features and then dynamically selects the suitable motion representation for categorizing the input video. Specifically, we first propose a spatio-temporal motion generation (StMG) module to construct a bank of diversified motion features with varying spatial neighborhoods and time ranges. Then, a dynamic motion selection (DMS) module is leveraged to choose the most discriminative motion feature, both spatially and temporally, from the feature bank. As a result, our proposed method can make full use of the diversified spatio-temporal motion information while maintaining computational efficiency at the inference stage. Extensive experiments on five widely used benchmarks demonstrate the effectiveness of the method, and we achieve state-of-the-art performance on Something-Something V1 & V2, which exhibit large motion variation.
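The generate-then-select idea in the abstract can be illustrated with a toy NumPy sketch. This is not the authors' implementation: the function names `motion_bank`, `box_filter`, and `select_motion`, the use of frame differences as "motion", the box filter as the "spatial neighborhood", and the largest-magnitude rule standing in for the learned DMS module are all assumptions made for illustration only.

```python
import numpy as np

def box_filter(x, k):
    """Average each frame of x (T, H, W) over a k x k spatial neighborhood."""
    if k == 1:
        return x
    p = k // 2
    padded = np.pad(x, ((0, 0), (p, p), (p, p)), mode="edge")
    out = np.zeros_like(x, dtype=float)
    h, w = x.shape[1:]
    for dy in range(k):
        for dx in range(k):
            out += padded[:, dy:dy + h, dx:dx + w]
    return out / (k * k)

def motion_bank(feats, strides=(1, 2), kernels=(1, 3)):
    """Hypothetical stand-in for StMG: build a bank of motion candidates.

    feats has shape (T, H, W). Each candidate is a temporal difference at
    one stride (time range), smoothed over one spatial neighborhood size.
    Returns an array of shape (len(strides) * len(kernels), T, H, W).
    """
    t = feats.shape[0]
    bank = []
    for s in strides:
        # Clamp the index so the last frames compare against the final frame.
        idx = np.minimum(np.arange(t) + s, t - 1)
        diff = feats[idx] - feats
        for k in kernels:
            bank.append(box_filter(diff, k))
    return np.stack(bank)

def select_motion(bank):
    """Hypothetical stand-in for DMS: hard selection per (t, y, x) position.

    The paper's selection is learned; here the candidate with the largest
    absolute response is picked, purely as an assumption for illustration.
    """
    choice = np.abs(bank).argmax(axis=0)
    selected = np.take_along_axis(bank, choice[None], axis=0)[0]
    return selected, choice

feats = np.random.default_rng(0).normal(size=(4, 6, 6))
bank = motion_bank(feats)
selected, choice = select_motion(bank)
print(bank.shape, selected.shape)
```

In this sketch the bank holds one candidate per (stride, kernel) pair, and the selection yields a single feature map of the same shape as the input, which mirrors the abstract's point that only the selected representation is passed on for classification.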