BRMR: TAL Based on Boundary Refinement and Multi-scale Regression.

Jing Jiang, Jiankun Zhu, Lining Wang, Hongxun Yao
DOI: https://doi.org/10.1007/978-3-031-46314-3_21
2023-01-01
Abstract: Transformer-based networks have been widely used in Temporal Action Localization (TAL). One example is ActionFormer, the previous state-of-the-art method. By analyzing the predictions of these transformer-based models, we find that although they are lightweight and powerful, they still have two drawbacks: inaccurate boundary predictions and unreliable confidence in the results. We therefore propose the Boundary Refinement and Multi-scale Regression (BRMR) model to address these two weaknesses and improve performance. The core of BRMR is the RCE Head and the MTDR Module. The RCE Head supervises the quality of the predicted action-segment boundaries during training, giving the model an objective indicator of segment quality at test time. The MTDR Module synthesizes and extracts information from the Encoders to obtain aggregated feature pyramids with different temporal scales, which helps the model perceive actions at different scales. BRMR achieves state-of-the-art results on THUMOS14 and EPIC-Kitchens 100, and comparable performance on ActivityNet 1.3.