Overview of temporal action detection based on deep learning

Kai Hu,Chaowen Shen,Tianyan Wang,Keer Xu,Qingfeng Xia,Min Xia,Chengxue Cai
DOI: https://doi.org/10.1007/s10462-023-10650-w
IF: 9.588
2024-02-04
Artificial Intelligence Review
Abstract:Temporal Action Detection (TAD) aims to accurately capture each action interval in an untrimmed video and to understand human actions. This paper comprehensively surveys the state-of-the-art techniques and models used for TAD task. Firstly, it conducts comprehensive research on this field through Citespace and comprehensively introduce relevant dataset. Secondly, it summarizes three types of methods, i.e., anchor-based, boundary-based, and query-based, from the design method level. Thirdly, it summarizes three types of supervised learning methods from the level of learning methods, i.e., fully supervised, weakly supervised, and unsupervised. Finally, this paper explores the current problems, and proposes prospects in TAD task.
computer science, artificial intelligence
What problem does this paper attempt to address?
The paper mainly focuses on the field of Temporal Action Detection (TAD), particularly on how to accurately capture the time intervals of each action in untrimmed videos and understand human action behaviors. Specifically, the goals of the paper include: 1. **Comprehensive Review of Existing Technologies and Models**: The paper conducts a comprehensive survey of the latest technologies and models in the field of temporal action detection using the literature research tool Citespace and introduces relevant datasets. 2. **Method Classification Summary**: From the perspective of design methods, the existing methods are classified into three types: Anchor-based, Boundary-based, and Query-based. 3. **Supervised Learning Methods Summary**: From the perspective of learning methods, the paper summarizes three types of supervised learning methods: fully supervised, weakly supervised, and unsupervised. 4. **Discussion of Issues and Future Prospects**: The paper also discusses the current issues faced by temporal action detection tasks and proposes future research directions. In summary, this paper aims to provide a comprehensive and up-to-date review of the field of temporal action detection to help researchers better understand and advance the technological development in this area.