Overview of temporal action detection based on deep learning
Kai Hu,Chaowen Shen,Tianyan Wang,Keer Xu,Qingfeng Xia,Min Xia,Chengxue Cai
DOI: https://doi.org/10.1007/s10462-023-10650-w
IF: 9.588
2024-02-04
Artificial Intelligence Review
Abstract:Temporal Action Detection (TAD) aims to accurately capture each action interval in an untrimmed video and to understand human actions. This paper comprehensively surveys the state-of-the-art techniques and models used for TAD task. Firstly, it conducts comprehensive research on this field through Citespace and comprehensively introduce relevant dataset. Secondly, it summarizes three types of methods, i.e., anchor-based, boundary-based, and query-based, from the design method level. Thirdly, it summarizes three types of supervised learning methods from the level of learning methods, i.e., fully supervised, weakly supervised, and unsupervised. Finally, this paper explores the current problems, and proposes prospects in TAD task.
computer science, artificial intelligence