Deep Learning-Based Human Pose Estimation: A Survey

Ce Zheng,Wenhan Wu,Chen Chen,Taojiannan Yang,Sijie Zhu,Ju Shen,Nasser Kehtarnavaz,Mubarak Shah
2023-07-04
Abstract:Human pose estimation aims to locate the human body parts and build human body representation (e.g., body skeleton) from input data such as images and videos. It has drawn increasing attention during the past decade and has been utilized in a wide range of applications including human-computer interaction, motion analysis, augmented reality, and virtual reality. Although the recently developed deep learning-based solutions have achieved high performance in human pose estimation, there still remain challenges due to insufficient training data, depth ambiguities, and occlusion. The goal of this survey paper is to provide a comprehensive review of recent deep learning-based solutions for both 2D and 3D pose estimation via a systematic analysis and comparison of these solutions based on their input data and inference procedures. More than 250 research papers since 2014 are covered in this survey. Furthermore, 2D and 3D human pose estimation datasets and evaluation metrics are included. Quantitative performance comparisons of the reviewed methods on popular datasets are summarized and discussed. Finally, the challenges involved, applications, and future research directions are concluded. A regularly updated project page is provided: \url{<a class="link-external link-https" href="https://github.com/zczcwh/DL-HPE" rel="external noopener nofollow">this https URL</a>}
Computer Vision and Pattern Recognition,Graphics,Multimedia
What problem does this paper attempt to address?
The aim of this paper is to comprehensively review and summarize recent human pose estimation (HPE) methods based on deep learning, and to systematically analyze and compare 2D and 3D human pose estimation. Specifically: 1. **Review of Deep Learning Methods**: The paper covers more than 260 related research papers from 2014 to the present, focusing on 2D and 3D human pose estimation methods based on deep learning, and classifies them according to input data, inference process, etc. 2. **Performance Evaluation**: The paper summarizes the performance of various 2D and 3D HPE methods on common datasets and compares them, revealing the advantages and disadvantages of different methods, as well as future research directions. 3. **Application Areas**: The paper outlines the wide applications of HPE technology in fields such as surveillance, augmented reality (AR), virtual reality (VR), healthcare, and more. 4. **Challenges and Future Directions**: The paper discusses the key challenges currently faced by HPE technology and points out future research directions for improving performance. Through these contents, the paper aims to provide researchers with a clear picture of the current research landscape and to promote the development of HPE technology.