Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a survey

Shuo Yang,Xiaoling Gu,Zhenzhong Kuang,Feiwei Qin,Zizhao Wu
DOI: https://doi.org/10.1007/s00371-024-03641-7
IF: 2.835
2024-09-28
The Visual Computer
Abstract:The reconstruction of high-quality 3D clothed humans from monocular images or videos has gained popularity in recent years due to its significant practical applications. While several surveys have addressed the reconstruction of full-body parametric human models from images or videos, this survey specifically delves into the challenges and methodologies of reconstructing 3D clothed humans. It covers both pose-dependent and dynamic approaches to clothed human reconstruction. Regarding pose-dependent clothed human reconstruction from monocular images, we investigate methodologies that employ regression models trained on high-quality 3D scans to estimate human geometry with clothing. Additionally, we explore research leveraging texture priors within large-scale diffusion models to enhance the inference of human appearance in occluded or unseen areas. In terms of dynamic clothed human reconstruction from monocular and sparse multi-view videos, we analyze human modeling techniques utilizing neural radiance fields and 3D Gaussian representations, which employ deformation fields to capture human movements across frames. Furthermore, we provide an overview of the datasets and commonly used quantitative evaluation metrics in these studies. Finally, we conclude by discussing open issues and proposing future research directions in the realistic reconstruction of clothed humans, emphasizing areas that warrant additional investigation.
computer science, software engineering
What problem does this paper attempt to address?