Joint2Human: High-quality 3D Human Generation Via Compact Spherical Embedding of 3D Joints

Muxin Zhang, Qiao Feng, Zhuo Su, Chao Wen, Zhou Xue, Kun Li
DOI: https://doi.org/10.1109/cvpr52733.2024.00142
2024-01-01
Computer Vision and Pattern Recognition
Abstract: 3D human generation is increasingly significant in various applications. However, the direct use of 2D generative methods in 3D generation often results in losing local details, while methods that reconstruct geometry from generated images struggle with global view consistency. In this work, we introduce Joint2Human, a novel method that leverages 2D diffusion models to generate detailed 3D human geometry directly, ensuring both global structure and local details. To achieve this, we employ the Fourier occupancy field (FOF) representation, enabling the direct generation of 3D shapes as preliminary results with 2D generative models. With the proposed high-frequency enhancer and the multi-view recarving strategy, our method can seamlessly integrate the details from different views into a uniform global shape. To better utilize the 3D human prior and enhance control over the generated geometry, we introduce a compact spherical embedding of 3D joints. This allows for an effective guidance of pose during the generation process. Additionally, our method can generate 3D humans guided by textual inputs. Our experimental results demonstrate the capability of our method to ensure global structure, local details, high resolution, and low computational cost simultaneously. More results and the code can be found on our project page at http://cic.tju.edu.cn/faculty/likun/projects/Joint2Human.