A Comprehensive Survey on 3D Content Generation

Jian Liu, Xiaoshui Huang, Tianyu Huang, Lu Chen, Yuenan Hou, Shixiang Tang, Ziwei Liu, Wanli Ouyang, Wangmeng Zuo, Junjun Jiang, Xianming Liu
2024-03-19
Abstract: Recent years have witnessed remarkable advances in artificial intelligence generated content (AIGC), with diverse input modalities, e.g., text, image, video, audio, and 3D. 3D is the visual modality closest to the real-world 3D environment and carries enormous knowledge. 3D content generation has both academic and practical value while also presenting formidable technical challenges. This review aims to consolidate developments within the burgeoning domain of 3D content generation. Specifically, a new taxonomy is proposed that categorizes existing approaches into three types: 3D native generative methods, 2D prior-based 3D generative methods, and hybrid 3D generative methods. The survey covers approximately 60 papers spanning the major techniques. In addition, we discuss limitations of current 3D content generation techniques and point out open challenges as well as promising directions for future work. Accompanying this survey, we have established a project website that provides resources on 3D content generation research. The project page is available at
Computer Vision and Pattern Recognition, Artificial Intelligence
What problem does this paper attempt to address?
The paper attempts to address the technical and application challenges in the field of 3D content generation. Specifically, 3D content generation technology holds significant value in both academic and practical applications, but it also faces substantial technical difficulties. The paper aims to integrate and evaluate the latest developments in the field of 3D content generation, proposing a new classification method that divides existing 3D content generation methods into three categories: 3D native generation methods, 3D generation methods based on 2D priors, and hybrid 3D generation methods.

### Main Objectives of the Paper:

1. **Systematic Classification**: Propose a new classification method that systematically divides 3D content generation technologies into three categories.
2. **Comprehensive Review**: Cover approximately 60 related papers, encompassing major technical methods.
3. **Discuss Limitations and Future Directions**: Discuss the limitations of current 3D content generation technologies, pointing out open challenges and future research directions.
4. **Resource Compilation**: Establish a project website providing resources related to 3D content generation research.

### Application Areas of 3D Content Generation:

- **Game and Entertainment Design**: Reduce the time and labor costs of character and object design.
- **Architecture**: Quickly generate 3D concept models, bridging the gap between designers and clients.
- **Industrial Design**: Virtually generate all 3D models and assemble them into comprehensive models, reducing time and material waste.

### Main Technical Classifications:

1. **3D Native Generation Methods**: Directly use 3D datasets to train networks and generate 3D assets. These methods require a large amount of 3D data, but 3D data is relatively scarce.
2. **3D Generation Methods Based on 2D Priors**: Utilize large-scale paired image-text datasets to train 2D diffusion models and then generate 3D models. This method can leverage abundant 2D data to compensate for the lack of 3D data.
3. **Hybrid 3D Generation Methods**: Combine 3D native and 2D prior-based methods, utilizing the advantages of each to generate 3D content.

### Future Research Directions:

- **Improve Generation Quality**: Including resolution, geometric details, and texture fidelity.
- **Multi-view Consistency**: Ensure the consistency of generated 3D content from different perspectives.
- **Optimize Generation Speed**: Increase the generation speed in interactive applications.
- **Expand Application Scenarios**: Explore more application areas for 3D content generation, such as healthcare, education, etc.

Through these efforts, the paper aims to provide a comprehensive guide for practitioners in both industry and academia, helping them better understand and address the rapid developments in the field of 3D content generation.
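
The three-way taxonomy summarized above can be read as a simple decision rule on where a method gets its supervision. The sketch below is only an illustration of that reading, not code from the survey: the `Method` descriptor, its two boolean fields, and the `categorize` function are hypothetical names introduced here, and real methods may not reduce cleanly to two flags.

```python
from dataclasses import dataclass
from enum import Enum


class Category(Enum):
    """The three categories named in the summary."""
    NATIVE_3D = "3D native"
    PRIOR_2D = "2D prior-based"
    HYBRID = "hybrid"


@dataclass(frozen=True)
class Method:
    """Hypothetical descriptor of a generation method (assumption, not from the survey)."""
    name: str
    trains_on_3d_data: bool  # trained directly on 3D datasets
    uses_2d_prior: bool      # relies on a 2D model trained on image-text data


def categorize(method: Method) -> Category:
    """Map a method to one of the three categories under the two-flag assumption."""
    if method.trains_on_3d_data and method.uses_2d_prior:
        return Category.HYBRID
    if method.trains_on_3d_data:
        return Category.NATIVE_3D
    if method.uses_2d_prior:
        return Category.PRIOR_2D
    raise ValueError(f"{method.name!r} uses neither 3D data nor a 2D prior")


if __name__ == "__main__":
    # Placeholder method names, purely for demonstration.
    examples = [
        Method("example-a", trains_on_3d_data=True, uses_2d_prior=False),
        Method("example-b", trains_on_3d_data=False, uses_2d_prior=True),
        Method("example-c", trains_on_3d_data=True, uses_2d_prior=True),
    ]
    for m in examples:
        print(m.name, "->", categorize(m).value)
```

Treating "hybrid" as the case where both flags are set mirrors the summary's description of hybrid methods as combining the other two; a method with neither source of supervision falls outside the taxonomy, so the sketch raises an error rather than guessing.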