DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model

Jingxiang Sun, Cheng Peng, Ruizhi Shao, Yuan-Chen Guo, Xiaochen Zhao, Yangguang Li, Yanpei Cao, Bo Zhang, Yebin Liu
2024-10-17
Abstract: We introduce DreamCraft3D++, an extension of DreamCraft3D that enables efficient high-quality generation of complex 3D assets. DreamCraft3D++ inherits the multi-stage generation process of DreamCraft3D, but replaces the time-consuming geometry sculpting optimization with a feed-forward multi-plane based reconstruction model, speeding up the process by 1000x. For texture refinement, we propose a training-free IP-Adapter module that is conditioned on the enhanced multi-view images to enhance texture and geometry consistency, providing a 4x faster alternative to DreamCraft3D's DreamBooth fine-tuning. Experiments on diverse datasets demonstrate DreamCraft3D++'s ability to generate creative 3D assets with intricate geometry and realistic 360° textures, outperforming state-of-the-art image-to-3D methods in quality and speed. The full implementation will be open-sourced to enable new possibilities in 3D content creation.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to make the generation of complex 3D assets far more efficient without sacrificing quality. Specifically, it proposes a system named DreamCraft3D++ that targets several key challenges in existing 3D generation methods:

1. **Slow generation speed**: Existing methods such as DreamCraft3D can produce high-quality 3D assets, but they usually take several hours to generate a single object. This is impractical in real applications, especially those that require rapid iteration or large-scale generation.
2. **Poor geometric and texture consistency**: With many optimization-based methods, each individual view of a complex object may look reasonable, yet the views are not coherent with one another, so the resulting 3D model looks unnatural when seen as a whole.
3. **Data scarcity**: 3D data is scarce compared with 2D data, which limits the generalization ability of models trained on 3D data. Generating high-quality 3D assets from limited 3D data is therefore an important problem.

To address these problems, DreamCraft3D++ introduces the following innovations:

- **Multi-Plane Reconstruction Model (MP-LRM)**: A feed-forward multi-plane reconstruction model replaces the original time-consuming geometry sculpting optimization, speeding up this stage by 1,000x.
- **Training-free IP-Adapter module**: To improve texture and geometry consistency, the paper proposes an IP-Adapter module that requires no training. The module is conditioned on the enhanced multi-view images and provides an alternative that is 4 times faster than DreamCraft3D's DreamBooth fine-tuning.
- **Efficient joint optimization of texture and geometry**: By dynamically selecting image embeddings based on camera position, the IP-Adapter module can efficiently refine texture and geometric detail while staying consistent with the multi-view input images.

Together, these innovations allow DreamCraft3D++ to be significantly faster than existing 3D generation methods while also exceeding them in quality, especially on complex geometry and realistic textures. The paper verifies these advantages through experiments on multiple datasets, demonstrating the potential of DreamCraft3D++ for 3D content creation.
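
The camera-dependent selection of image embeddings mentioned above can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes one plausible reading, in which each enhanced reference view has a known azimuth and the embedding of the view nearest to the current render camera is used as the IP-Adapter condition. The function names, the azimuth-only camera description, and the placeholder embeddings are all hypothetical.

```python
def angular_distance(a_deg, b_deg):
    """Smallest absolute difference between two azimuth angles, in degrees."""
    diff = abs(a_deg - b_deg) % 360.0
    return min(diff, 360.0 - diff)


def select_view_embedding(camera_azimuth_deg, view_azimuths_deg, view_embeddings):
    """Return (index, embedding) of the reference view closest to the camera.

    Assumption: views are described by azimuth only; a real system would
    likely also account for elevation and distance.
    """
    if not view_azimuths_deg:
        raise ValueError("at least one reference view is required")
    if len(view_azimuths_deg) != len(view_embeddings):
        raise ValueError("need exactly one embedding per reference view")
    # Pick the view with the smallest wrap-around angular distance.
    idx = min(
        range(len(view_azimuths_deg)),
        key=lambda i: angular_distance(camera_azimuth_deg, view_azimuths_deg[i]),
    )
    return idx, view_embeddings[idx]


if __name__ == "__main__":
    # Six hypothetical reference views spaced 60 degrees apart; the strings
    # stand in for real image-embedding tensors.
    azimuths = [0.0, 60.0, 120.0, 180.0, 240.0, 300.0]
    embeddings = ["emb_0", "emb_60", "emb_120", "emb_180", "emb_240", "emb_300"]
    for cam in (350.0, 95.0):
        idx, emb = select_view_embedding(cam, azimuths, embeddings)
        print(f"camera {cam:5.1f} deg -> view {idx} ({emb})")
```

A hard nearest-view choice is the simplest option. Blending the embeddings of the two closest views by angular weight would be a natural variant, but the summary above does not say which approach the paper takes.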