High-Fidelity 3D Model Generation with Relightable Appearance from Single Freehand Sketches and Text Guidance

Tianrun Chen, Runlong Cao, Ankang Lu, Tao Xu, Xiaoling Zhang, Papa Mao, Min Zhang, Lingyun Sun, Ying Zang
DOI: https://doi.org/10.1109/icmew63481.2024.10645361
2024-01-01
Abstract: The rise of AR/VR has increased the demand for 3D models, but conventional CAD modeling is time-consuming and has a steep learning curve for novice users. To address this issue, this paper proposes Deep3DSketch-PA, a sketch-based 3D modeling approach for rapid content creation in the metaverse that uses a single free-hand sketch for modeling and text input for painting appearance. Because sketches are sparse and ambiguous, modeling from them is a challenging task. Deep3DSketch-PA develops a dual-stage optimization scheme with a specially designed network that can capture point and local features and a test-time optimization network for mesh refinement. We then introduce an appearance generation method that uses a material graph representation and text guidance to generate a consistent and relightable appearance. Unlike existing methods that can only produce “dirty” textures, our method can generate a highly photo-realistic appearance and production-ready materials for AR/VR. Extensive experiments demonstrate the effectiveness of the approach, which achieves state-of-the-art performance on both synthetic and real datasets. We believe that Deep3DSketch-PA has the potential to revolutionize the process of 3D modeling in metaverse applications by providing an intuitive and easy-to-use solution for novice users.