GenUDC: High Quality 3D Mesh Generation with Unsigned Dual Contouring Representation

Ruowei Wang,Jiaqi Li,Dan Zeng,Xueqi Ma,Zixiang Xu,Jianwei Zhang,Qijun Zhao
2024-10-23
Abstract:Generating high-quality meshes with complex structures and realistic surfaces is the primary goal of 3D generative models. Existing methods typically employ sequence data or deformable tetrahedral grids for mesh generation. However, sequence-based methods have difficulty producing complex structures with many faces due to memory limits. The deformable tetrahedral grid-based method MeshDiffusion fails to recover realistic surfaces due to the inherent ambiguity in deformable grids. We propose the GenUDC framework to address these challenges by leveraging the Unsigned Dual Contouring (UDC) as the mesh representation. UDC discretizes a mesh in a regular grid and divides it into the face and vertex parts, recovering both complex structures and fine details. As a result, the one-to-one mapping between UDC and mesh resolves the ambiguity problem. In addition, GenUDC adopts a two-stage, coarse-to-fine generative process for 3D mesh generation. It first generates the face part as a rough shape and then the vertex part to craft a detailed shape. Extensive evaluations demonstrate the superiority of UDC as a mesh representation and the favorable performance of GenUDC in mesh generation. The code and trained models are available at <a class="link-external link-https" href="https://github.com/TrepangCat/GenUDC" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition,Graphics
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address several key issues in high-quality 3D mesh generation: 1. **Generation of Complex Structures and Realistic Surfaces**: Existing 3D generation models typically use sequential data or deformable tetrahedral meshes for mesh generation. However, sequential methods struggle to generate complex structures with multiple faces, and deformable tetrahedral mesh methods (such as MeshDiffusion) cannot recover realistic surfaces due to the deformability of the mesh. 2. **Memory Limitations**: Sequential methods are constrained by memory limitations, which restrict the complexity of the generated mesh structures. For example, methods like PolyGen, MeshGPT, and PolyDiff encounter difficulties when generating meshes with more than 2800 faces. 3. **Ambiguity and Inaccurate Supervision**: The MeshDiffusion method requires 2D image supervision when fitting meshes, which can lead to wrinkles and inaccuracies on the generated mesh surfaces. To address these issues, the authors propose the GenUDC framework, which utilizes Unsigned Dual Contouring (UDC) as the mesh representation and adopts a two-stage coarse-to-fine generation process to produce high-quality 3D meshes. ### Main Contributions 1. **Proposed a New Framework GenUDC**: Utilizes UDC as the representation to generate high-quality 3D meshes. 2. **Designed a Two-Stage Coarse-to-Fine Generation Process**: First generates the face part, then the vertex part, solving the edge aliasing problem. 3. **Extensive Experimental Validation**: Demonstrates the superior performance of GenUDC in mesh generation and data fitting. Compared to MeshDiffusion, GenUDC improves running speed by 3274 times and reduces memory consumption to only 13%. ### Method Overview 1. **UDC Representation**: Discretizes the mesh into a regular grid, divided into face parts and vertex parts. The face part is represented by boolean values, and the vertex part includes all actual and potential vertices. 2. **Two-Stage Generation Process**: - **Face Part Generation**: Uses a Latent Diffusion Model (LDM) to generate the face part, determining the approximate shape and topology of the mesh. - **Vertex Part Generation**: Based on the rough shape, uses a vertex refiner to generate the vertex part, finely describing the mesh details. 3. **Implementation Details**: Trains VAE and U-Net using the AdamW optimizer, and the diffusion model is trained in a class-specific manner. During inference, the sampling method from Denoising Diffusion Probabilistic Models is used, with the inference steps set to 1000. Through these methods, GenUDC can generate high-quality 3D meshes with complex structures and realistic details, suitable for various 3D tasks.