MSG-Voxel-GAN: Multi-Scale Gradient Voxel GAN for 3D Object Generation

Bingxu Wang,Jinhui Lan,Feifan Li
DOI: https://doi.org/10.1007/s11042-023-17116-9
IF: 2.577
2023-01-01
Multimedia Tools and Applications
Abstract:The Generative Adversarial Network (GAN) has been the subject of significant attention since it was introduced. It has been widely used in the image domain. However, there has been less research conducted in three dimensions (3D). Moreover, further research in the field of 3D generation has focused on the direct processing of point clouds. Voxel-based methods for 3D object generation were introduced in the early years, but there have been rare subsequent studies. Current methods generate 3D objects of subpar quality. To improve the quality of generated 3D objects, the Multi-Scale Gradient Voxel GAN (MSG-Voxel-GAN) is proposed. Voxel-based methods have achieved promising results in 3D object detection for their fast computation speed and accurate feature description. Therefore, in this paper, we propose a 3D object classification method based on Voxel-RCNN and incorporate it into the discriminator of GAN to generate 3D objects. We apply the network architecture of Multi-Scale Gradient GAN (MSG-GAN) for stable training. Experimental results show that the voxel-based feature extraction method can accurately describe the features of 3D objects, leading to precise classification. The training process of the proposed method is stable, and the quality of generated 3D objects significantly exceeds that of other methods in both subjective visual and objective evaluation metrics. This method can facilitate the development of 3D generative techniques based on GAN.
What problem does this paper attempt to address?