Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper

Prasoon Kumar Vinodkumar,Dogus Karabulut,Egils Avots,Cagri Ozcinar,Gholamreza Anbarjafari
DOI: https://doi.org/10.3390/e26030235
IF: 2.738
2024-03-08
Entropy
Abstract:The research groups in computer vision, graphics, and machine learning have dedicated a substantial amount of attention to the areas of 3D object reconstruction, augmentation, and registration. Deep learning is the predominant method used in artificial intelligence for addressing computer vision challenges. However, deep learning on three-dimensional data presents distinct obstacles and is now in its nascent phase. There have been significant advancements in deep learning specifically for three-dimensional data, offering a range of ways to address these issues. This study offers a comprehensive examination of the latest advancements in deep learning methodologies. We examine many benchmark models for the tasks of 3D object registration, augmentation, and reconstruction. We thoroughly analyse their architectures, advantages, and constraints. In summary, this report provides a comprehensive overview of recent advancements in three-dimensional deep learning and highlights unresolved research areas that will need to be addressed in the future.
physics, multidisciplinary
What problem does this paper attempt to address?
The paper aims to address challenges in 3D reconstruction, enhancement, and registration. Specifically: 1. **3D Reconstruction**: Constructing 3D models from a series of 2D images or 3D point clouds. This task is highly challenging due to the complexity of 3D geometric structures and the lack of spatial order. 2. **3D Enhancement**: Transforming existing data to generate new data while maintaining the integrity of the underlying information. This helps improve the quality and completeness of the data. 3. **3D Registration**: Matching multiple point clouds to the same coordinate system. Traditional geometric transformation and parameter optimization methods are effective, but deep learning provides a more comprehensive approach and has achieved significant results. The paper provides a detailed analysis of the latest deep learning methods applied to these tasks, including their architectures, advantages, and limitations, and highlights the research gaps that need to be addressed in the future. Additionally, the paper introduces common 3D data representation methods such as point clouds, voxels, and meshes, and outlines benchmark datasets used for training and evaluating models.