Three-dimensional reconstruction of an object from a single image using deep convolutional neural networks
Denis V. Gadasin,Andrey V. Shvedov,Ivan A. Kuzin,,,
DOI: https://doi.org/10.36724/2072-8735-2022-16-7-29-35
2022-01-01
T-Comm
Abstract:The automatic creation of three-dimensional prototypes and digital copies of three-dimensional objects of the real world is a revolutionary innovation that is actively used today in many areas of human activity, for example, for identification in smartphones and e-commerce applications, as well as in visualization and design systems. This trend has intensified now that additive technologies have become available to a wide range of users, and large-scale storage of three-dimensional objects are becoming increasingly popular and widespread. One of the tasks that a person solves every day on an unconscious level is the cognition of images: visual, sound intelligible, tactile and others. Thanks to image recognition, it is possible to identify people by external signs and distinguish them from each other, identify sounds, classify different objects by similar properties, and also accurately determine the subjective characteristics of the observed objects, such as color, shape, volume and depth. The problem of recognizing images of objects of the surrounding world and understanding their scale and volume by two-dimensional projections is one of the most urgent and studied problems solved by computer vision methods. However, this class of tasks is quite difficult to formalize, which makes their solution time-consuming to develop and implement. The article describes the development of a software package that re-constructs three-dimensional scenes according to their projections using neural network machine learning methods: the basics of three-dimensional reconstruction are considered, a model of the general architecture of the PAK is proposed, the architecture is introduced the developed neural network, the results of training and test experiments.