Shape Modeling Method Based on Deep Learning
Juan ZHANG,Xi-Li WANG,Jian-Gong YANG
DOI: https://doi.org/10.11897/SP.J.1016.2018.00132
2018-01-01
Chinese Journal of Computers
Abstract:Models of the object shape play an important role in many image applications such as object detection,image segmentation and inpainting.Compared with the gray,texture,edge and other low-level visual features of the image,shape as high-level visual feature helps to describe the global visual information of an object better.Generally,the performance of algorithm will be improved after global shape information (expressed by a good model) joined.In recent years,deep learning models have attracted more and more attentions because of their excellent learning ability on modeling data with intrinsic structure.Deep learning models contain multiple layers and hidden perception units,which are used to simulate the cognitive mechanism of human brain.They can extract multi-level data features and represent complicated data distributions.They form high-level data abstract representation through combining the lower layer features.A deep learning model learning the representation and abstraction of multiple levels is more helpful to make sense of data,such as shape images.In view of the complicated and changeable object shape,we propose two kinds of shape models based on deep belief network and deep Boltzmann machine,and the methods of two models constructing,training and shape generating in this paper.Both of these two models are deep,hierarchical probabilistic models that make heavy use of hidden units to modal high-order dependencies between the observed units,which can make full use of low-level features and multi-layer high-level features of object shape to describe the probability distribution of training set.They have the potential of learning internal representation that become progressively complex at higher layers.The key feature of these two models is greedy layer-by-layer training that can be repeated several times in order to find a good set of modal parameters quickly,even for modal contains many layers and parameters.Thus they can effectively model the shape,this kind of model not only can express the shape of the training set,but also can generate shape that differ from samples in the training set.The shape change is relatively large when the shape of the training set is multiple categories,the modal can also define the multi-model distribution of the shape set.A deep learning model has strong ability of shape expression and can be applied in many kinds of tasks such as shape generating,shape completion,and denoising.We give some of the experimental results on the Weizmann Horse data set and Caltech101 Silhouettes data set.The results show that compared with the restricted Boltzmann machine model,the proposed two models based on deep learning can express the shape of training set better.The samples generated from the deep models involving structure of multi-layer nonlinear processing are more realistic,and the samples from the deep Boltzmann machine model are much more clear than deep belief network model.Experimental results in the graphical shape and similarity measure illustrate that deep learning model is fit for the training shape,and the model has better generalization ability.