The Parallel Implementation of Deep Neural Decision Tree Based on Spark

Zhi-chang Huang, Jun-yi Wang, Lin Zheng, Jie-lin Fu
DOI: https://doi.org/10.1142/9789813146426_0107
2017-01-01
Abstract: We present a deep neural decision tree based on Spark, a novel approach that unifies classification trees with the representation learning functionality known from deep belief networks and runs on Spark to dramatically decrease the training time of the model. First, we introduce a stochastic, differentiable, and therefore back-propagation-compatible version of the decision tree, which guides representation learning in the lower layers of deep belief networks. Furthermore, we provide a parallel implementation of the decision tree back-propagation method based on Spark. We show experimental results on the Sloan Digital Sky Survey Data Release 7 (SDSS-DR7) star/galaxy set and find that the approach scales well with cluster size and achieves on-par or superior classification accuracy compared with the state-of-the-art models used on this data set.
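The abstract does not spell out how the tree is made differentiable. As a rough illustration only, the sketch below uses one common formulation of a stochastic decision tree, in which every internal node routes a sample to its left child with probability sigmoid(w·x + b), so that the probability of reaching each leaf is a smooth function of the parameters. The function names, the heap-ordered node layout, and the fixed leaf class distributions are assumptions made for this example, not details taken from the paper.

```python
import math


def sigmoid(z):
    """Logistic function used as the soft routing decision."""
    return 1.0 / (1.0 + math.exp(-z))


def leaf_probabilities(x, weights, biases):
    """Probability of reaching each leaf of a complete binary tree.

    Internal nodes are stored in heap order: node i has children
    2*i + 1 (left) and 2*i + 2 (right). Node i sends the sample left
    with probability sigmoid(weights[i] . x + biases[i]).
    """
    n_internal = len(weights)
    # A complete binary tree with n internal nodes has 2n + 1 nodes in total.
    reach = [1.0] + [0.0] * (2 * n_internal)
    for i in range(n_internal):
        score = sum(w * v for w, v in zip(weights[i], x)) + biases[i]
        go_left = sigmoid(score)
        reach[2 * i + 1] = reach[i] * go_left
        reach[2 * i + 2] = reach[i] * (1.0 - go_left)
    # The last n + 1 nodes are the leaves.
    return reach[n_internal:]


def predict(x, weights, biases, leaf_dists):
    """Class probabilities: leaf distributions mixed by leaf reach probability."""
    mu = leaf_probabilities(x, weights, biases)
    n_classes = len(leaf_dists[0])
    return [
        sum(m * dist[c] for m, dist in zip(mu, leaf_dists))
        for c in range(n_classes)
    ]


if __name__ == "__main__":
    # Hypothetical depth-2 tree: 3 internal nodes, 4 leaves, 2 classes.
    weights = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    biases = [0.0, 0.0, 0.0]
    leaf_dists = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4], [0.1, 0.9]]
    print(predict([0.5, -1.0], weights, biases, leaf_dists))
```

Because every step is a product of sigmoids, the output is differentiable with respect to the routing weights and to the input features, which is what allows an error signal to be passed back into lower network layers. In a data-parallel setting such as Spark, per-sample gradients of this kind could be computed on each partition and summed, although the abstract does not describe how the authors actually distribute the computation.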