MapReduce Based Parallel Neural Networks in Enabling Large Scale Machine Learning

Yang Liu,Jie Yang,Yuan Huang,Lixiong Xu,Siguang Li,Man Qi
DOI: https://doi.org/10.1155/2015/297672
IF: 3.12
2015-01-01
Computational Intelligence and Neuroscience
Abstract:Artificial neural networks (ANNs) have been widely used in pattern recognition and classification applications. However, ANNs are notably slow in computation especially when the size of data is large. Nowadays, big data has received a momentum from both industry and academia. To fulfill the potentials of ANNs for big data applications, the computation process must be speeded up. For this purpose, this paper parallelizes neural networks based on MapReduce, which has become a major computing model to facilitate data intensive applications. Three data intensive scenarios are considered in the parallelization process in terms of the volume of classification data, the size of the training data, and the number of neurons in the neural network. The performance of the parallelized neural networks is evaluated in an experimental MapReduce computer cluster from the aspects of accuracy in classification and efficiency in computation.
What problem does this paper attempt to address?