Implementation of Artificial Neural Networks in MapReduce Optimization

Changlong Li, Xuehai Zhou, Kun Lu, Chao Wang, Dong Dai
2014-01-01
Abstract: The ability to handle large datasets has become critical to the success of industrial organizations such as Microsoft, Amazon, Yahoo! and Facebook. MapReduce, an important cloud computing framework for data processing, is widely used by these organizations. However, its performance is seriously limited by its rigid configuration strategy. In practice, even a single simple MapReduce job requires end users to set a large number of tuning parameters, and users often run into performance problems because they do not know how to configure them. Moreover, once set, most parameters are never changed again, so misconfigurations can easily lead to performance loss. In this paper, we present a soft computing technique, the Artificial Neural Network (ANN), to configure MapReduce parameters automatically. Given a cluster and a MapReduce job, the framework can, with the help of the ANN, dynamically adapt the hardware and software configurations to the system and drive the system to an optimal configuration in acceptable time. Experimental results show that the ANN contributes substantially to optimizing system performance, achieving a speedup of 9x.
Keywords: Neural Network; MapReduce; Automatic Configuration; Optimization; Big Data