A Survey on MapReduce Optimization Technologies

HUANG Shan, WANG Botao, WANG Guoren, YU Ge, LI Jiajia
2013-01-01
Abstract: As a parallel programming model for big data processing, MapReduce is attracting growing attention from academia and industry for its good scalability, availability, and fault tolerance. Many optimization technologies have been proposed to address the limitations MapReduce faces in applications. This paper first introduces the MapReduce framework, then compares research on MapReduce optimization technologies, covering column storage, indexing, joins, iterative computation, scientific computation, and scheduling algorithms. Finally, it analyzes the open challenges and identifies trends in this area.