Clustering Method for High Dimensional Data on MapReduce

Liao Songbo,He Zhenying
2011-01-01
Journal of Computer Research and Development
Abstract:As various audio data spring up,how to identify and analyze the high dimensional audio data draws the attention of researchers.In the process of music recognition,it is need to cluster music frames.However,the magnanimity of data and the complexity of the high dimensional frames make the task of analyzing high dimensional data resort to MapReduce for parallel computing on the distributed system.We propose a clustering system for high dimensional music data on Hadoop-MapReduce—HDCH.Experiment proves that the system has high availability and extendibility.Besides,HDCH can be used to process other clustering application on high dimensional data.
What problem does this paper attempt to address?