CloudNMF: a MapReduce Implementation of Nonnegative Matrix Factorization for Large-Scale Biological Datasets.

Ruiqi Liao,Yifan Zhang,Jihong Guan,Shuigeng Zhou
DOI: https://doi.org/10.1016/j.gpb.2013.06.001
2014-01-01
Abstract:In the past decades, advances in high-throughput technologies have led to the generation of huge amounts of biological data that require analysis and interpretation. Recently, nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data as well as to interpret them, and has been applied to various fields of biological research. In this paper, we present CloudNMF, a distributed open-source implementation of NMF on a MapReduce framework. Experimental evaluation demonstrated that CloudNMF is scalable and can be used to deal with huge amounts of data, which may enable various kinds of a high-throughput biological data analysis in the cloud. CloudNMF is freely accessible at http://admis.fudan.edu.cn/projects/CloudNMF.html.
What problem does this paper attempt to address?