A Study on Parallel Algorithm of the Gene Expression Data Clustering Analysis

MAO Shao-yang,LI Ken-li
DOI: https://doi.org/10.3969/j.issn.1000-7180.2007.09.038
2007-01-01
Abstract:Put forward a clustering parallel algorithms based on the density. Use MPI under the APRAM model, passing twice computing with time complexity is O(n2) that of the Euclidean distance matrix and the density function, can make the time complexity of clustering procedure be O(n), reduce the time complexity of clustering through adding once computing. The experiment based on eight nodes indicates that this algorithm can attain higher parallel accelerate ratio than the same kind algorithm, raise the clustering rate of the high dimension living data.
What problem does this paper attempt to address?