Application and Research of Data Partition in Large Scale Database During Clustering

ZHENG Hong-ying,NI Lin,XIAO Di
DOI: https://doi.org/10.3969/j.issn.1001-3695.2007.02.068
2007-01-01
Abstract:People raised many algorithms,but there are many disadvantages,for example,much computing especially in large scale database,demanding for large volume of memory support and so on.Furthermore clustering quality will be affected when the cluster density and the distance between clusters are not even.In order to improve the efficiency and quality,this paper adopt pretreatment technology named data partition before clustering.After that,the number of data points is less and the distribution of data points is even.
What problem does this paper attempt to address?