Effective algorithm for maximal pattern-based subspace clustering

Rong Hu,Yansheng Lu,Lei Zou,Chong Zhou
2007-01-01
Journal of Computational Information Systems
Abstract:Most traditional clustering algorithms consider the close values of objects in all the dimensions or a set of dimensions. In this paper we focus on finding an interesting pattern, where objects exhibit a coherent pattern of rise and fall in subspaces (a set of dimensions). We address the issues of scalability, usability and non-presumption of data distribution by proposing a novel approach, named EMaPle to mine the maximal pattern-based subspace clusters. Unlike conventional pattern-based subspace mining algorithms, EMaPle searches clusters only in the column enumeration space which is relatively few compared to the large number of row combinations in the typical datasets, and exploits novel pruning techniques. Both synthetic data sets and real data sets are used to evaluate EMaPle and demonstrate that it is more efficient and scalable than previous approach like MaPle.
What problem does this paper attempt to address?