A Novel Kernel Possibilistic Fuzzy C-Means Clustering Algorithm for Large Scale Data Sets

Yu Qu, Hongye Su, Ying Zhang, Jian Chu
2007-01-01
Abstract: The kernel method (KM) replaces the inner product with an appropriate positive definite function and thereby implicitly performs a nonlinear mapping of the input data into a high-dimensional feature space. Incorporating KM enables the Kernel Possibilistic Fuzzy c-Means (KPFCM) algorithm to explore the inherent data pattern in the new space. However, applications of KPFCM are confined to small-scale data sets because of its high computation and storage costs. This paper presents the KPFCM-L algorithm to solve the large-scale clustering problem. In KPFCM-L, the kernel method is adopted to handle nonlinearly separable data and obtain nonlinear cluster boundaries. The proposed algorithm is applied to a customer segmentation application, and the simulation results indicate that it is very efficient for large-scale data sets.
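The kernel substitution described in the abstract can be illustrated with a minimal sketch. This is not the KPFCM or KPFCM-L algorithm itself, which the abstract does not specify. It only shows the general identity that a squared distance in feature space, ||phi(x) - phi(v)||^2 = K(x, x) - 2K(x, v) + K(v, v), can be evaluated from kernel values without ever computing the mapping phi. The Gaussian kernel, the parameter `sigma`, and all function names are assumptions chosen for illustration.

```python
import math


def gaussian_kernel(x, y, sigma=1.0):
    """Gaussian (RBF) kernel, a standard positive definite function.

    The choice of kernel and of sigma is an assumption for illustration.
    """
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def feature_space_sq_distance(x, v, kernel=gaussian_kernel):
    """Squared distance ||phi(x) - phi(v)||^2 using kernel values only.

    The mapping phi is never computed explicitly; the inner products
    <phi(a), phi(b)> are replaced by kernel(a, b).
    """
    return kernel(x, x) - 2.0 * kernel(x, v) + kernel(v, v)


def kernel_matrix(points, kernel=gaussian_kernel):
    """Full n-by-n kernel (Gram) matrix for a list of points."""
    n = len(points)
    return [[kernel(points[i], points[j]) for j in range(n)] for i in range(n)]


if __name__ == "__main__":
    data = [[0.0, 0.0], [1.0, 1.0], [3.0, 0.5]]
    print(feature_space_sq_distance(data[0], data[1]))
    gram = kernel_matrix(data)
    print(len(gram), "x", len(gram[0]), "kernel matrix")
```

A full kernel matrix over n points has n^2 entries, which is one common reason kernel-based clustering becomes expensive in time and memory as n grows. Whether this is the specific bottleneck that KPFCM-L addresses is not stated in the abstract.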