PR-FCM: A polynomial regression-based fuzzy C-means algorithm for attribute-associated data

Yong Pang,Maolin Shi,Liyong Zhang,Xueguan Song,Wei Sun
DOI: https://doi.org/10.1016/j.ins.2021.11.056
IF: 8.1
2022-03-01
Information Sciences
Abstract:Partitioning data into internally homogeneous parts is an important problem when mining in situ engineering data. In this paper, a polynomial regression-based fuzzy c-means (PR-FCM) clustering algorithm that utilizes the functional relationships among the attributes of the input dataset is proposed. In this algorithm, a polynomial regression equation is taken as the center of each cluster instead of the cluster prototype used in conventional FCM, and the difference between a sample and a cluster prototype is defined as the distance between the actual value of one attribute and the corresponding predicted value provided by its own polynomial regression equation. An alternating optimization method is designed to optimize the new clustering objective function of the proposed algorithm. A series of experiments on synthetic and real-world datasets are conducted to evaluate the performance of the PR-FCM algorithm, which exhibits higher effectiveness and possesses more advantages than the original FCM algorithm. The PR-FCM algorithm is applied to tunnel boring machine (TBM) operation data from a TBM project in China. The experimental results show that the proposed algorithm can effectively cluster TBM operation data.
computer science, information systems
What problem does this paper attempt to address?