A Kind of Improved Data Clustering Algorithm in Web Log Mining

Jin Guo,Shengbing Zhang,Zheng Qiu
DOI: https://doi.org/10.2991/isrme-15.2015.438
2015-01-01
Abstract:Aiming at the user clustering and page clustering in Web log mining and based on the analysis of K-means clustering algorithm and matrix clustering algorithm, the paper presented an improved clustering algorithm that combining fuzzy matrix algorithm with K-means algorithm. Extract compressed sub-matrix from relational matrix of user and page, establishing user interval, and then divide all users into large intervals and separate the noise data, obtain initial value and classified number for K-means algorithm, effectively solve the defect in the K-means algorithm that always suppose or make a try to definite the classified number and the initial value, also include the lacking to exclude the noise data obstruction.
What problem does this paper attempt to address?