Clickstream Clustering Based on Closed Frequent Gapped Subsequence

MA Chao,SHEN Wei
DOI: https://doi.org/10.3969/j.issn.1000-3428.2010.23.024
2010-01-01
Abstract:Clustering of clickstreams in Web-logs can find Web visitors' using patterns,and categorize these visitors.However,traditional clustering method faces challenge of extracting representative feature vector,sparse clickstreams and feature vector.To solve the problems,a closed repetitive gapped subsequence mining based clickstream clustering method is proposed.Extract repetitive support of subsequence from clickstream,and construct feature vector.A bidirectional projected Euclidean distance based on fuzzy dissimilarity is proposed and used as distance measure of feature vectors.Clustering quality of BIRCH algorithm on clickstream is enhanced.
What problem does this paper attempt to address?