PATH CLUSTERING: DISCOVERING THE KNOWLEDGE IN THE WEB SITE

Shi WANG,Wen GAO,Jin-tao LI,Hui XIE
2001-01-01
Journal of Computer Research and Development
Abstract:When users access a Web site, the access of the users represents the interest of users in the Web pages of the Web site. Each user's interest can be manifested by the sequence of each user access. After processing the Log in the Web site and identifying each user access transaction, the access paths of all the users can be clustered. This is called path clustering. Each cluster can then represent the similar access interest of the users in the cluster. Presented in this paper is a new clustering approach: K-paths to partition the users' access according to the interest of the users. In this approach, according to the requirement of the clustering, the new method is defined to measure similarity and to get the center of a cluster. The experiment shows that this approach is successful.
What problem does this paper attempt to address?