Session Identification Based on Time Interval in Web Log Mining

L Zhuang, ZB Kou, CS Zhang
DOI: https://doi.org/10.1007/0-387-23152-8_50
2006-01-01
Abstract: In this paper, we calculate the time intervals between page views and analyze them to obtain a threshold, which is then used to break the web logs into sessions. From these time intervals, the frequency of each interval is counted, yielding a frequency vector for each IP. IPs whose frequency distributions show certain special features can be deemed single users. For these IPs, we can define a threshold for each individual IP and separate sessions at the points of long access time intervals.
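The steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the bin edges, and the fixed 600-second threshold are hypothetical choices made for illustration, and the abstract does not specify how the threshold is derived from the frequency vector. Records are assumed to be (IP, timestamp in seconds) pairs.

```python
from bisect import bisect_right
from collections import defaultdict


def intervals_by_ip(records):
    """Group timestamps by IP; return sorted timestamps and the gaps between them."""
    times = defaultdict(list)
    for ip, ts in records:
        times[ip].append(ts)
    result = {}
    for ip, ts_list in times.items():
        ts_list.sort()
        gaps = [b - a for a, b in zip(ts_list, ts_list[1:])]
        result[ip] = (ts_list, gaps)
    return result


def frequency_vector(gaps, bin_edges):
    """Count gaps per bin. Bin 0 holds gaps below bin_edges[0]; the last bin is open-ended."""
    counts = [0] * (len(bin_edges) + 1)
    for g in gaps:
        counts[bisect_right(bin_edges, g)] += 1
    return counts


def split_sessions(timestamps, threshold):
    """Start a new session whenever the gap to the previous page view exceeds threshold."""
    sessions = []
    for ts in timestamps:
        if sessions and ts - sessions[-1][-1] <= threshold:
            sessions[-1].append(ts)
        else:
            sessions.append([ts])
    return sessions


# Hypothetical log: one IP, timestamps in seconds.
records = [
    ("10.0.0.1", 0),
    ("10.0.0.1", 20),
    ("10.0.0.1", 45),
    ("10.0.0.1", 2000),
    ("10.0.0.1", 2030),
]
BIN_EDGES = [10, 60, 600]  # assumed bin boundaries in seconds
THRESHOLD = 600            # assumed per-IP threshold in seconds

timestamps, gaps = intervals_by_ip(records)["10.0.0.1"]
freq = frequency_vector(gaps, BIN_EDGES)
sessions = split_sessions(timestamps, THRESHOLD)
```

In this toy log the single long gap (1955 seconds) falls in the open-ended last bin and is the only point where a session break occurs. In practice the threshold would be chosen per IP from the shape of its frequency vector rather than fixed in advance.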