Research of web sequential pattern mining based on improved AprioriAll algorithm

WU Hai-yan,ZHU Jing-jun,GAO Guo-zhu,CHENG Zhi-rui
2010-01-01
Abstract:To reduce the generation of candidate sequences and the scans to sequence database for AprioriAll algorithm,an efficient sequential pattern mining method based on improved AprioriAll algorithm is presented. Firstly,data are preprocessed. Then the sequentialpattern mining is finished by improved AprioriAll algorithm. The improvements of AprioriAll algorithm are mainly two points:one is to change the connection of candidate sequences to reduce the generation of candidate sequences; the other is to reduce the needless data-base scans to improve the efficiency of algorithm. Finally,the efficiency and validity of improved AprioriAll algorithm is validated byexperiments.
What problem does this paper attempt to address?