An Effective Algorithm for Mining Compressed Sequential Patterns

CHANG Lei,YANG Dongqing,WANG Tengjiao,TANG Shiwei
DOI: https://doi.org/10.3778/j.issn.1673-9418.2008.01.005
2008-01-01
Abstract:The problem of how to compress sequential patterns using SP-Features(Sequential Pattern Features) is examined.SP-Feature is a novel structure for representing a set of sequential patterns succinctly.A new similarity measure is proposed for clustering SP-Features and a SP-Feature combination method is designed.Based on the hierarchical clustering framework,an effective algorithm CSP is developed to mine compressed sequential patterns.Extensive experimental results on both real and synthetic datasets show that CSP can compress sequential patterns efficiently and effectively with low restoration el Tor(less than 4%on dense datasets).
What problem does this paper attempt to address?