Estimation and maintenance of frequent pattern on data streams

Guo Jie Song,Shiwei Tang,Dongqing Yang,Tengjiao Wang
2004-01-01
Ruan Jian Xue Bao/Journal of Software
Abstract:In this paper, the methods are investigate for online, frequent pattern mining of stream data, with the following contributions: (1) based on heuristic methodology and sample theory, step-by-step data stream mining method is used to estimate potential pattern set; (2) will find any length pattern not only single item pattern; (3) to find more appropriate length of each segment satisfying accuracy requirement, Hoeffding bound theory was introduced and revised to make it more suit for pattern mining; (4) a maintenance approach for estimating frequent patterns is developed for on-line analysis. Based on this design, estimation and maintenance algorithms are proposed for efficient analysis of data streams. This performance study compares the proposed algorithms and identifies the most accuracy-, memory- and time- efficient algorithms for stream data analysis.
What problem does this paper attempt to address?