Efficient Algorithm for Mining of Frequent Itemsets over Uncertain Data Streams
Liu Yinlei,Liu Yubao,Chen Cheng
2011-01-01
Journal of Computer Research and Development
Abstract:Uncertain data streams exist in lots of real applications,such as,sensor network data and tracks of mobile object etc.In existing frequent itemsets mining algorithms over uncertain data streams,the generalized data structure used to store uncertain data streams is often complex and difficult to maintain,and the mining algorithm cost is also expensive.In this paper,we present a new generalized data structure,that is,SRUF-tree,to store uncertain data streams.In detail,this data structure includes three key components,namely,global SRtree,temporary table,and windows queue.All itemsets in recent window are contained in global SRtree in a condensed format.Each batch of itemset information is stored in temporary table.Based on SRUF-tree,we also present a new mining algorithm for frequent itemsets,that is,SRUF-mine algorithm.This algorithm searches in depth the global tree to find frequent itemsets.The maintaining of SRUF-tree is simple and just handles the temporary table corresponds to the oldest batch of itemsets in windows queue.The compared experiment results with existing UF-streaming algorithm show the efficiency of our presented algorithm SRUF-mine.