Mining Sequential Patterns with Periodic General Gap Constraints

You-Xi WU,Kun ZHOU,Jing-Yu LIU,He JIANG,Xin-Dong WU
DOI: https://doi.org/10.11897/SP.J.1016.2017.01338
2017-01-01
Chinese Journal of Computers
Abstract:Sequential pattern mining is to discover the frequent patterns in the sequences and plays an essential role in many critical data mining tasks with broad applications.Given sub-patterns pi and pj(i<j) can match events A and B respectively,traditional pattern mining methods can detect the sequences in which event B is after event A,but fail to find the sequences with event B occurring before event A.To tackle this challenge,in this paper,we propose sequential pattern mining with periodic general gap constraints with five characteristics as follows.The minimal gap constraint,namely general gap constraint,can be a negative value.All gap constraints of the pattern are the same.Any event in the sequence can be used more than once in different supports.The problem satisfies the Apriori property under the new definition of offset sequences.If the support ratio of a pattern is greater than the given threshold,the pattern is a frequent pattern.To solve the problem effectively,a depth-first search strategy is used to create the pattern tree.An Incomplete Nettree structure,the last layer of a Nettree which can be stored in an array,can represent the support of a pattern in the sequence.Pattern matching method is used to create the Incomplete Nettrees for all super patterns of a frequent pattern by one way scan over the sequence database.Therefore,we can calculate the support ratios of these super patterns and find the frequent ones.Experimental results validate the feasibility and effectiveness of the proposed algorithms.
What problem does this paper attempt to address?