Pattern Matching with Wildcards Based on Suffix Tree

HOU Bao-jian,XIE Fei,HU Xue-gang,LIU Ying-ling,WANG Hai-ping
DOI: https://doi.org/10.3969/j.issn.1002-137x.2012.12.040
2012-01-01
Computer Science
Abstract:Pattern matching with wildcards is a hot research problem that can be used in biological sequence analysis,text indexing,network intrusion detection,and so on.Aiming at the problem that the wildcards have strong limitations in the existing research work,pattern matching with flexible wildcards was studied.The wildcards can appear between any two substrings and can be specified with flexible length constraints.The nonlinear data structure—suffix tree was used to design a completeness algorithm PAST.In the prepare process,an online incremental algorithm was used to build the suffix tree which has priori knowledge of the text.In the search phase,the idea of dynamic programming was used to match the characters of the pattern.Experiments on DNA sequences show that our method has better perfor-mances in time than the related matching algorithm.
What problem does this paper attempt to address?