SUA-Based Algorithm for Finding SATRs in DNA Sequence

WANG Di,ZHAO Yi,CHEN Bai-chen,WANG Guo-ren
DOI: https://doi.org/10.3321/j.issn:1005-3026.2007.02.009
2007-01-01
Abstract:Studies finding approximate repetitions in DNA sequence,which is an important problem in gene analysis.Analyzing the approximate repetitions and similarity measurements and based on Hamming Distance,two definitions of pattern-similarity and segment-similarity are proposed as new measurements of similarity,then on the basis of the two definitions,a new concept of approximate repetition,i.e., the segment-similarity based approximate tandem repeats(SATR) is given.In addition,the succeeding unit array(SUA) as a lightweight index is introduced in finding SATRs in DNA sequence with an algorithm designed to find SATRs based on the index.Theoretical analysis and experiment results both show that the SATR finding algorithm based on SUA is superior to other methods in finding results and time saving.
What problem does this paper attempt to address?