An Algorithm for DNA Multiple Sequence Alignment Based on Center Star Method and Keyword Tree

邹权,郭茂祖,王晓凯,张涛涛
DOI: https://doi.org/10.3321/j.issn:0372-2112.2009.08.021
2009-01-01
Abstract:Multiple sequence alignment is necessary and important for reconstructing evolutionary trees and comparing haplotype sequences.Center star method is always used to deal with lots of long sequences.However,square time complexity is a bottleneck for large data.In this paper,we propose a novel keyword tree based algorithm for improving the center star method.Aho-Corasick algorithm is employed to match a set of substrings and the rest regions are aligned by dynamic programming.Experiments show that the improved method runs faster than the initial center star method and clustalx.
What problem does this paper attempt to address?