Constrained Sequence Alignment: A Dedicated Version and Its Applications

yunsheng chung,chuan yi tang
2013-01-01
Abstract:In this paper, we study a problem that arises naturally in biological applications. Given two sequences, along with a sequence of patterns, we want to align the two sequences such that the specified patterns are aligned together. This is the constrained sequence alignment problem and is defined in [14]. The multiple sequence version is called CMSA. In this paper, we focus on the pairwise version where one of the sequences is annotated with the correct occurrence positions of the patterns. This version solves many applications that one would use constrained alignment to solve, and the algorithm proposed here is more efficient than that for the original version. In addition, we propose an improved approximation algorithm for CMSA. Our algorithm improves that proposed in [3] from O(Ckmn) time and O(kmn) space to O(Ckn) time and O(kn) space, where k is the number of sequences and C is the maximum number of valid “constrained lists” (defined later).
What problem does this paper attempt to address?