[Alignment-Free Biomolecular Sequence Comparison Method].

Weijuan Fu, Yuanyuan Wang, Daru Lu
DOI: https://doi.org/10.3321/j.issn:1001-5515.2005.03.039
2005-01-01
Abstract: Biosequence analysis is the primary research field of bioinformatics, in which useful information is extracted by comparative analysis. Sequence alignment is the most common comparison method. However, comparison by alignment assumes that contiguity is conserved between homologous segments, an assumption that is at odds with genetic recombination. Multiple sequence alignment is, in addition, computationally expensive. Alignment-free sequence comparison methods are therefore needed. This paper reviews two main categories of such methods. The first is based on word (oligomer) frequencies and their distribution: sequences are compared using distances defined in a Cartesian space by their frequency vectors. In the second category, sequences are compared using Kolmogorov complexity and chaos theory.
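As an illustration of the two categories named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes a DNA alphabet, overlapping words of length k, and Euclidean distance between relative-frequency vectors for the first category. For the second, it assumes the normalized compression distance with zlib as a computable stand-in for Kolmogorov complexity, which itself is not computable. The function names `word_frequencies`, `euclidean_distance`, and `ncd` are hypothetical.

```python
from collections import Counter
from itertools import product
import math
import zlib


def word_frequencies(seq, k, alphabet="ACGT"):
    """Relative frequencies of all overlapping k-letter words, in a fixed order.

    Assumption: windows containing symbols outside `alphabet` are ignored.
    """
    words = ["".join(p) for p in product(alphabet, repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[w] for w in words)
    if total == 0:
        return [0.0] * len(words)
    return [counts[w] / total for w in words]


def euclidean_distance(u, v):
    """Euclidean distance between two frequency vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def ncd(x, y):
    """Normalized compression distance, with zlib output length as a rough
    proxy for Kolmogorov complexity (an assumption of this sketch)."""
    cx = len(zlib.compress(x.encode()))
    cy = len(zlib.compress(y.encode()))
    cxy = len(zlib.compress((x + y).encode()))
    return (cxy - min(cx, cy)) / max(cx, cy)


if __name__ == "__main__":
    s1 = "ACGTACGTACGGTTAACG"
    s2 = "ACGTTCGTACGATTAACC"
    d_word = euclidean_distance(word_frequencies(s1, 2), word_frequencies(s2, 2))
    print("word-frequency distance (k=2):", d_word)
    print("compression-based distance:", ncd(s1, s2))
```

Neither distance requires an alignment: the first depends only on word counts, the second only on compressed lengths. On sequences this short, compressor overhead dominates the second measure, so it is meaningful only for longer inputs.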