Three distances for rapid similarity analysis of DNA sequences

Wei Chen, Yusen Zhang
2009-01-01
Abstract: Three distances for assessing genomic similarity based on dinucleotide frequencies in large DNA sequences are introduced. The method requires neither homologous sequences nor prior sequence alignment. The analysis centers on symmetrized dinucleotide frequencies, which reflect DNA structural properties related to dinucleotide stacking energies and constraints on DNA curvature. To show the utility of the method, we use these distances to examine the similarities among exon 1 of the beta-globin gene from 11 different species.
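As a rough illustration of the alignment-free idea, the Python sketch below computes symmetrized dinucleotide frequencies and compares two sequences. It rests on assumptions that are not taken from the abstract: "symmetrized" is read here as pooling each dinucleotide with its reverse complement, and a plain Euclidean distance stands in for the three distances, which the abstract does not define. The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product
import math

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}
DINUCLEOTIDES = ["".join(p) for p in product("ACGT", repeat=2)]


def reverse_complement(dinuc):
    """Return the reverse complement of a dinucleotide, e.g. 'AC' -> 'GT'."""
    return "".join(COMPLEMENT[base] for base in reversed(dinuc))


def symmetrized_dinucleotide_freq(seq):
    """Frequency of each dinucleotide pooled with its reverse complement.

    Assumption: this pooling is one common reading of "symmetrized";
    the abstract itself does not spell out the definition.
    """
    seq = seq.upper()
    # Count overlapping dinucleotides, skipping any pair with a non-ACGT symbol.
    counts = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in COMPLEMENT and seq[i + 1] in COMPLEMENT
    )
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no valid dinucleotides")
    return {
        d: (counts[d] + counts[reverse_complement(d)]) / (2 * total)
        for d in DINUCLEOTIDES
    }


def euclidean_distance(f, g):
    """Illustrative stand-in distance; NOT one of the paper's three distances."""
    return math.sqrt(sum((f[d] - g[d]) ** 2 for d in DINUCLEOTIDES))


# Toy sequences for demonstration only; not real beta-globin data.
freq_a = symmetrized_dinucleotide_freq("ATGGTGCACCTGACTCCTGAG")
freq_b = symmetrized_dinucleotide_freq("ATGGTGCATCTGACTCCTGAG")
print(euclidean_distance(freq_a, freq_b))
```

Because each sequence is reduced to a fixed-length vector of 16 frequencies, two sequences of different lengths can be compared directly, with no alignment step.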