One novel representation of DNA sequence based on the global and local position information

Zhiyi Mo,Wen Zhu,Yi Sun,Qilin Xiang,Ming Zheng,Min Chen,Zejun Li
DOI: https://doi.org/10.1038/s41598-018-26005-3
2018-05-15
Abstract:One novel representation of DNA sequence combining the global and local position information of the original sequence has been proposed to distinguish the different species. First, for the sufficient exploitation of global information, one graphical representation of DNA sequence has been formulated according to the curve of Fermat spiral. Then, for the consideration of local characteristics of DNA sequence, attaching each point in the curve of Fermat spiral with the related mass has been applied based on the relationships of neighboring four nucleotides. In this paper, the normalized moments of inertia of the curve of Fermat spiral which composed by the points with mass has been calculated as the numerical description of the corresponding DNA sequence on the first exons of beta-global genes. Choosing the Euclidean distance as the measurement of the numerical descriptions, the similarity between species has shown the performance of proposed method.
What problem does this paper attempt to address?