A Novel Representation of DNA Sequence Based on CMI Coding

Wenbing Hou,Qiuhui Pan,Mingfeng He
DOI: https://doi.org/10.1016/j.physa.2014.04.030
IF: 3.778
2014-01-01
Physica A Statistical Mechanics and its Applications
Abstract:Graphical representation of DNA sequences provides a simple and intuitive way of analyzing and sorting various gene sequences. It is attractive to researchers to propose much more appropriate methods. In this study, a new graphical representation is presented. The method adopts the CMI coding to represent four nucleotides-A, G, C and T. Our approach considers not only the sequences’ structure but also the chemical structure for DNA sequence. We take several sets of data to test our method. The results of our experiment demonstrate that our representation is effective.
What problem does this paper attempt to address?