Genetic K-Modes Based Dna Splice Site Adjacent Sequences Feature Analysis

Quanwei Zhang,Qinke Peng,Hequan Sun,Kankan Li
DOI: https://doi.org/10.1109/WCICA.2008.4593242
2008-01-01
Abstract:DNA splice site adjacent sequences have remarkable conservative feature, and mining their underlying biological knowledge has become a key issue in the field of DNA sequences analysis. In this paper, we analyze the feature of human being's DNA splice site adjacent sequences. Firstly, we propose a kind of DNA splice site sequences clustering method based on Genetic K-odes, secondly, we analyze the frequency of various bases, di-bases and tri-bases about the experimental data set and each cluster, lastly, we propose one kind of Markov model based frequent patterns discovery algorithm and use it to mine the frequent patterns of the experimental data set and each cluster.
What problem does this paper attempt to address?