Comparative statistical analysis of bacteria genomes in "word" context

Olga V. Kirillova
DOI: https://doi.org/10.1016/S0378-4371%2800%2900549-5
2000-10-17
Abstract: A statistical analysis of bacterial genome texts has been performed on 20 complete genomes obtained from GenBank. The ranked word distributions are approximated quite well by a logarithmic law. Results from the investigation of absent words show the considerably nonrandom character of DNA texts. Period-3 oscillations were found in the behavior of the autocorrelation function in several genomes. Short-range autocorrelations are present in short ($n=3$) words and practically absent in longer words.
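
To make the "word" terminology concrete, the following is a minimal Python sketch of how ranked word counts and absent words could be computed for a DNA string. It is not the paper's code: the overlapping sliding window, the four-letter alphabet, the function names, and the toy sequence are all assumptions made for illustration, and the abstract does not specify how words were extracted.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGT"  # assumed alphabet; ambiguous bases such as N are not handled


def count_words(seq, n):
    """Count overlapping words of length n (sliding window, step 1)."""
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def ranked_frequencies(counts):
    """Return the word counts sorted from most to least frequent (rank 1 first)."""
    return sorted(counts.values(), reverse=True)


def absent_words(counts, n):
    """Return every length-n word over ALPHABET that never occurs in counts."""
    all_words = ("".join(letters) for letters in product(ALPHABET, repeat=n))
    return [word for word in all_words if word not in counts]


if __name__ == "__main__":
    toy_sequence = "ATGGCGATGACGTTAGCATGGCA"  # hypothetical toy input, not real genome data
    counts = count_words(toy_sequence, 3)
    print(ranked_frequencies(counts))
    print(len(absent_words(counts, 3)), "of", len(ALPHABET) ** 3, "3-letter words are absent")
```

On a real genome, the ranked counts could then be plotted against the logarithm of the rank to check how well a logarithmic law describes them.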
Condensed Matter, Quantitative Biology