Statistical distributions and entropy considerations in gene codes

Krystyna Lukierska-Walasek,Krzysztof Topolski,Krzysztof Trojanowski
DOI: https://doi.org/10.48550/arXiv.1407.2269
2014-07-05
Abstract:In our paper selected linguistic features of genomes to study the statistics of the gene codes are considered. We present the information theory from which it follows that if the system is described by distributions of hyperbolic type it leads to the possibility of entropy loss and stability. We show that the histograms of gene lengths are similar to that of language words. We show the correspondence between presented theory and results for the number of replicated genes and replicated fragments of genes in genomes for Borelia burgdorferi, Escherichia coli and Saccharomyces cerevisiae S288c.
Genomics,Biological Physics
What problem does this paper attempt to address?