An entropy-based technique for classifying bacterial chromosomes according to synonymous codon usage

Andrew Hart,Servet Martínez
DOI: https://doi.org/10.1007/s00285-016-1067-4
2016-10-12
Journal of Mathematical Biology
Abstract:We present a framework based on information theoretic concepts and the Dirichlet distribution for classifying chromosomes based on the degree to which they use synonymous codons uniformly or preferentially, that is, whether or not codons that code for an amino acid appear with the same relative frequency. At its core is a measure of codon usage bias we call the Kullback–Leibler codon information bias (KL-CIB or CIB for short). Being defined in terms of conditional entropy makes KL-CIB an ideal and natural quantity for expressing a chromosome’s degree of departure from uniform synonymous codon usage. Applying the approach to a large collection of annotated bacterial chromosomes reveals three distinct groups of bacteria.
mathematical & computational biology,biology
What problem does this paper attempt to address?