Abstract:Abstract Frequencies of synonymous codons are typically non-uniform, despite the fact that such codons correspond to the same amino acid in the genetic code. This phenomenon, known as codon bias, is broadly believed to be due to a combination of factors including genetic drift, mutational biases, and selection for speed and accuracy of codon translation; however, quantitative modeling of codon bias has been elusive. We have developed a biophysical population genetics model which explains genome-wide codon frequencies observed across 20 organisms. We assume that codons evolve independently of each other under the influence of mutation and selection forces, and that the codon population has reached evolutionary steady state. Our model implements codon-level treatment of mutations with transition/transversion biases, and includes two contributions to codon fitness which describe codon translation speed and accuracy. Furthermore, our model includes wobble pairing – the possibility of codon-anticodon base pairing mismatches at the 3’ nucleotide position of the codon. We find that the observed patterns of genome-wide codon usage are consistent with a strong selective penalty for mistranslated amino acids. Thus codons undergo purifying selection and their relative frequencies are affected in part by mutational robustness. We find that the dependence of codon fitness on translation speed is weaker on average compared to the strength of selection against mistranslation. Although no constraints on codon-anticodon pairing are imposed a priori , a reasonable hierarchy of pairing rates, which conforms to the wobble hypothesis and is consistent with available structural evidence, emerges spontaneously as a model prediction. Finally, treating the translation process explicitly in the context of a finite ribosomal pool has allowed us to estimate mutation rates per nucleotide directly from the coding sequences. Reminiscent of Drake’s observation that mutation rates are inversely correlated with the genome size, we predict that mutation rates are inversely proportional to the number of genes. Overall, our approach offers a unified biophysical and population genetics framework for studying codon bias across all domains of life.

A simple model describing evolution of genomic GC content with random perturbations in asexually reproducing organisms

A simple stochastic model to describe the evolution over time of core genome SNP GC content in prokaryotes

Structural Parameters for Globular Clusters in NGC 5128

A mutation-selection model for general genotypes with recombination

Entangled Evolution Model: Application to Adaptive Mutation

An integrated model of the effects on fitness of beneficial, deleterious, and neutral mutations in sexual populations

Criticality in the Luria-Delbrück model with an arbitrary mutation rate

Models of Genome Evolution

Genomic mutation rates that neutralize adaptive evolution and natural selection

Mutation and evolution: Conceptual possibilities

A Model on Genome Evolution

Evolution enhances mutational robustness and suppresses the emergence of a new phenotype: A new computational approach for studying evolution

Gene-Mating Dynamic Evolution Theory I: Fundamental assumptions, exactly solvable models and analytic solutions

Genome Evolution by Substitutions, Duplications and Deletions

Minimal model for genome evolution and growth

Selection, Mutations and Codon Usage in Bacterial Model

Inferring biophysical models of evolution from genome-wide patterns of codon usage

Unifying (simple) models of genetic robustness and evolvability

Modulation of Base Specific Mutation and Recombination Rates Enables Functional Adaptation within the Context of the Genetic Code

Environmental and genetic influence on rate and spectrum of spontaneous mutations in

Relaxing the Hypotheses of Symmetry and Time-Reversibility in Genome Evolutionary Models