Predicting Gene Expression Level from Codon Usage Bias

Ian Henry, Paul M. Sharp
DOI: https://doi.org/10.1093/molbev/msl148
IF: 10.7
2006-10-12
Molecular Biology and Evolution
Abstract: The "expression measure" of a gene, E(g), is a statistic devised to predict the level of gene expression from codon usage bias. E(g) has been used extensively to analyze prokaryotic genome sequences. We discuss 2 problems with this approach. First, the formulation of E(g) is such that genes with the strongest selected codon usage bias are not likely to have the highest predicted expression levels; indeed the correlation between E(g) and expression level is weak among moderate to highly expressed genes. Second, in some species, highly expressed genes do not have unusual codon usage, and so codon usage cannot be used to predict expression levels. We outline a simple approach, first to check whether a genome shows evidence of selected codon usage bias and then to assess the strength of bias in genes as a guide to their likely expression level; we illustrate this with an analysis of Shewanella oneidensis.
genetics & heredity, biochemistry & molecular biology, evolutionary biology