Abstract:We used the Moran's I index of global spatial autocorrelation with the aim of studying the distribution of the physicochemical or biological properties of amino acids within the genetic code table. First, using this index we are able to identify the amino acid property - among the 530 analyzed - that best correlates with the organization of the genetic code in the set of amino acid permutation codes. Considering, then, a model suggested by the coevolution theory of the genetic code origin - which in addition to the biosynthetic relationships between amino acids took into account also their physicochemical properties - we investigated the level of optimization achieved by these properties either on the entire genetic code table, or only on its columns or only on its rows. Specifically, we estimated the optimization achieved in the restricted set of amino acid permutation codes subject to the constraints derived from the biosynthetic classes of amino acids, in which we identify the most optimized amino acid property among all those present in the database. Unlike what has been claimed in the literature, it would appear that it was not the polarity of amino acids that structured the genetic code, but that it could have been their partition energy instead. In actual fact, it would seem to reach an optimization level of about 96% on the whole table of the genetic code and 98% on its columns. Given that this result has been obtained for amino acid permutation codes subject to biosynthetic constraints, that is to say, for a model of the genetic code consistent with the coevolution theory, we should consider the following conclusions reasonable. (i) The coevolution theory might be corroborated by these observations because the model used referred to the biosynthetic relationships between amino acids, which are suggested by this theory as having been fundamental in structuring the genetic code. (ii) The very high optimization on the columns of the genetic code would not only be compatible but would further corroborate the coevolution theory because this suggests that, as the genetic code was structured along its rows by the biosynthetic relationships of amino acids, on its columns strong selective pressure might have been put in place to minimize, for example, the deleterious effects of translation errors. (iii) The finding that partition energy could be the most optimized property of amino acids in the genetic code would in turn be consistent with one of the main predictions of the coevolution theory. Since the partition energy is reflective of the protein structure and therefore of the enzymatic catalysis, the latter might really have been the main selective pressure that would have promoted the origin of the genetic code. Indeed, we observe that the β-strands show an optimization percentage of 95.45%; so it is possible to hypothesize that they might have become the object of selection during the origin of the genetic code, conditioning the choice of biosynthetic relationships between amino acids. (iv) The finding that the polarity of amino acids is less optimized than their partition energy in the genetic code table might be interpreted against the physicochemical theories of the origin of the genetic code because these would suggest, for example, that a very high optimization of the polarity of amino acids in the code could be an expression of interactions between amino acids and codons or anticodons, which would have promoted its origin. This might now become less sustainable, given the very high optimization that is instead observed in favor of the partition energy but not polarity. Finally, (v) the very high optimization of the partition energy of amino acids would seem to make a neutral origin of error minimization, i.e. of the ability of the genetic code to buffer, for example, the deleterious effects of translation errors, very unlikely. Indeed, an optimization of about 100% would seem that it might not have been achieved by a simple neutral process, but this ability should probably have been generated instead by the intervention of natural selection. In actual fact, we show that the neutral theory of the origin of error minimization has been falsified for the model analyzed here. Therefore, we will discuss our observations within the theories proposed to explain the origin of the organization of the genetic code, reaching the conclusion that the coevolution theory is the most strongly corroborated theory.

Codon Usage Decreases the Error Minimization Within the Genetic Code.

Codon Usage Bias: An Endless Tale

A code within the genetic code: codon usage regulates co-translational protein folding

A Realistic Model Under Which the Genetic Code is Optimal

Optimality of the genetic code with respect to protein stability and amino acid frequencies

Synonymous but not the same: the causes and consequences of codon bias

Synonymous but Not Silent: The Codon Usage Code for Gene Expression and Protein Folding

Codon usage is less optimized in eukaryotic gene segments encoding intrinsically disordered regions than in those encoding structural domains

Synonymous Codons: Choose Wisely for Expression

Codon Usage Optimization in the Prokaryotic Tree of Life: How Synonymous Codons Are Differentially Selected in Sequence Domains with Different Expression Levels and Degrees of Conservation

Codon usage bias from tRNA's point of view: Redundancy, specialization, and efficient decoding for translation optimization

The genetic code is very close to a global optimum in a model of its origin taking into account both the partition energy of amino acids and their biosynthetic relationships

Optimization of the standard genetic code according to three codon positions using an evolutionary algorithm

Codon Usage Influences the Local Rate of Translation Elongation to Regulate Co-translational Protein Folding

Codon Codes: Codon Usage Bias Influences Many Levels of Gene Expression

Usage Patterns of Codons Versus Complementary Codons among Cellular Organisms and Organelles

Stability of the genetic code and optimal parameters of amino acids

Nonoptimal Codon Usage Is Critical for Protein Structure and Function of the Master General Amino Acid Control Regulator CPC-1

Codon catalog usage and the genome hypothesis.

Codon Optimality Controls Differential Mrna Translation During Amino Acid Starvation.

Genome-wide impact of codon usage bias on translation optimization in Drosophila melanogaster