| Literature DB >> 34373928 |
Hana Rozhoňová1,2, Joshua L Payne1,2.
Abstract
Selection for resource conservation can shape the coding sequences of organisms living in nutrient-limited environments. Recently, it was proposed that selection for resource conservation, specifically for nitrogen and carbon content, has also shaped the structure of the standard genetic code, such that the missense mutations the code allows tend to cause small increases in the number of nitrogen and carbon atoms in amino acids. Moreover, it was proposed that this optimization is not confounded by known optimizations of the standard genetic code, such as for polar requirement or hydropathy. We challenge these claims. We show the proposed optimization for nitrogen conservation is highly sensitive to choice of null model and the proposed optimization for carbon conservation is confounded by the known conservative nature of the standard genetic code with respect to the molecular volume of amino acids. There is therefore little evidence the standard genetic code is optimized for resource conservation. We discuss our findings in the context of null models of the standard genetic code.Entities:
Keywords: evolutionary design principles; standard genetic code; translation
Mesh:
Substances:
Year: 2021 PMID: 34373928 PMCID: PMC8557452 DOI: 10.1093/molbev/msab236
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Two Key Properties of the Randomized Genetic Codes Generated with the Ten Different Methods Used in This Study (supplementary methods, Supplementary Material online).
| Method | Preserves the Number of Codons per Amino Acid | Preserves the Exact Block Structure of the SGC |
|---|---|---|
| Quartet shuffling | Yes | No |
| Amino acid permutation | No | Yes |
| Restricted amino acid permutation | No | Yes |
| N-Block shuffler | Yes | No |
| Codon shuffler | Yes | No |
| AAAGALOC shuffler | No | No |
| Random expansion | No | Yes |
| Ambiguity reduction 1 | No | Yes |
| Ambiguity reduction 2 | No | Yes |
| 2–1–3 model | No | Yes |
The randomized codes have a block structure, but it is different from that of the SGC.
The number of codons per amino acid is allowed to change by at most two, relative to the SGC.
Fig. 1.Nitrogen conservation is highly sensitive to choice of null model. (A) Histograms of the ERMC for nitrogen (blue) and carbon (black) in 1 million randomized codes generated by amino acid permutation. The vertical red line corresponds to the SGC. Codon frequencies and mutation rates are from the “ocean parameters.” (B) P values of the ERMC for nitrogen (top) and carbon (bottom) of the SGC, relative to 1 million randomized codes generated by amino acid permutation, using the “diverse species parameters.” Shades of gray correspond to statistically insignificant P values (P > 0.05; darker=less significant) and shades of red to statistically significant P values (; darker=more significant). The P values were adjusted using Benjamini–Hochberg correction for multiple testing. Organisms in each group are ordered based on the GC content of their coding sequences. Unicell. euk., unicellular eukaryotes.
Fig. 2.The ERMC for nitrogen is correlated with the number of codons for nitrogen-rich amino acids. The black line shows the mean, and the shaded area shows the 25th to the 75th quantile, of the ERMC for nitrogen in relation to the number of codons for nitrogen-rich amino acids in 1 million randomized codes generated by amino acid permutation. The point and dotted lines correspond to the SGC. Histograms of the number of codons for nitrogen-rich amino acids and the ERMC for nitrogen are shown on the top and on the right of the main panel, respectively. The ERMC for nitrogen was computed using the “ocean parameters.”
Fig. 3.Carbon conservation is confounded by the molecular volume of amino acids. (A) Scatter plot of the number of carbon atoms and the molecular volume of the 20 proteinogenic amino acids. (B) Scatter plot of the absolute change in the number of carbon atoms and the absolute change in molecular volume for the 75 amino acid pairs that are connected by a missense mutation in the SGC. Jitter applied in the x axis for visualization. (C) Histograms of the ERMC for (top) the molecular volume of amino acids in 1 million randomized codes generated by amino acid permutation and (bottom) for carbon in the subset of 14,400 randomized codes that have an ERMC for molecular volume that is less than or equal to that of the SGC. The ERMC was computed using the “ocean parameters.”