| Literature DB >> 34383940 |
Abstract
The causes and consequences of the nonrandom structure of the standard genetic code (SGC) have been of long-standing interest. A recent study reported that mutations in present-day protein-coding sequences are less likely to increase proteomic nitrogen and carbon uses under the SGC than under random genetic codes, concluding that the SGC has been selectively optimized for resource conservation. If true, this finding might offer important information on the environment in which the SGC and some of the earliest life forms evolved. However, we here show that the hypothesis of optimization of a genetic code for resource conservation is theoretically untenable. We discover that the aforementioned study estimated the expected mutational effect by inappropriately excluding mutations lowering resource consumptions and including mutations involving stop codons. After remedying these problems, we find no evidence that the SGC is optimized for nitrogen or carbon conservation.Entities:
Keywords: carbon; evolution; mutation; nitrogen; second-order selection
Mesh:
Substances:
Year: 2021 PMID: 34383940 PMCID: PMC8557414 DOI: 10.1093/molbev/msab239
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1Schematics contrasting first- and second-order selections for resource conservation. A hypothetical organism living in a resource-limited environment has two codons; 80% of its codons are A and 20% are B. Under the wild-type (WT) genetic code, A encodes amino acid L that has a low cost of resource, whereas B encodes amino acid H that has a high cost of resource. Under the mutant code, A encodes H, whereas B encodes L. The code-table-altering mutation immediately increases the proteomic resource consumption but will lower the cost of future mutations, so the code-table-altering mutation is favored by the second-order selection but disfavored (to a greater extent) by the first-order selection for resource conservation. Codon-amino acid relationships are indicated by solid arrows, whereas mutations are indicated by broken arrows.
Fig. 2Testing the optimization of the SGC for resource conservation using RGCs generated by Shenhav and Zeevi’s method and nERMC. (A) Pearson’s correlation (Rcodon frequency–N/C content) between the genomic frequency of a codon and the number of nitrogen or carbon atoms in its encoded amino acid in each of 39 species examined. Each dot represents one species. A dot for nitrogen or carbon is marked in red if one or more of the nine examined κ (transition/transversion mutation rate ratio) values yield significant results in the corresponding species in (B) or (C); otherwise it is marked in blue. The box plot shows the distribution of the 39 data points, with the left and right edges of the box representing the first (qu1) and third (qu3) quartiles, respectively, the vertical line inside the box indicating the median (md), and the whiskers extending to the most extreme values inside inner fences, md±1.5(qu3−qu1). (B–D) Heat map of the significance level of the optimization of the SGC for conservation of nitrogen (B), carbon (C), or both carbon and nitrogen (D). Colors indicate the nominal P value, which is the fraction of RGCs whose nERMC is smaller than that of the SGC. (E and F) Relationship between Rcodon frequency–N/C content and the significance level of the optimization of the SGC for nitrogen (E) or carbon (F) conservation. The significance level of optimization is determined under κ = 3 because κ is around 3 in most species (Zou and Zhang 2021). Pearson’s correlation between Rcodon frequency–N/C content and the significance level of optimization is −0.75 (P < 0.0001) in (E) and −0.92 (P < 0.0001) in (F).