| Literature DB >> 25919033 |
Abstract
The standard genetic code (SGC) is central to molecular biology and its origin and evolution is a fundamental problem in evolutionary biology, the elucidation of which promises to reveal much about the origins of life. In addition, we propose that study of its origin can also reveal some fundamental and generalizable insights into mechanisms of molecular evolution, utilizing concepts from complexity theory. The first is that beneficial traits may arise by non-adaptive processes, via a process of "neutral emergence". The structure of the SGC is optimized for the property of error minimization, which reduces the deleterious impact of point mutations. Via simulation, it can be shown that genetic codes with error minimization superior to the SGC can emerge in a neutral fashion simply by a process of genetic code expansion via tRNA and aminoacyl-tRNA synthetase duplication, whereby similar amino acids are added to codons related to that of the parent amino acid. This process of neutral emergence has implications beyond that of the genetic code, as it suggests that not all beneficial traits have arisen by the direct action of natural selection; we term these "pseudaptations", and discuss a range of potential examples. Secondly, consideration of genetic code deviations (codon reassignments) reveals that these are mostly associated with a reduction in proteome size. This code malleability implies the existence of a proteomic constraint on the genetic code, proportional to the size of the proteome (P), and that its reduction in size leads to an "unfreezing" of the codon - amino acid mapping that defines the genetic code, consistent with Crick's Frozen Accident theory. The concept of a proteomic constraint may be extended to propose a general informational constraint on genetic fidelity, which may be used to explain variously, differences in mutation rates in genomes with differing proteome sizes, differences in DNA repair capacity and genome GC content between organisms, a selective pressure in the evolution of sexual reproduction, and differences in translational fidelity. Lastly, the utility of the concept of an informational constraint to other diverse fields of research is explored.Entities:
Year: 2015 PMID: 25919033 PMCID: PMC4500140 DOI: 10.3390/life5021301
Source DB: PubMed Journal: Life (Basel) ISSN: 2075-1729
Figure 1How error minimization may neutrally emerge in genetic codes.
Robustness related potential pseudaptations.
| Trait | Potential driving force | Indirect benefit (neutrally emergent*) |
|---|---|---|
| Increased proteome hydrophobicity in AT rich genomes | Hypothesized that AT bias may arise neutrally via changes in mutation bias [ | Increased hydrophobicity of proteome results in increased protein folding stability [ |
| Scale free structure of metabolic networks | There is evidence preferential attachment has given rise to the scale free property [ | Robustness to gene deletion [ |
| Scale free structure of protein interaction networks | There is evidence that preferential attachment has given rise to the scale free property [ | Robustness to gene deletion [ |
| Scale free structure of gene regulatory networks | There is evidence a combination of gene duplication and preferential attachment are responsible for the scale free property [ | Robustness to mutation [ |
| Survival of the flattest | Survival of the flattest refers to the increase in number of robust organisms in a population when mutation rates are high. This neutrally emerges in digital organisms [ | Increased robustness of the population to mutation |
| Mutational robustness of protein and RNA structures | Mutational robustness in RNA secondary structures [ | Increased structural robustness to mutation |
| Error minimization of the genetic code | There is evidence that error minimization neutrally emerged during genetic code expansion via gene duplication of adaptor molecules and charging enzymes [ | Error minimization reduces the deleterious impact of point mutations, transcriptional and translational errors |
| Genetic dominance | It has been proposed that genetic dominance is selected for to increase metabolic flux [ | Increased mutational robustness [ |
| Enhanced DNA repair in Deinococcus radiodurans | The ability to withstand dessication may have led to the enhanced repair of double stranded breaks [ | Enhanced repair of double stranded breaks also leads to radiation resistance in this species. Radiation is rarely encountered in nature, so it is unlikely radiation resistance was directly selected for [ |
Figure 2Neutral emergence of mutational robustness in a population of proteins.
Evolvability related potential pseudaptations.
| Trait | Potential driving force | Indirect benefit (neutrally emergent *) |
|---|---|---|
| Sexual reproduction | The purpose of sexual reproduction has been proposed to be DNA repair via recombination [ | Recombination leads to a reduction in the Hill-Robertson effect, enhancing the strength of selection |
| Segmentation of virus genomes | The role of virus genome segmentation has been linked to differential gene expression [ | In cystoviruses, segmentation leads to random assortment, and subsequent amelioration of linkage disequilibrium [ |
| Protein domain shuffling | Domain shuffling is facilitated by the occurrence of introns [ | Domain shuffling has been linked to evolutionary innovation [ |
| Reduced population size | Many factors may act to reduce population size and it is unlikely to be directly selected for | Ability to traverse evolutionary barriers [ |
| Nonfunctional DNA in higher eukaryotes | The function of the majority of intron sequences and intergenic DNA, if any, has not been established. Notably, overall there is a lack of sequence conservation, indicating a lack of sequence specific selection [ | Longer introns and intergenic DNA regions lead to an increase in recombination events, reducing the Hill-Roberston effect and so increasing the strength of selection [ |
| Evolutionary capacitance of HSP90 | HSP90 is a normal part of the stress response in the eukaryotes | HSP90 acts to store cryptic genetic variation, this is exposed in times of stress due to a reduction in the concentration of free HSP90 [ |
| Evolutionary capacitance of complex gene regulatory networks | Gene regulatory network structure is driven by the addition and removal of nodes, according to the immediate selective benefit | The loss of a gene enhances the phenotypic variation of remaining components of the network, and this promotes evolvability, this effect is not dependent on network topology [ |
| Error minimization in the SGC | There is evidence that error minimization has neutrally emerged as a consequence of genetic code expansion over time [ | Error minimization has been proposed to result in the increased evolvability of proteins [ |
| Elevated mutation rates in RNA viruses | The ultimate cause of elevated mutation rates in RNA viruses has not established, but reduced P may be a factor [ | Elevated mutation rates increase the ability to evade the host immune system and adapt to drug treatments |
| Ambiguous decoding of the CUG codon as both serine and leucine in | The ambiguous decoding of CUG [ | Ambiguous CUG decoding produces elevated levels of HSPs and this enhances survivability in challenging environments [ |
| Polyploidy | Polyploidy is caused by abnormal cell division | Polyploidy is proposed to result in increased evolvability in plants [ |
| Lateral gene transfer (LGT) in prokaryotes | LGT may have a role in DNA repair of the prokaryotic genome [ | LGT leads to increased evolvability in response to environmental challenges |
Bacteria that have undergone codon reassignments.
| Lineage and phylogenetic affiliation | Genetic code change | Genome size | Genome GC content | Elevated substitution rate? | Loss of DNA repair? | Habitat |
|---|---|---|---|---|---|---|
| Mycoplasmas (Mollicutes) | UGA (stop)→trp [ | 580–1359 kbp (Genbank) | 25%–40% (Genbank) | Yes [ | Yes [ | Vertebrate cells |
| Spiroplasmas (Mollicutes) | UGA (stop)→trp [ | 940–2220 kbp [ | 29% [ | Yes [ | Yes [ | Insect and plant cells |
| Ureaplasmas (Mollicutes) | UGA (stop)→trp [ | 750–950 kbp [ | 25% [ | Yes [ | Not determined | Vertebrate cells |
| SR1 bacteria (related to Chloroflexi) | UGA (stop)→gly [ | 1178 kbp [ | 31% [ | Yes [ | Not determined | Human body (extracellular), sediments |
| UGA (stop)→trp [ | 112 kbp [ | 17% [ | Yes [ | Yes [ | Circada (insect) cells | |
| UGA (stop)→trp [ | 190 kbp [ | 24% [ | Yes [ | Yes [ | Sharpshooter (insect) cells | |
| UGA→trp [ | 144 kbp [ | 58% [ | Yes [ | Yes [ | Circada (insect) cells |
Figure 3Relationship between proteome size and underlying mutation rates in diverse genomes.
The importance of information content in diverse systems. The use of information as a parameter in differing fields of study.
| Discipline | Parameter |
|---|---|
| Information theory | Shannon entropy/message length |
| Signalling games | Complete/incomplete/perfect information |
| Physics | Physical information |
| Economics | Information goods |
| Linguistics | Word/sentence length is related to information content |
| Ecology | Alpha diversity |
| Complexity theory | Complexity measures are related to information content |
| Biology | Genomic information content, organismal complexity |
The importance of information content in diverse systems. Increased information content/complexity may act as a constraint in a variety of different systems, biological and non-biological.
| System | Nature of informational/complexity constraint | Consequence |
|---|---|---|
| Business | Complexity of business | “Complexity costs“ add financial burden on the business |
| Healthcare | Complexity of medical treatments | Increased probability of error and consequent detrimental health outcomes [ |
| Statistical models | Number of parameters in a model | Greater number of parameters increases the variance of outcome [ |
| Messages in communication systems | Message length | Greater message length in communications is costly, leading to the noiseless coding theorum which formalizes message compression [ |
| Computer programming | Complexity of code, “feature creep” | Increased production costs |
| Ecosystem | Biodiversity/number of endemic species | The more biodiverse an ecosystem, the greater the political/economic pressure to preserve it |
| Biological research | Equation density in a research paper | Reduced citation of paper [ |
| Genomics | Quantity and complexity of high throughput data | Analysis costs, |
| Multicellular animals | Body size | More cells (and so genome copies) proposed to increase cancer risk [ |
| Lateral gene transfer | Complexity of protein complexes | The complexity hypothesis proposes that participation in multi-subunit protein complexes constitutes a barrier to the lateral transfer of informational genes [ |
| Organismal evolution | Organismal complexity | Organismal complexity proposed to constrain rate of adaptation [ |
| Molecular evolution | Genomic information content | Proposed to constrain genetic fidelity [ |