| Literature DB >> 29344693 |
Gregory A Babbitt1, Erin E Coppola2, Jamie S Mortensen2, Patrick X Ekeren3,4, Cosmo Viola4,5, Dallan Goldblatt4, André O Hudson6.
Abstract
Since the elucidation of the genetic code almost 50 years ago, many nonrandom aspects of its codon organization remain only partly resolved. Here, we investigate the recent hypothesis of 'dual-use' codons which proposes that in addition to allowing adjustment of codon optimization to tRNA abundance, the degeneracy in the triplet-based genetic code also multiplexes information regarding DNA's helical shape and protein-binding dynamics while avoiding interference with other protein-level characteristics determined by amino acid properties. How such structural optimization of the code within eukaryotic chromatin could have arisen from an RNA world is a mystery, but would imply some preadaptation in an RNA context. We analyzed synonymous (protein-silent) and nonsynonymous (protein-altering) mutational impacts on molecular dynamics in 13823 identically degenerate alternative codon reorganizations, defined by codon transitions in 7680 GPU-accelerated molecular dynamic simulations of implicitly and explicitly solvated double-stranded aRNA and bDNA structures. When compared to all possible alternative codon assignments, the standard genetic code minimized the impact of synonymous mutations on the random atomic fluctuations and correlations of carbon backbone vector trajectories while facilitating the specific movements that contribute to DNA polymer flexibility. This trend was notably stronger in the context of RNA supporting the idea that dual-use codon optimization and informational multiplexing in DNA resulted from the preadaptation of the RNA duplex to resist changes to thermostability. The nonrandom and divergent molecular dynamics of synonymous mutations also imply that the triplet-based code may have resulted from adaptive functional expansion enabling a primordial doublet code to multiplex gene regulatory information via the shape and charge of the minor groove.Entities:
Keywords: Codon organization; Dual-use codon; Genetic code; Molecular dynamics; RNA world
Mesh:
Substances:
Year: 2018 PMID: 29344693 PMCID: PMC5846835 DOI: 10.1007/s00239-018-9828-x
Source DB: PubMed Journal: J Mol Evol ISSN: 0022-2844 Impact factor: 2.395
Fig. 1A general overview (a) of our pipeline for calculating the level of MD optimization within the context of nucleic acid structures (i.e., dsRNA and bDNA). Generalized Born MD simulations (Amber 14) were run on 50 sets of all 64 codons centered on implicitly solvated G-capped 23 bp DNA fragments. Explicitly solvated systems were tested as well with equilibration times reduced to 0.2 µs. Alternative coding schemes defining the synonymous/nonsynonymous categorizations were generated by combinatoric base exchanges at all three codon positions, thus maintaining the same level of degeneracy in all alternative code combinations. Mutational impacts for all changes within the genetic code were defined using atomic fluctuations and correlations (i.e., cpptraj) and then summed over all synonymous sites (blue line) and all nonsynonymous sites (yellow line) and then compared to histograms of respective synonymous or nonsynonymous impacts in all the alternative codes (histograms). Mutational impacts were defined two ways, first as (b) absolute differences in correlation of vector trajectories (i.e., yellow arrows) along backbone carbons, and second as (c) absolute differences in atomic fluctuations or rapid harmonic vibrations (i.e., blue circles). (Color figure online)
Optimization of the genetic code to solvated aRNA structures
| Method | Mutation | dCORR | dFLUX |
|---|---|---|---|
| emp. | emp. | ||
| Implicit | N | 0.932 | 0.994 |
| S | 0.067 | 0.006 | |
| NN | 0.952 | 0.994 | |
| SN | 0.048 | 0.002 | |
| SS | 0.549 | 0.653 | |
| Explicit | N | 0.661 | 0.645 |
| S | 0.338 | 0.355 | |
| NN | 0.937 | 0.755 | |
| SN | 0.010 | 0.192 | |
| SS | 0.987 | 0.947 |
Table shows the empirical p value (i.e., percent optimization) for the genetic code’s biophysically defined mutational impacts in both implicitly and explicitly solvated simulations when compared to all possible codon reassignments with similar levels of degeneracy
Fig. 2Sum totals of relative impacts of synonymous and nonsynonymous mutation on a the MD of double-stranded aRNA, b the MD of double-stranded bDNA, and c the predicted DNA flexibility when comparing the canonical genetic code (dashed line) to all possible 13,823 alternative codes (histogram). Mutational impacts are defined according to mutational shifts in both correlation of atomic vector trajectories (dCORR—top) and atomic fluctuations (dFLUX—bottom) collected on all backbone carbons within a 5 bp mask centered on the mutation site. The calculations were conducted with cpptraj software (Ambertools16) in plots (a) and (b) and the TRX score of (Heddi et al. 2010) in plot (c)
Optimization of the genetic code to solvated bDNA structures
| Method | Mutation | dCORR | dFLUX |
|---|---|---|---|
| emp. | emp. | ||
| Implicit | N | 0.809 | 0.765 |
| S | 0.191 | 0.235 | |
| NN | 0.928 | 0.785 | |
| SN | 0.025 | 0.163 | |
| SS | 0.921 | 0.674 | |
| Explicit | N | 0.884 | 0.992 |
| S | 0.116 | 0.008 | |
| NN | 0.923 | 0.992 | |
| SN | 0.609 | 0.017 | |
| SS | 0.693 | 0.463 |
Table shows the empirical p value (i.e., percent optimization) for the genetic code’s biophysically defined mutational impacts in both implicitly and explicitly solvated simulations when compared to all possible codon reassignments with similar levels of degeneracy
Optimization of the genetic code to sequence-based scores for bDNA flexibility (TRX score)
| Method | Mutation | emp. |
|---|---|---|
| dTRX | N | 0.188 |
| S | 0.882 | |
| NN | 0.118 | |
| SN | 0.832 | |
| SS | 0.851 |
Table shows the empirical p value (i.e., percent optimization) for the genetic code’s biophysically defined mutational impacts for dTRX when compared to all possible codon reassignments with similar levels of degeneracy
Fig. 3Sum totals of relative impacts of strand-specific mutation on the MD of a double-stranded aRNA and b bDNA comparing the canonical genetic code (dashed line) to all possible 13,823 alternative codes (histogram). Mutational impacts are defined as in Fig. 2 but were collected according to whether mutation types were symmetric (e.g., nonsynonymous on both leading and lagging strand) or asymmetric (i.e., synonymous on one strand but nonsynonymous on the other)
Fig. 4Atom specific heatmaps showing the average dCORR and dFLUX on a double-stranded aRNA and b bDNA as a function of mutation type (N nonsynonymous, S synonymous, NN nonsynonymous on both strands, SS synonymous on both strands, SN synonymous on only one strand)