| Literature DB >> 18665254 |
Feng-Jie Sun1, Gustavo Caetano-Anollés.
Abstract
Transfer RNA (tRNA) molecules play vital roles during protein synthesis. Their acceptor arms are aminoacylated with specific amino acid residues while their anticodons delimit codon specificity. The history of these two functions has been generally linked in evolutionary studies of the genetic code. However, these functions could have been differentially recruited as evolutionary signatures were left embedded in tRNA molecules. Here we built phylogenies derived from the sequence and structure of tRNA, we forced taxa into monophyletic groups using constraint analyses, tested competing evolutionary hypotheses, and generated timelines of amino acid charging and codon discovery. Charging of Sec, Tyr, Ser and Leu appeared ancient, while specificities related to Asn, Met, and Arg were derived. The timelines also uncovered an early role of the second and then first codon bases, identified codons for Ala and Pro as the most ancient, and revealed important evolutionary take-overs related to the loss of the long variable arm in tRNA. The lack of correlation between ancestries of amino acid charging and encoding indicated that the separate discoveries of these functions reflected independent histories of recruitment. These histories were probably curbed by co-options and important take-overs during early diversification of the living world.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18665254 PMCID: PMC2474678 DOI: 10.1371/journal.pone.0002799
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1A global phylogenetic tree of tRNA molecules reconstructed from sequence and structure.
MP analyses of data from 571 tRNA molecules resulted in the preset limit of 20,000 minimal length trees, each of 10,083 steps. Consistency index = 0.069 and 0.069, with and without uninformative characters, respectively; Retention index = 0.681; Rescaled consistency index = 0.047; g1 = −0.107. Terminal leaves are not labeled since they would not be legible. tRNA molecules coding for Sec, Ser, Tyr, and Leu are labeled with colors. Note several of these tRNAs have short variable arms and these are derived in the tree. Nodes labeled with closed circles have BS values >50%. The figure has been modified from [27] and a global tRNA tree with labeled terminal taxa can be found in Supporting Information (Figure S1).
Figure 2Phylogenetic analyses of the amino acid charging function in tRNA.
A. Cumulative frequency distribution plots that describe the cumulative number of tRNAs charging for different amino acids as a function of node distance (nd), the number of nodes from an hypothetical tRNA molecules at the base of the tree. B. Average nd for each tRNA charging group plotted against number of additional steps (S) needed to satisfy constraints that force the monophyletic grouping of corresponding tRNA molecules, normalized to a 0–1 scale. Different colored circles correspond to the three groups of tRNA molecules described in the text. C. Plot describing the effect of numbers of tRNA that are constrained versus normalized S values.
The numbers of additional steps (S) required to force molecules into monophyly based on constraints related to amino acid specificity (spec) or amino acid specificity and tRNA structural classes (type I and II tRNAs) (cat-spec) during MP analyses of the combined structure and sequence data for 571 tRNA molecules.
| Constraint: |
| Constraints: |
|
| ((TyrII), …) | 133 | ((Type I: Ala, Arg, …, Val), (Type II: (Tyr), Sec, Ser, Leu)) | 130 |
| ((Sec), …) | 113 | ((Type I: Ala, Arg, …, Val), (Type II: (Sec), Ser, Leu, Tyr)) | 139 |
| ((SerII), …) | 140 | ((Type I: Ala, Arg, …, Val), (Type II: (Ser), Sec, Leu, Tyr)) | 142 |
| ((LeuII), …) | 146 | ((Type I: Ala, Arg, …, Val), (Type II: (Leu), Sec, Ser, Tyr)) | 158 |
| ((Ala), …) | 148 | ((Type I: (Ala), Arg, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 150 |
| ((Leu), …) | 140 | ((Type I: (Leu), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 157 |
| ((Cys), …) | 151 | ((Type I: (Cys), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 162 |
| ((Pro), …) | 144 | ((Type I: (Pro), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 166 |
| ((His), …) | 161 | ((Type I: (His), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 178 |
| ((Ser), …) | 166 | ((Type I: (Ser), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 185 |
| ((Tyr), …) | 173 | ((Type I: (Tyr), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 191 |
| ((Phe), …) | 179 | ((Type I: (Phe), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 192 |
| ((Ile), …) | 167 | ((Type I: (Ile), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 193 |
| ((Trp), …) | 175 | ((Type I: (Trp), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 193 |
| ((Gly), …) | 186 | ((Type I: (Gly), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 194 |
| ((Val), …) | 188 | ((Type I: (Val), Ala, …, Tyr), (Type II: Sec, Ser, Leu, Tyr)) | 198 |
| ((Glu), …) | 187 | ((Type I: (Glu), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 210 |
| ((Thr), …) | 203 | ((Type I: (Thr), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 210 |
| ((Lys), …) | 199 | ((Type I: (Lys), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 211 |
| ((Ini), …) | 190 | ((Type I: (Ini), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 212 |
| ((Asp), …) | 194 | ((Type I: (Asp), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 213 |
| ((Gln), …) | 207 | ((Type I: (Gln), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 214 |
| ((Asn), …) | 231 | ((Type I: (Asn), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 241 |
| ((Met), …) | 235 | ((Type I: (Met), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 252 |
| ((Arg), …) | 255 | ((Type I: (Arg), Ala, …, Val), (Type II: Sec, Ser, Leu, Tyr)) | 266 |
Each constrained group is given in parentheses and amino acids are indicated by the IUPAC 3-letter nomenclature. The length of the most parsimonious trees derived from the combined data set was 10,083 steps. Superscripts associated with amino acid codes indicate tRNAs belong to Type II molecules.
The numbers of additional steps (S) required to force molecules into monophyly based on amino acid chronology constraints during MP analyses of combined tRNA structure and sequence data.
| Chronology | Constraints |
|
| Miller | ((Gly, Ala, Asp, Val, Pro, Ser, Glu, Thr, Leu, Ile), …) | 339 |
| ((Gly, Ala, Asp, Val, Pro, Ser, Glu, Thr, Leu, Ile), (…)) | 357 | |
| Brooks et al. | ((Cys, Trp, Tyr, Gln, Phe, Leu, Lys, Glu), (Ala, Val, Gly, Ile, Thr, Asp, Ser, Asn, His), …) | 516 |
| ((Cys, Trp, Tyr, Gln, Phe, Leu, Lys, Glu), (Ala, Val, Gly, Ile, Thr, Asp, Ser, Asn, His), (…)) | 530 | |
| Trifonov | ((His, Lys, Cys, Phe, Tyr, Met, Ini, Trp, Sec), …) | 332 |
| ((His, Lys, Cys, Phe, Tyr, Met, Ini, Trp, Sec), (…)) | 348 | |
| Jordan et al. | ((Cys, Met, Ini, His, Ser, Phe, Asn, Thr, Ile, Val), (Pro, Ala, Glu, Gly, Lys), …) | 550 |
| ((Cys, Met, Ini, His, Ser, Phe, Asn, Thr, Ile, Val), (Pro, Ala, Glu, Gly, Lys), (…)) | 566 | |
| Fournier and Gogarten | ((Cys, Glu, Phe, Ile, Lys, Val, Trp, Tyr, Ser), (Asn, Gln, Gly, Leu, Pro, Asp), …) | 524 |
| ((Cys, Glu, Phe, Ile, Lys, Val, Trp, Tyr, Ser), (Asn, Gln, Gly, Leu, Pro, Asp), (…)) | 546 | |
| Present study (Type I and II are combined) | ((Ser, Sec, Leu, Tyr), …) | 139 |
| ((Ser, Sec, Leu, Tyr), (…)) | 253 | |
| Present study (Type I and II are separated) | ((Ser, Sec, Leu, Tyr), …) | 135 |
| ((Ser, Sec, Leu, Tyr), (…)) | 232 |
The length of the most parsimonious trees derived from the combined data set was 10,083 steps. Each constrained group is given in parentheses and groups of tRNA isoacceptors were labeled with IUPAC 3-letter amino acid nomenclature.
The numbers of additional steps (S) required to force molecules sharing the first, second, third, or the first two bases in codons into monophyly during MP analyses of the combined tRNA structure and sequence data.
| Constraints |
|
| Shared the first bases in codons: | |
| ((ANN), …) | 372 |
| ((CNN), …) | 466 |
| ((GNN), …) | 247 |
| ((UNN), …) | 389 |
| ((ANN), (CNN), (GNN), (UNN)) | 708 |
| Shared the second bases in codons: | |
| ((NAN), …) | 345 |
| ((NCN), …) | 267 |
| ((NGN), …) | 330 |
| ((NUN), …) | 299 |
| ((NAN), (NCN), (NGN), (NUN)) | 649 |
| Shared the third bases in codons: | |
| ((NNA), …) | 393 |
| ((NNC), …) | 817 |
| ((NNG), …) | 533 |
| ((NNU), …) | 896 |
| ((NNA), (NNC), (NNG), (NNU)) | 1597 |
| Shared the first and the second bases in codons: | |
| ((AAN), …) | 268 |
| ((ACN), …) | 172 |
| ((AGN), …) | 270 |
| ((AUN), …) | 195 |
| ((CAN), …) | 226 |
| ((CCN), …) | 144 |
| ((CGN), …) | 255 |
| ((CUN), …) | 294 |
| ((GAN), …) | 218 |
| ((GCN), …) | 148 |
| ((GGN), …) | 186 |
| ((GUN), …) | 188 |
| ((UAN), …) | 211 |
| ((UCN), …) | 186 |
| ((UGN), …) | 214 |
| ((UUN), …) | 309 |
| ((AAN), (ACN), (AGN), (AUN), (CAN), (CCN), (CGN), (CUN), (GAN), (GCN), (GGN), (GUN), (UAN), (UCN), (UGN), (UUN)) | 1197 |
The length of the most parsimonious trees derived from the combined data set was 10,083 steps. Each constrained group is given in parentheses and groups of tRNA molecules are indicated by codons with shared the first, second, third, or the first two bases. Symbol “N” indicates A, U, G, or C.
Figure 3Phylogenetic analysis of codon identity functions in tRNA.
A. Degenerate genetic code table painted with colors describing the ancestry (S) of their codon identity function, in which two (A) or one (B) position in the codon is degenerate and harbors any one of the four bases (N). Ancestries were derived by constraining sets of tRNA molecules into monophyletic groups. S values are provided in the right hand corner for every codon, and corrected S values that exclude type I molecules from the constraints are given in the left hand corner for codons related to Leu, Ser, and Tyr. Amino acid that are encoded are listed below corresponding codons.
Figure 4Plot describing the relationship between ancestries of amino acid charging (S) and codon identity (S) function, normalized to a 0–1 scale.
Dashed lines describe the effect of excluding type I tRNA variants from the constraints (from orange to red circles), and illustrate recruitment events related to the loss of the variable arm in these molecules. Color schemes of circles follow those of Figure 2.