| Literature DB >> 18369418 |
Feng-Jie Sun1, Gustavo Caetano-Anollés.
Abstract
Transfer RNAs (tRNAs) are ancient molecules that are central to translation. Since they probably carry evolutionary signatures that were left behind when the living world diversified, we reconstructed phylogenies directly from the sequence and structure of tRNA using well-established phylogenetic methods. The trees placed tRNAs with long variable arms charging Sec, Tyr, Ser, and Leu consistently at the base of the rooted phylogenies, but failed to reveal groupings that would indicate clear evolutionary links to organismal origin or molecular functions. In order to uncover evolutionary patterns in the trees, we forced tRNAs into monophyletic groups using constraint analyses to generate timelines of organismal diversification and test competing evolutionary hypotheses. Remarkably, organismal timelines showed Archaea was the most ancestral superkingdom, followed by viruses, then superkingdoms Eukarya and Bacteria, in that order, supporting conclusions from recent phylogenomic studies of protein architecture. Strikingly, constraint analyses showed that the origin of viruses was not only ancient, but was linked to Archaea. Our findings have important implications. They support the notion that the archaeal lineage was very ancient, resulted in the first organismal divide, and predated diversification of tRNA function and specificity. Results are also consistent with the concept that viruses contributed to the development of the DNA replication machinery during the early diversification of the living world.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18369418 PMCID: PMC2265525 DOI: 10.1371/journal.pcbi.1000018
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1A global phylogenetic tree of tRNA molecules reconstructed from sequence and structure.
MP analyses of data from 571 tRNA molecules resulted in the preset limit of 20,000 minimal length trees, each of 10,083 steps. Consistency index (CI) = 0.069 and 0.069, with and without uninformative characters, respectively; Retention index (RI) = 0.681; Rescaled consistency index (RC) = 0.047; g1 = −0.107. Terminal leaves are not labeled since they would not be legible. Nodes labeled with closed circles have BS values >50%. tRNA molecules belonging to different superkingdoms and viruses and coding for Sec, Ser, Tyr, and Leu are labeled with colors. Note several of these tRNAs have short variable arms and are derived in the tree.
Origins of the tripartite world
| tRNA class | Organismal constraints | Test |
|
| Unconstrained | ((A), B, E, V) | H | 132 |
| ((V), A, B, E) | H | 336 | |
| ((E), A, B, V) | H | 967 | |
| ((B), A, E, V) | H | 1039 | |
| ((A, B, E), V) | H | 339 | |
| ((B, E), A, V) | CH1 | 345 | |
| ((A, E), B, V) | CH1 | 971 | |
| ((A, B), E, V) | CH1 | 1038 | |
| (((A)(E)), B, V) | CH2 | 966 | |
| (((A)(B)), E, V) | CH2 | 1042 | |
| (((B)(E)), A, V) | CH2 | 1164 | |
| ((A), (B), (E), V) | CH3 | 1171 | |
| (((A)(B)(E)), V) | CH3 | 1171 | |
| (((A)(B))(E), V) | CH3 | 1171 | |
| (((A)(E))(B), V) | CH3 | 1178 | |
| (((B)(E))(A), V) | CH3 | 1179 | |
| ((A), (B), (E), (V)) | H | 1190 | |
| Constrained | ((Class I: A, B, E, V), (Class II: A, B, E, V)) | H | 232 |
| ((Class I: A, B, E, V), (Class II: (A), B, E, V)) | H | 136 | |
| ((Class I: A, B, E, V), (Class II: (V), A, B, E)) | H | 168 | |
| ((Class I: A, B, E, V), (Class II: (B), A, E, V)) | H | 318 | |
| ((Class I: A, B, E, V), (Class II: (E), A, B, V)) | H | 276 | |
| ((Class I: A, B, E, V), (Class II: (B, E), A, V)) | CH1 | 174 | |
| ((Class I: A, B, E, V), (Class II: (A, E), B, V)) | CH1 | 296 | |
| ((Class I: A, B, E, V), (Class II: (A, B), E, V)) | CH1 | 309 | |
| ((Class I: A, B, E, V), (Class II: ((A)(E)), B, V)) | CH2 | 294 | |
| ((Class I: A, B, E, V), (Class II: ((A)(B)), E, V)) | CH2 | 316 | |
| ((Class I: A, B, E, V), (Class II: ((B)(E)), A, V)) | CH2 | 325 | |
| ((Class I: (A), B, E, V), (Class II: A, B, E, V)) | H | 143 | |
| ((Class I: (V), A, B, E), (Class II: A, B, E, V)) | H | 291 | |
| ((Class I: (E), A, B, V), (Class II: A, B, E, V)) | H | 814 | |
| ((Class I: (B), A, E, V), (Class II: A, B, E, V)) | H | 852 | |
| ((Class I: (B, E), A, V), (Class II: A, B, E, V)) | CH1 | 297 | |
| ((Class I: (A, E), B, V), (Class II: A, B, E, V)) | CH1 | 825 | |
| ((Class I: (A, B), E, V), (Class II: A, B, E, V)) | CH1 | 851 | |
| ((Class I: ((A)(E)), B, V), (Class II: A, B, E, V)) | CH2 | 843 | |
| ((Class I: ((A)(B)), E, V), (Class II: A, B, E, V)) | CH2 | 870 | |
| ((Class I: ((B)(E)), A, V), (Class II: A, B, E, V)) | CH2 | 961 |
The numbers of additional steps (S) required to force molecules into monophyly were calculated based on class (class I and II tRNAs) and organismal (three superkingdoms of life or viruses) constraints using MP analyses of combined tRNA structure and sequence data. The length of the most parsimonious trees derived from the combined data set was 10,083 steps. Each constrained group is given in parentheses and groups of tRNA molecules are indicated by superkingdoms of life (Archaea, Bacteria, and Eukarya) or viruses. Both chloroplast and mitochondria tRNAs were included in Bacteria. A = Archaea, B = Bacteria, E = Eukarya, V = viruses. CH, competing hypothesis; H, non-competing hypothesis.
Figure 2Timeline of organismal diversification.
Constraints representing non-competing hypotheses of organismal relationship (white circles) are used to define a timeline for the appearance of lineages in a universal tree derived from the sequence and structure of tRNA. Blue circles represent constraints representing competing hypotheses. They illustrate both the most parsimonious lineage relationship and their coalescence. Areas colored in light green, salmon, and light yellow are delimited by lineage coalescence and describe three evolutionary epochs. The timeline is given in a scale of additional steps (S) needed to fulfill constraints. S values were not normalized.
Origins of the viral world
| tRNA category | Organismal constraints | Test |
|
| Unconstrained | ((A, V), B, E) | CH1 | 342 |
| ((B, V), A, E) | CH1 | 979 | |
| ((E, V), A, B) | CH1 | 1034 | |
| (((A)(V)), B, E) | CH2 | 333 | |
| (((B)(V)), A, E) | CH2 | 1164 | |
| (((E)(V)), A, B) | CH2 | 1162 | |
| ((A, VB), VE, B, E) | CH3 | 249 | |
| ((B, VB), VE, A, E) | CH3 | 959 | |
| ((E, VB), VE, A, B) | CH3 | 1015 | |
| ((A, VE), VB, B, E) | CH4 | 198 | |
| ((B, VE), VB, A, E) | CH4 | 926 | |
| ((E, VE), VB, A, B) | CH4 | 955 | |
| (((A)(VB)), VE, B, E) | CH5 | 246 | |
| (((B)(VB)), VE, A, E) | CH5 | 1100 | |
| (((E)(VB)), VE, A, B) | CH5 | 1088 | |
| (((A)(VE)), VB, B, E) | CH6 | 192 | |
| (((B)(VE)), VB, A, E) | CH6 | 1078 | |
| (((E)(VE)), VB, A, B) | CH6 | 1018 | |
| Constrained | ((Class I: A, B, E, V), (Class II: (A, V), B, E)) | CH7 | 182 |
| ((Class I: A, B, E, V), (Class II: (B, V), A, E)) | CH7 | 289 | |
| ((Class I: A, B, E, V), (Class II: (E, V), A, B)) | CH7 | 312 | |
| ((Class I: A, B, E, V), (Class II: ((A)(V)), B, E)) | CH8 | 189 | |
| ((Class I: A, B, E, V), (Class II: ((B)(V)), A, E)) | CH8 | 324 | |
| ((Class I: A, B, E, V), (Class II: ((E)(V)), A, B)) | CH8 | 328 | |
| ((Class I: (A, V), B, E), (Class II: A, B, E, V)) | CH9 | 292 | |
| ((Class I: (B, V), A, E), (Class II: A, B, E, V)) | CH9 | 817 | |
| ((Class I: (E, V), A, B), (Class II: A, B, E, V)) | CH9 | 855 | |
| ((Class I: ((A)(V)), B, E), (Class II: A, B, E, V)) | CH10 | 301 | |
| ((Class I: ((B)(V)), A, E), (Class II: A, B, E, V)) | CH10 | 971 | |
| ((Class I: ((E)(V)), A, B), (Class II: A, B, E, V)) | CH10 | 961 |
The numbers of additional steps (S) required to force molecules into monophyly were calculated based on class (class I and II tRNAs) and organismal (three superkingdoms of life or viruses) constraints using MP analyses of combined tRNA structure and sequence data. The length of the most parsimonious trees derived from the combined data set was 10,083 steps. Each constrained group is given in parentheses and groups of tRNA molecules are indicated by superkingdoms of life (Archaea, Bacteria, and Eukarya) or viruses. Both chloroplast and mitochondria tRNAs were included in Bacteria. A = Archaea, B = Bacteria, E = Eukarya, V = viruses, VB = viruses associated with Bacteria, VE = viruses associated with Eukarya. CH, competing hypothesis.
The numbers of additional steps (S) required to force molecules into monophyly based on tRNA category, amino acid specificity, and organismal constraints using MP analyses of combined tRNA structure and sequence data
| tRNA category | Constraints |
|
| Unconstrained | Superkingdom diversification prior to functional divergence: ((A: (Ala), (Arg), …, (Sec)), (B: (Ala), (Arg), …, (Sec)), (E: (Ala), (Arg), …, (Sec)), (V: (Ala), (Arg), …, (Sec))) | 2481 |
| Functional divergence prior to superkingdom diversification: ((Ala: (A)(B)(E)(V)), (Arg: (A)(B)(E)(V)), …, (Val: (A)(B)(E)(V))) | 2534 | |
| Constrained | Superkingdom diversification prior to functional divergence: ((Class II: (A: (Ser)(Sec)(Leu)(Tyr)), (B: (Ser)(Sec)(Leu)(Tyr)), (E: (Ser)(Sec)(Leu)(Tyr)), (V: (Ser)(Sec)(Leu)(Tyr))), (Class I: (A: (Ala), (Arg), …, (Sec)), (B: (Ala), (Arg), …, (Sec)), (E: (Ala), (Arg), …, (Sec)), (V: (Ala), (Arg), …, (Sec)))) | 2338 |
| Functional divergence prior to superkingdom diversification: ((Class II: (Ser: (A)(B)(E)(V)), (Sec: (A)(B)(E)(V)), (Leu; (A)(B)(E)(V)), (Tyr: (A)(B)(E)(V))), (Class I: (Ala: (A)(B)(E)(V)), (Arg: (A)(B)(E)(V)), …, (Val: (A)(B)(E)(V)))) | 2415 |
The length of the most parsimonious trees derived from the combined data set was 10,083 steps. Each constrained group is given in parentheses. Both chloroplast and mitochondria tRNAs were included in Bacteria. A = Archaea, B = Bacteria, E = Eukarya, V = viruses. Amino acids are indicated by the International Union of Pure and Applied Chemistry (IUPAC) 3-letter nomenclature.
Rates of evolution in the three superkingdoms and viruses derived from strict consensus trees
| Assigned branch length | Minimum length | Maximum length | |
| Archaea (59 leaves) | 5.25±6.44 (425) | 4.78±5.80 (387) | 5.74±6.57 (465) |
| Bacteria (275 leaves) | 4.20±5.34 (1,776) | 3.97±5.20 (1,679) | 4.45±5.44 (1,881) |
| Eukarya (220 leaves) | 5.19±6.16 (1,667) | 4.81±6.02 (1,544) | 5.60±6.34 (1,796) |
| Viruses (17 leaves) | 5.85±9.87 (193) | 5.42±9.43 (179) | 6.27±9.89 (207) |
The average number of character changes per branch (±standard deviations) are listed for assigned, minimum, and maximum values. The total numbers of character changes in the trees are given in parentheses. ANOVA showed average branch lengths were not significantly different between different superkingdoms or viruses (assigned branch lengths, df: 3, 854; F = 2.