| Literature DB >> 35741358 |
Yu Men1, Guoliang Lu2, Yanhui Wang1, Jinzhong Lin2, Qiang Xie1.
Abstract
The theory of the RNA world, especially with the catalytic capability of RNA, provides a reasonable framework explaining the evolution of molecular genetics system before the scenario of the central dogma. However, it remains a challenge to deduce the origin mechanism of rRNAs. Here we reconstructed the phylogenetic relationships of archaea and bacteria with bootstrap values of most nodes, especially the deep ones, higher than 90%. Based on the well-resolved tree, the full lengths of 16S, 5S, and 23S rRNA sequences of the last universal common ancestor (LUCA) were reconstructed for the first time. The potential similarities shared by the three ancestral rRNA sequences were further explored by searching for repeat short fragments in the level of purine-pyrimidine (RY) with certain lengths and arrangements. With the lengths ranging from 2 to 14, functional short fragments could be found in the three RNAs. As a representative, a set with a total of 75 short fragments of 11 nucleotides in length can recover all types of the known functional sites of ribosomes in a most concise manner. The 75 short fragments cluster around the functional center of the ribosome, among which 18 of them are highly conserved across five or six kingdoms and still contain all types of known functional sites except one. Alternatively, according to the strategy using the level of AUGC instead of RY, a similar pattern can be recovered. Such results indicate the local similarities shared by 16S, 5S, and 23S rRNAs and thus suggest a possible general mechanism in the formation of the LUCA rRNAs.Entities:
Keywords: ancestral sequence; last universal common ancestor; ribosomal RNA; ribosome; structure
Year: 2022 PMID: 35741358 PMCID: PMC9219793 DOI: 10.3390/biology11060837
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Figure 1Flow chart overview.
Figure 2Phylogenetic relationships of archaea and bacteria. Branches were grouped to display the tree at the phylum level. Bootstrap values for this ML(Maximum Likelihood) tree were indicated by colored circles on nodes. LUCA: last universal common ancestor. The ML-tree with all branches is shown in Figure S1.
Figure 3The total lengths of candidate units. The total length is equal to the length of the candidate unit multiplied by the number of candidate units. The total length of ancestral 16S, 5S, and 23S rRNA sequences of LUCA was 4428 nucleotides (marked by red line).
Overview of the lengths, positions, conservativeness, and function attributes of short fragments in RY and AUGC levels.
| RY | AUGC | |
|---|---|---|
| Length range of short fragment | 2–15 | 2–12 |
| Representative length in the most concise manner | 11 | 6 |
| Total number of short fragments with the representative length | 75 | 136 |
| Percentage of the overlapped or adjacent short fragments | 68% | 56% |
| Number of short fragments that belonged to the 16S rRNA | 21 | 34 |
| Number of short fragments that belonged to the 5S rRNA | 2 | 2 |
| Number of short fragments that belonged to the 23S rRNA | 52 | 100 |
| Number of short fragments conserved across archaea and bacteria | 36 | 47 |
| Number of universal short fragments (conserved across 5 or 6 kingdoms) | 18 | 29 |
| Percentage of the overlapped or adjacent universal short fragments | 67% | 66% |
| Number of functional sites located in the total short fragments | 69 | 48 |
| Number of universal short fragments that contained functional sites | 11 | 12 |
| Number of universal short fragments that contained no functional sites | 7 | 17 |
Figure 4Short fragments with 11 nucleotides in the RY level on secondary structures of LUCA rRNAs. (a) Secondary structure of 16S rRNA of LUCA. (b) Secondary structures of 5S rRNA (left) and 23S rRNA (right) of LUCA. Bases are indicated by colored circles. The black arrows mark the 226 nucleotides that were orthologous to the nucleotides performing functions in the ribosome (in total, 230 functional nucleotides were summarized, while 4 nucleotides’ orthologous sites were gaps in the reconstructed LUCA rRNA chains. Thus, a total of 226 nucleotides was marked). Short fragments are outlined by colored curves. Red and magenta curves outline the 18 universal short fragments, which were conserved across at least 5 of archaea, bacteria, protists, fungi, plants, and animals. Red curves and boxed numbers (from 1 to 11) outline and mark the 11 universal short fragments whose orthologous sequences of E. coli contain nucleotides performing functions in the ribosome. Magenta curves outline the other 7 universal short fragments containing no functional sites. An additional 57 short fragments are outlined by gray curves.
Figure 5Short fragments with 11 nucleotides in the RY level largely covered the key functional areas of the ribosome. The ribosomal tertiary structure of E. coli was mapped with the short fragments and is displayed on the front (a) and side (b) views. Short fragments are shown by colored lines with red, magenta, and gray. The short fragments marked in three colors and boxed numbers correspond to those in Figure 4. Red and magenta mark the 18 universal short fragments that were conserved across at least 5 of archaea, bacteria, protists, fungi, plants, and animals. Red and boxed numbers mark the 11 universal short fragments whose orthologous sequences of E. coli contain functional nucleotide sites. The functions of nucleotides contained in the 11 fragments are listed below the tertiary structures. Magenta marks the other 7 universal short fragments containing no functional site. An additional 57 short fragments are marked in gray. The L1 and L7/12 arms were functionally important domains in the ribosome.
Figure 6Number of randomly sampled functional nucleotide sites from the three rRNAs. The numbers of functional nucleotide sites contained in the randomly sampled 805 nucleotide sites are shown in blue. In total, 230 functional nucleotide sites of contemporary rRNAs were summed up. Sixty-nine of them were located in the 75 short fragments (marked in red).
Functions of nucleotide sites in short fragments.
| Function | N-Box | N-F75 | References |
|---|---|---|---|
| Interaction with tRNA in A-, P-, and E-sites | 1, 2, 3, 4, 5, 6, 8, 9, 10 | 529 (1-491), 530 (1-491), 531 (1-491), 532 (1-491), 2583 (2-2772), 2602 (2-2788), 1492 (1-1478), 1493 (1-1478), 1494 (1-1478), 1916 (2-2101), 1918 (2-2101), 1926 (2-2114), 2585 (2-2772), 926 (1-891), 956 (1-924), 2584 (2-2772), 2602 (2-2788), 2169 (2-2359), 1913 (2-2101), 2609 (2-2796), 2506 (2-2692), 2555 (2-2750), 2501 (2-2692), 2603 (2-2796), 2505 (2-2692), 693 (1-654), 2116 (2-2304) | [ |
| Interaction with mRNA | 1 | 532 (1-491), 693 (1-654), 1156 (1-1132), 1533 (1-1516), 1532 (1-1516), 1530 (1-1516), 1534 (1-1516), 1535 (1-1516), 1536 (1-1516), 1537 (1-1516), 1538 (1-1516), 1539 (1-1516), 1540 (1-1516) | [ |
| Interaction with mRNA–tRNA minihelix | 1, 4 | 1492 (1-1478), 1493 (1-1478), 530 (1-491) | [ |
| Interaction with nascent peptide | 7, 9 | 2062 (2-2249), 2585 (2-2772), 2506 (2-2692), 2609 (2-2796), 1614 (2-1797) | [ |
| Monitoring the codon–anticodon pairing and maintaining translational fidelity | 4 | 1492 (1-1478), 1493 (1-1478) | [ |
| Co-translational monitoring of nascent peptide chains inside the exit tunnel | 7 | 2062 (2-2249) | [ |
| Recognition of stop codon | 1, 4, 5 | 1913 (2-2101), 530 (1-491), 1493 (1-1478) | [ |
| Nascent peptide tunnel | 7 | 2058 (2-2249), 2059 (2-2249), 2060 (2-2249), 2061 (2-2249), 2062 (2-2249), 2063 (2-2249) | [ |
| Taking part in the peptide release | 9, 10 | 2585 (2-2772), 2602 (2-2788) | [ |
| Inter-subunit bridges | 4, 5, 9,10, 11 | 900(1-870), 901(1-870), 1493(1-1478), 1495(1-1478), 1496(1-1478), 1702(2-1884), 1703(2-1884), 1704(2-1884), 1705(2-1884), 1912(2-2101), 1913(2-2101), 1923(2-2114), 1928(2-2114), 1929(2-2114), 1932(2-2114), 1933(2-2114), 1960(2-2148), 1961(2-2148), 1962(2-2148), 2506(2-2692), 2585(2-2772), 2602(2-2788) | [ |
| Forming base pairs with other functional nucleotides | 1, 7 | 2061 (2-2249), 2063 (2-2249), 530 (1-491), 2499 (2-2692) | [ |
| Interacting with GTPase factors (EF-G, EF-Tu, IF2, RF3) | 2653 (2-2845), 2654 (2-2845), 2655 (2-2845), 2656 (2-2845), 2657 (2-2845), 2658 (2-2845) | [ |
N-Box: Boxed number corresponding to that in Figure 5. N-F75: Functional nucleotide sites contained in the 75 short fragments. The part outside the bracket refers to the functional nucleotide set in the rRNAs of E. coli. The part inside the bracket is the code number of the corresponding short fragment’s first nucleotide. The number before ‘-’ is the location, where ‘1’ is for 16S rRNA and ‘2’ is for 23S rRNA. The number after ‘-’ refers to the number of ancestral sequences with gaps.
Comparisons of two strategies and different lengths.
| Fragment Set with the Same Length | Fragment Set in the Same Level | |||
|---|---|---|---|---|
| RY Level | AUGC Level | Short | Long | |
| Positive results | More | Less | More | Less |
| Pseudo-positive results | More | Less | More | Less |
| Overlaps | More | Less | More | Less |
| Functional sets | More | Less | More | Less |