| Literature DB >> 30609737 |
Robert Root-Bernstein1, Meredith Root-Bernstein2.
Abstract
We propose that ribosomal RNA (rRNA) formed the basis of the first cellular genomes, and provide evidence from a review of relevant literature and proteonomic tests. We have proposed previously that the ribosome may represent the vestige of the first self-replicating entity in which rRNAs also functioned as genes that were transcribed into functional messenger RNAs (mRNAs) encoding ribosomal proteins. rRNAs also encoded polymerases to replicate itself and a full complement of the transfer RNAs (tRNAs) required to translate its genes. We explore here a further prediction of our "ribosome-first" theory: the ribosomal genome provided the basis for the first cellular genomes. Modern genomes should therefore contain an unexpectedly large percentage of tRNA- and rRNA-like modules derived from both sense and antisense reading frames, and these should encode non-ribosomal proteins, as well as ribosomal ones with key cell functions. Ribosomal proteins should also have been co-opted by cellular evolution to play extra-ribosomal functions. We review existing literature supporting these predictions. We provide additional, new data demonstrating that rRNA-like sequences occur at significantly higher frequencies than predicted on the basis of mRNA duplications or randomized RNA sequences. These data support our "ribosome-first" theory of cellular evolution.Entities:
Keywords: LINE; SINE; aminoacyl tRNA synthetase; extra-ribosomal; genome; mRNA; non-canonical; paralog; rRNA; ribosome; ribosome-binding protein; tRNA
Mesh:
Substances:
Year: 2019 PMID: 30609737 PMCID: PMC6337102 DOI: 10.3390/ijms20010140
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Non-canonical functions of transfer RNAs (tRNAs), their fragments, and mimics. Table based mainly on references [53,54,55,56,57,58,59,60,61,62]. See text for additional references. miRNA = microRNA; siRNA = small interfering RNA.
| Non-Canonical tRNA Properties | Molecules Involved | Functions |
|---|---|---|
|
| ||
| Lipid aminoacylation | Phosphatidylglycerol synthetases | Protects against some antibiotics |
| Enzymatic modification of peptidoglycans | MurM, Fem aminoacyl transferases | Modifies peptidoglycan synthesis in cell wall; alters protein binding |
| Enzymatic modification of amino terminus of proteins | Leucyl/Phenylalanyl transferases; Arg-tRNA transferase | Targets proteins for degradation |
| Enzymatic modification of peptides | AlbC, PacB, VlmA, DhpH-C peptidyltransferase | Intracellular signaling; Metal chelation; Antibiotic formation |
|
| ||
| Dipeptide synthesis | Diketopiperazines | Metal chelation |
| Tetrapyrrole synthesis | Heme; Chlorophyll | Energy transduction |
| Aminoacylation of tRNA | Gln; Asn; fMet; Cys; Sec | Makes up for missing tRNA |
|
| ||
| Bind directly to ribosome to modify activity | Deacylated tRNA | Stringent response: Form alarmones, ppGpp and pppGpp |
| DNA replication initiation | ColE1-type plasmids | Control of plasmid replication |
| Riboswitch control | Operons ; T-box | Transcription attenuation |
| Metabolism | Siderophores; Virulence factors | Stress responses; Aerobic/respiratory metabolism; Gene expression |
| Stress responses | Proteins and polysomes involved in siRNA binding & miRNA binding | Translation regulation |
|
| ||
| Translation regulation | tmRNA (small transfer-messenger RNA; SsrA or 10Sa RNA) & tmRNA-specific binding protein (SmpB or small protein B). | Rescues stalled ribosomes |
| “t-elements” (tRNA-like elements) in mRNAs | 3′ and 5′ ends of mitochondrial mRNA | Sites for RNAase P and RNAase Z activation of mRNA? |
| Short Interspersed nuclear elements (SINEs) | Genome (non-coding DNA) | Retrotransposons |
| Viral tRNA mimics | Viral mRNA | Hijacking translation machinery |
|
| ||
| tRNA halves | Outer membrane vesicles | Stress responses? Bacterial signaling? |
| tRNA-derived fragments (tRFs) | Ribosomal proteins; mRNAs; RNA-binding proteins | Interferes w/with peptidyl transferase; |
Non-canonical functions of ribosomal RNA (rRNA) [82,83,84,85,86,87,88].
| Non-Canonical rRNA Properties | Molecules Involved | Functions |
|---|---|---|
| Transcription regulation | 5S rRNA acting on transcription factor for polymerase factor IIIA | Inhibits activation of RNA polymerase III-dependent transcription and related genes |
| Translation regulation | Ribosomal protein fragments | siRNA and RNAi inhibiting mRNA translation |
| Permits read-through of termination codons | 23S rRNA fragment | Inhibits release factor 2 (RF2)-dependent translation termination |
| Protein activity control | 5S rRNA-RPL5-RPL11 complex | Inhibits the Hdm2 protein which in turn regulates p53 |
| Sources of novel transcripts | 5.8S and 28S rRNA fragments | Translated into proteins |
Extra-ribosomal functions of ribosomal proteins of the small subunit of prokaryotic ribosomes (PROK RP). Auto Reg = autologous regulation of ribosomal protein by binding to its own mRNA.
| Prok RP | Auto REG | Other Extra-Ribosomal Functions of RP | References |
|---|---|---|---|
| S1 | YES | Stimulates RNA polymerase; Phage Qβ replicase | [ |
| S2 | YES | Regulates rpsB and tsf genes | [ |
| S3 | YES | DNA repair; NF-κB signaling pathway; Apoptosis | [ |
| S4 | YES | Regulates rpsD gene and α operons; Transcription anti-termination; NUS transcription elongation complex | [ |
| S5 | YES | [ | |
| S6 | YES | [ | |
| S7 | YES | [ | |
| S8 | YES | Repressor of spc operon | [ |
| S9 | YES | SOS DNA repair system | [ |
| S10 | YES | Transcription anti-termination; NUS transcription elongation complex | [ |
| S11 | YES | [ | |
| S12 | YES | T4 phage RNA chaperone and intron splicing | [ |
| S13 | YES | [ | |
| S15 | YES | [ | |
| S18 | YES | [ | |
| S19 | YES | [ | |
| S20 | YES | Inhibition of ornithine and arginine decarboxylase enzymes | [ |
| S21 | YES | [ | |
| S26 | YES | [ | |
| S28 | YES | [ |
Extra-ribosomal functions of ribosomal proteins of the large subunit of prokaryotic ribosomes (PROK RP). Auto Reg = autologous regulation of ribosomal protein by binding to its own mRNA.
| Prok RP | Auto REG | Other Extra-Ribosomal Functions of RP | References |
|---|---|---|---|
| L1 | YES | [ | |
| L2 | YES | [ | |
| L3 | Transcription anti-termination | [ | |
| L4 | YES | Transcription anti-termination; inhibits S10 transcription; regulates RNAase E | [ |
| L6 | YES | [ | |
| L7/12 | YES | [ | |
| L10 | YES | [ | |
| L11 | YES | [ | |
| L13 | Transcription anti-termination | [ | |
| L14 | Phage replication helicase | [ | |
| L19 | YES | [ | |
| L20 | YES | [ | |
| L24 | YES | [ | |
| L25 | YES | [ | |
| L30 | YES | [ | |
| L31 | YES | [ | |
| L32 | YES | [ | |
| L35 | YES | [ |
Extra-ribosomal functions of ribosomal proteins of the small subunit of eukaryotic ribosomes (EUK RP). Auto Reg = autologous regulation of ribosomal protein by binding to its own mRNA.
| Euk RP | Auto REG | Extra-Ribosomal Functions of RP | Reference |
|---|---|---|---|
| SA | 67 kD Laminin receptor | [ | |
| S0+S21 | Promotes 18S rRNA maturation | [ | |
| S2 | Methylation of Rps2p; Laminin receptor | [ | |
| S3 | Identical to apurinic-apyrimidinic DNA endonuclease III; functions as DNA β-lyase; binds NFkB; Activates p53 tumor suppression gene; Cdk2- mediated cell cycle regulation; Inhibits RECQL4 ATPase/helicase; Substrate of Akt mediating neuronal apoptosis and DNA repair; Binds HSP 90/70; Binds the transcription factor CHOP | [ | |
| S4 | Cysteine protease | [ | |
| S6 | YES | Translation regulator; Replication regulator; Enhances virus replication | [ |
| S7 | Binds MDM2; Activates p53 tumor suppression gene; Regulates MAPK; Binds to BCCIPβ | [ | |
| S8 | Identical to p27, an nuclear oocyte peripheral membrane protein | [ | |
| S9 | YES | Translation regulator | [ |
| S12 | YES | RNA splicing and modification; Karyopherin Kap121-dependent sequestration of ribonucleotide reductase | [ |
| S13 | YES | Regulates own intron and mRNA splicing | [ |
| S14 | YES | Inhibits own mRNA splicing; Regulates own transcription; Represses CRY2 expression; Activates p53 & Tap73 tumor suppression genes; Inactivates c-Myc | [ |
| S15 | Activates p53 tumor suppression gene | [ | |
| S19 | Apoptosis; Inhibits MIF, ERK, NF-κB; Erythropoiesis via C5a receptor binding | [ | |
| S20 | Regulates Pol III transcription; Apoptosis; Activates p53 tumor suppression gene | [ | |
| S26 | YES | Developmental processes | [ |
| S27 | Activates p53 tumor suppression gene | [ | |
| S28 | YES | [ | |
| S30 | YES | [ |
Extra-ribosomal functions of ribosomal proteins of the large subunit of eukaryotic ribosomes (EUK RP). Auto Reg = autologous regulation of ribosomal protein by binding to its own mRNA.
| Euk RP | Auto REG | Extra-Ribosomal Functions of RP | Reference |
|---|---|---|---|
| L1 | YES | Regulates gene expression | [ |
| L2 | YES | [ | |
| L3 | Cell cycle regulation; Apoptosis | [ | |
| L4 | YES | Regulates nucleolar RNA helicase 2 (Guα); Inhibits virus production | [ |
| L5 | Binds to MDM2; Activates p53 & Tap73 tumor suppression genes; Inactivates c-Myc; Stimulates aminoacyl tRNA synthases | [ | |
| L6 | Regulates Pol III gene transcription; Activates p53 tumor suppression gene | [ | |
| L7 | YES | Cell arrest; Apoptosis; Co-regulates VDR-RXR nuclear receptor complex | [ |
| L10 | YES | Translation regulator; Replication regulator; Binds c-Jun gene; Antiviral activity | [ |
| L11 | Regulates PPARα gene transcription; Activates p53 & Tap73 tumor suppression genes; Inactivates c-Myc | [ | |
| L12 | YES | [ | |
| L13a | Regulates inflammatory gene expression and ceruloplasmin mRNA translation (GAIT complex); Anti-viral immunity; Binds to glyceraldehyde-3-phosphate dehydrogenase | [ | |
| L21 | Apoptosis | [ | |
| L22 | Binds histone H1; Binds EBV small RNA; modulates splicing of the pre-mRNA encoding smad2 (morphogenesis); Hematopoesis; Binds to casein kinase 2α | [ | |
| L23 | Activates p53 tumor suppression gene; Binds MDM2 | [ | |
| L26 | Activates p53 tumor suppression gene; Binds MDM2 | [ | |
| L27 | Apoptosis; Developmental control | [ | |
| L30 | YES | [ | |
| L31 | Apoptosis | [ | |
| L32 | YES | [ | |
| L33A | Inhibits GCN4 translation | [ | |
| L37 | Activates p53 tumor suppression gene | [ | |
| Asc1 | Cell-wall integrity; Iron homeostasis; Energy metabolism | [ | |
| P-prots | Regulates phosphate transporters and phosphatases | [ | |
| P0/LP0 | Endonuclease; DNA repair | [ | |
| P2 | Iron-binding protein | [ | |
| RACK1 | Receptor of activated C kinase (signal transduction) | [ |
Non-canonical functions of tRNA synthetases and their paralogs. Table based on [117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139].
| Non-Canonical Properties of Aminoacyl tRNA Synthetases (aa-RS) | Molecules Involved | Functions |
|---|---|---|
|
| ||
| Lysyl-tRNA synthetase | Nucleosides | Polyphosphate synthesis |
|
| ||
| Alanyl-tRNA synthetase | Ala-tRNA synthetase gene operon | Autogenous regulation of transcription |
| Asparaginyl-tRNA synthetase | FGF-2 induced osteoblast growth | Regulates anti-apoptotic APK/Atk signalling |
| Leucyl-tRNA synthetase | Group I introns, mRNA | Splices group 1 introns |
| Threonyl-tRNA synthetase | Thr-tRNA synthetase mRNA | Autogenous regulation of translation |
| Tyrosyl-tRNA synthetase | Group I introns, mRNA | Splices group 1 introns |
|
| ||
| Asparaginyl-tRNA synthetase | CXCR1, CXCR2, ERK1, etc. | Activates CCR5 & IL-8 receptors CXCR1 and CXCR2; MAP kinases |
| Glycyl-tRNA synthetase | Cadherin (CDH6) | Binding to CDH6 (cadherin) releases suppressed phosphatase 2A (PP2A) that dephosphorylates activated ERK |
| Histidyl-tRNA synthetase | CCR5 | Chemokine: CCR5 activation |
| Tryptophyl-tRNA synthetase | DNA-PKcs, PARP-1 | Bridges DNA-PKcs to PARP-1 to link IFN-γ and p53 signaling |
|
|
|
|
| LysRS-II | Complex of ArgRS, GlnRS, IleRS, LeuRS, LysRS, MetRS, AspRS, Glu-ProRS, AIMP1, AIMP2, AIMP3 | Inflammatory cytokine (Extracellular) |
| Glu-ProRS | Ditto above | Translational silencing, e.g., VEGF |
| GlnRS | Ditto above | Anti-apoptosis |
| MetRS | Ditto above | rRNA transcription |
| IleRS | Ditto above | Autoimmune response |
| AIMP1 (aminoacyl-tRNA synthetase interacting multifunctional protein 1) (p43) | Ditto above | Cytokine activation of macrophages |
| AIMP2 (p38) | Ditto above | Degradation of FBP (transcriptional activator of |
| AIMP3 (p18) | Ditto above | p53 induction and DNA repair by activating nuclear ATM/ATR |
|
| ||
| Lysyl-tRNA synthetase | Proteolytic internal fragment of Lys-RS | Cytokine-like domain that functions in chemotaxis |
| Tryptophyl-RS | Trp-RS C-terminal | Up-regulates TNF-alpha |
| Tryptophyl-RS | Trp-RS N-terminal | Mimics interleukin-8 functions |
| Tyrosyl-RS | Tyr-RS C-terminal | Up-regulates TNF-alpha |
| Tyrosyl-RS | Tyr-RS N-terminal | Mimics interleukin-8 functions |
|
| ||
| AlaX, paralog of Ala-RS | tRNA | Trans-editing of mischarged tRNA |
| AlbC, paralog of TyrRS & TrpRS | Phenylalanine, Serine | Cyclodipeptide synthase |
| AsnA, paralog of Asn-RS | Asparagine | Asparagine biosynthesis |
| Arc1p, paralog of Met-RS | mRNA | RNA editing |
| BirA, paralog of Ser-RS | Biotin | Biotin repressor |
| DNA pol γ, paralog of Gly-RS | Polynucleotides | DNA polymerase |
| EMAP II, paralog of Tyr-RS | Cytokine receptors | Cytokine |
| GCN2, paralog of His-RS | Proteins, Histidine | Protein kinase, Histidine biosynthesis |
| HIsZ, a paralog of His-RS | Histidine | Histidine biosynthesis |
| LysK, paralog of LysRS | Lysine | Lysine biosynthesis |
| PheX, paralog of Phe-RS | ? | Undetermined |
| Polyβ, paralog of GlyRS | Polynucleotides | DNA polymerase |
| PoxA, paralog of LysRS | Elongation factor P | Modifies EF-P |
| PoxA/GenX, paralog of Lys-RS | Pyruvate | Pyruvate oxidase |
| ProX, paralog of Pro-RS | tRNA | Trans-editing of mischarged tRNA |
| Slimp, paralog of mito SerRS | RNA binding | Regulates rRNA, COX2 & 3 mRNA |
| Trbp111, paralog of Met-RS | mRNA | RNA editing |
| YadB, or GluX, a paralog of Glu-RS | Adds glutamate to first anticodon position of tRNA-Asp | Stress response |
| Ybak, paralog of ProRS | Cys-tRNA(Pro)/Cys-tRNA(Cys) deacylase | Trans-editing of mischarged tRNA |
| YvmC-Blic, class Ic aaRS paralog | Cyclodileucine | Non-ribosomal peptide synthesis |
Figure 1Locations of 5S rRNA, 16S rRNA, and 23S rRNA frame 1 homologies within two strains of E. coli K12 genomes. Note that the scale is non-linear. Homology locations are not identical between the 5S, 16S, and 23S rRNAs, demonstrating that the sequences probably have independent origins. Similarly, the homology locations are not identical even between two closely related E. coli K12 strains, suggesting that these sequences are subject to reorganization processes within these genomes.
Figure 2Locations of the six frames of 5S rRNA homologies within two strains of E. coli K12. Note that the scale is non-linear. As in the previous Figure, these homology locations differ between the two strains indicating that they are subject to reorganization processes. Notably, the locations also differ considerably from one reading frame to another, demonstrating that the search algorithm is identifying non-overlapping sequences.
5S rRNA protein control sequences utilized to test the frequency and quality of homologous sequences in the E. coli K12 genome. Sample numbers identify the mRNA of control proteins of the E. coli K12 genome accessed from the U. C. Santa Cruz microbial genome list: http://microbes.ucsc.edu/lists/eschColi_K12/refSeq-list.html. See text for definitions of “identities”, “similarities”, etc.
| Protein 5S Control | Count Data Total, Sum of Six Reading Frames | Sums across All Matches | Means across All Matches | |||||
|---|---|---|---|---|---|---|---|---|
| Sample | No. Matches | No. Perfect Identities | Total Length of Matches | Total Number of Similarities | Total Number of Identities | Mean Match Length | Mean Similarity Length | Mean Identity Length |
| b4626 | 83 | 2 | 632 | 568 | 550 | 7.6 | 7 | 6.6 |
| b0466 | 241 | 1 | 3325 | 2326 | 1886 | 13.8 | 9.6 | 7.8 |
| b4631 | 83 | 1 | 948 | 791 | 672 | 11.4 | 9.5 | 8.1 |
| b4509 | 127 | 3 | 1545 | 1234 | 1101 | 12.1 | 9.7 | 8.7 |
| b4515 | 127 | 1 | 1428 | 1150 | 990 | 11.2 | 9.1 | 7.8 |
| mean | 132.2 | 1.6 | 1575.6 | 1213.8 | 1039.8 | 11.2 | 8.9 | 7.8 |
| SD | 64.7 | 0.9 | 1044.9 | 677.7 | 523.8 | 2.3 | 1.1 | 0.8 |
5S rRNA random control sequences utilized to test the frequency and quality of similar sequences in the E. coli K12 genome. See text for definitions of “identities”, “similarities”, etc.
| Random Sequence 5S Control | Count Data Total, Sum 6 Reading Frames | Sums across All Matches | Means across All Matches | |||||
|---|---|---|---|---|---|---|---|---|
| Sample | No. Matches | No. Perfect Identities | Total Length of Matches | Total Number of Similarities | Total Number of Identities | Mean Match Length | Mean Similarity Length | Mean Identity Length |
| 120 bp 1 | 130 | 0 | 1882 | 1251 | 959 | 14.5 | 9.7 | 7.3 |
| 120 bp 2 | 141 | 0 | 1432 | 1107 | 926 | 10.3 | 7.9 | 6.6 |
| 120 bp 3 | 151 | 0 | 1715 | 1287 | 1040 | 11.4 | 8.5 | 6.9 |
| mean | 140.7 | 0 | 1676.3 | 1215 | 975 | 12.1 | 8.7 | 6.9 |
| SD | 10.5 | 0 | 227.5 | 95.2 | 58.7 | 2.2 | 0.9 | 0.4 |
Frequency and quality of 5S rRNA homologies in the E. coli K12 genome. See text for definitions of “identity”, “similarity”, etc.
| 5S rRNA | Count Data Total, Sum 6 Reading Frames | Sums across All Matches | Means across All Matches | |||||
|---|---|---|---|---|---|---|---|---|
| Sample | No. Matches | No. Perfect Identities | Total Length of Matches | Total Number of Similarities | Total Number of Identities | Mean Match Length | Mean Similarity Length | Mean Identity Length |
| 5S rRNA | 480 | 158 | 5696 | 4580 | 4136 | 11.5 | 9.5 | 8.6 |
Figure 3Comparison of 5S rRNA variables with matched protein mRNA and randomized RNA controls (original data in Tables above). Error bars show SD. While the mean length of the matches found in each group was very similar, the number of total matches, the number of identical amino acids found within those matches, and the number of exact matches found to the 5S rRNA was very significantly greater than those found for the proteins’ control mRNA and randomized RNA sequences tested.
16S rRNA protein control sequences utilized to test the frequency and quality of homologous sequences in the E. coli K12 genome. Sample numbers identify mRNA of control proteins of the E. coli K12 genome accessed from the U. C. Santa Cruz microbial genome list: http://microbes.ucsc.edu/lists/eschColi_K12/refSeq-list.html. See text for definitions of “identities”, “similarities”, etc.
| Protein 16S Control | Count Data, across All Six Reading Frames | Sums across All Matches | Means across All Matches | |||||
|---|---|---|---|---|---|---|---|---|
| Sample | No. Matches | No. Perfect Identities | Total Length of Matches | Total Number of Similarities | Total Number of Identities | Mean Match Length | Mean Similarity Length | Mean Identity Length |
| b0135 | 389 | 1 | 10,999 | 7041 | 5647 | 23.8 | 18.1 | 14.5 |
| b0070 | 414 | 1 | 9725 | 3923 | 5378 | 27.2 | 14.8 | 10.8 |
| b0376 | 625 | 1 | 15,008 | 8069 | 6029 | 24.0 | 12.9 | 9.6 |
| b0014 | 289 | 2 | 10,260 | 7043 | 5852 | 35.5 | 24.4 | 20.2 |
| b0091 | 247 | 1 | 7924 | 5385 | 4649 | 32.1 | 21.8 | 18.8 |
| mean | 392.8 | 0.8 | 10,783.2 | 6292.2 | 5511 | 28.5 | 18.4 | 14.8 |
| SD | 147.0 | 0.8 | 2620.3 | 1636.9 | 539.4 | 5.2 | 4.8 | 4.7 |
16S rRNA random control sequences utilized to test the frequency and quality of similar sequences in the E. coli K12 genome. See text for definitions of “identities”, “similarities”, etc.
| Random Sequence6s Control | Count Data, across All Six Reading Frames | Sums across All Matches | Means across All Matches | |||||
|---|---|---|---|---|---|---|---|---|
| Sample | No. Matches | No. Perfect Identities | Total Length of Matches | Total Number of Similarities | Total Number of Identities | Mean Match Length | Mean Similarity Length | Mean Identity Length |
| 1500 bp 1 | 129 | 0 | 2668 | 1658 | 1347 | 20.7 | 12.9 | 10.4 |
| 1500 bp 2 | 61 | 0 | 1816 | 998 | 704 | 29.8 | 16.4 | 11.5 |
| 1500 bp 3 | 70 | 0 | 2100 | 1124 | 910 | 30 | 16.1 | 13 |
| 1500 bp 4 | 82 | 0 | 2273 | 1269 | 889 | 27.7 | 10.8 | 14.5 |
| 1500 bp 5 | 68 | 0 | 2090 | 1113 | 792 | 30.7 | 16.4 | 11.7 |
| mean | 82 | 0 | 2189.4 | 1232.4 | 928.4 | 27.8 | 15.4 | 11.5 |
| SD | 27.3 | 0 | 313.6 | 256.6 | 248 | 4.1 | 1.5 | 1.0 |
Frequency and quality of 16S rRNA homologies in the E. coli K12 genome. See text for definitions of “identity”, “similarity”, etc.
| 16S rRNA | Count Data, across All Reading Frames | Sums across All Matches | Means across All Matches | |||||
|---|---|---|---|---|---|---|---|---|
| Sample | No. Matches | No. Perfect Identities | Total Length of Matches | Total Number of Similarities | Total Number of Identities | Mean Match Length | Mean Similarity Length | Mean Identity Length |
| 16S rRNA | 651 | 49 | 36,488 | 28,126 | 25,848 | 56.1 | 43.2 | 39.7 |
Figure 4Comparison of 16S rRNA variables with matched protein mRNA and randomized RNA controls (original data in Tables above). Error bars show SD. The mean number of matches found within the E. coli genome was greatest for the 16S rRNA, as was the mean number of perfect identities, length of the sequences with identical sequences and the number of identical amino acid matches found within each match was significantly greater for the 23S rRNA than for the mRNA of the proteins controls or the randomized RNA sequences.
23S rRNA protein control sequences utilized to test the frequency and quality of homologous sequences in the E. coli K12 genome. Sample numbers identify mRNA of control proteins of the E. coli K12 genome accessed from the U. C. Santa Cruz microbial genome list: http://microbes.ucsc.edu/lists/eschColi_K12/refSeq-list.html. See text for definitions of “identities”, “similarities”, etc.
| Protein 23S Control | Count Data, across All Six Reading Frames | Sums across All Matches | Means across All Matches | |||||
|---|---|---|---|---|---|---|---|---|
| Sample | No. Matches | No. Perfect Identities | Total Length of Matches | Total Number of Similarities | Total Number of Identities | Mean Match Length | Mean Similarity Length | Mean Identity Length |
| b0397 | 374 | 1 | 12,537 | 8947 | 7894 | 39 | 24 | 21.1 |
| b0993 | 404 | 1 | 14,283 | 9795 | 8257 | 35.3 | 24.2 | 20.4 |
| mean | 389 | 1 | 13,405 | 9371 | 8075.5 | 37.2 | 24.1 | 20.8 |
| SD | 21.1 | 0 | 1234.6 | 599.6 | 256.7 | 2.6 | 0.1 | 0.5 |
23S rRNA random control sequences utilized to test the frequency and quality of similar sequences in the E. coli K12 genome. See text for definitions of “identities”, “similarities”, etc.
| Random Sequence23S Control | Count Data, across All Six Reading Frames | Sums across All Matches | Means across All Matches | |||||
|---|---|---|---|---|---|---|---|---|
| Sample | No. Matches | No. Perfect Identities | Total Length of Matches | Total Number of Similarities | Total Number of Identities | Mean Match Length | Mean Similarity Length | Mean Identity Length |
| 2900 bp 1 | 130 | 0 | 1882 | 1251 | 959 | 14.5 | 9.7 | 7.4 |
| 2900 bp 2 | 141 | 0 | 1432 | 1107 | 926 | 10.3 | 7.9 | 6.6 |
| 2900 bp 3 | 151 | 0 | 1715 | 1287 | 1040 | 11.3 | 8.5 | 6.9 |
| 2900 bp 4 | 117 | 0 | 1192 | 934 | 761 | 10.2 | 8.0 | 6.5 |
| 2900 bp 5 | 120 | 0 | 1235 | 958 | 805 | 10.3 | 8.0 | 6.7 |
| mean | 131.8 | 0 | 1491.2 | 1107.4 | 898.2 | 11.3 | 8.4 | 6.8 |
| SD | 14.3 | 0 | 300.6 | 162.2 | 114.1 | 1.8 | 0.7 | 0.4 |
Frequency and quality of 23S rRNA homologies in the E. coli K12 genome. See text for definitions of “identity”, “similarity”, etc.
| 23S rRNA | Count Data, across All Reading Frames | Sums across All Matches | Means across All Matches | |||||
|---|---|---|---|---|---|---|---|---|
| Sample | No. Matches | No. Perfect Identities | Total Length of Matches | Total Number of Similarities | Total Number of Identities | Mean Match Length | Mean Similarity Length | Mean Identity Length |
| 23S rRNA | 195 | 18 | 41,096 | 38,529 | 38,119 | 210.7 | 197.6 | 192.9 |
Figure 5Comparison of 23S rRNA variables with matched protein mRNA and randomized RNA controls (original data in Tables above). Error bars show SD. Although the mean number of matches found within the E. coli genome was greatest for the protein controls, the mean number of perfect identities, length of the sequences with identical sequences and the number of identical amino acid matches found within each match was significantly greater for the 23S rRNA than for the mRNA of the protein controls or the randomized RNA sequences.
Figure 6A summary of the evidence for pleiofunctionality in rRNA-related molecules supporting the hypothesis of a self-replicating ribosome. rRNA = ribosomal ribonucleic acid (only one rRNA is shown though there are three in the ribosome, the 5S, 16S, and 23S). crRNA = complementary strand of rRNA posited to exist to make possible rRNA replication in self-replicating ribosome. The crRNA would also act as a messenger RNA (mRNA), or act itself as a source of transcription to produce mRNAs (see Root-Bernstein and Root-Bernstein, 2016; 2017). tRNA = transfer ribonucleic acid. aa-synthetase = amino acyl synthetase, which loads amino acids onto tRNA. Ribo-protein = ribosomal proteins (see Table 3, Table 4, Table 5 and Table 6). Boxes summarize the data from Table 1, Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7 and the related text.
Figure 7Schematic model of how RNAs perform various ribosomal functions and/or encoding various ribosome-related proteins may have become ligated to form the first genome. The visual formalisms are the same as in Figure 6 and illustrate the idea that “genes” that may have existed as overlapping sequences in a variety of reading frames within a self-replicating set or rRNAs may have been separated out into a single reading frame for better functional control. This schema does not identify whether the original primitive genome was constituted of RNA or DNA (though RNA seems likely if a ribosomal origin is correct) but given a common origin for RNA and DNA polymerases (see text) the existence of either one could have provided the basis for the other. It is also important to remember that SINEs and other insertion-facilitating RNAs most likely evolved from tRNAs or that they have common ancestors (see text), so that molecular mechanisms of genome growth would have been present from the outset in a ribosome-first scenario.