| Literature DB >> 29449833 |
Hervé Seligmann1, Didier Raoult1.
Abstract
We examine the hypothesis that de novo template-free RNAs still form spontaneously, as they did at the origins of life, invade modern genomes, contribute new genetic material. Previously, analyses of RNA secondary structures suggested that some RNAs resembling ancestral (t)RNAs formed recently de novo, other parasitic sequences cluster with rRNAs. Here positive control analyses of additional RNA secondary structures confirm ancestral and de novo statuses of RNA grouped according to secondary structure. Viroids with branched stems resemble de novo RNAs, rod-shaped viroids resemble rRNA secondary structures, independently of GC contents. 5' UTR leading regions of West Nile and Dengue flavivirid viruses resemble de novo and rRNA structures, respectively. An RNA homologous with Megavirus, Dengue and West Nile genomes, copperhead snake microsatellites and levant cotton repeats, not templated by Mimivirus' genome, persists throughout Mimivirus' infection. Its secondary structure clusters with candidate de novo RNAs. The saltatory phyletic distribution and secondary structure of Mimivirus' peculiar RNA suggest occasional template-free polymerization of this sequence, rather than noncanonical transcriptions (swinger polymerization, posttranscriptional editing).Entities:
Keywords: 3′-to-5′ polymerization; Acanthamoeba castellanii; invertase; swinger DNA polymerization; systematic nucleotide exchange; transcription
Year: 2018 PMID: 29449833 PMCID: PMC5799277 DOI: 10.3389/fmicb.2018.00101
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Hierarchical cluster of secondary structures formed by diverse RNA molecules, adapted from Seligmann and Raoult (2016).
Secondary structure properties of sequences analyzed here.
| Circ1 | 22 | 36.36 | 62.50 | 36.36 | 27.27 | B1 |
| Circ2 | 22 | 27.27 | 83.33 | 27.27 | 27.27 | B1 |
| Circ3 | 22 | 59.09 | 38.46 | 59.09 | 27.27 | D2 |
| Circ4 | 22 | 54.55 | 41.67 | 54.55 | 27.27 | D2 |
| Circ5 | 22 | 63.64 | 35.71 | 63.64 | 27.27 | D2 |
| Circ6 | 22 | 54.55 | 35.71 | 63.64 | 27.27 | D2 |
| Circ7 | 22 | 72.73 | 25.00 | 54.55 | 13.64 | D2 |
| Circ8 | 22 | 54.55 | 31.25 | 72.73 | 27.27 | D1 |
| Circ9 | 22 | 45.46 | 41.67 | 54.55 | 27.27 | D2 |
| Circ10 | 22 | 18.18 | 30.00 | 45.46 | 13.64 | D2 |
| Circ11 | 22 | 18.18 | 75.00 | 18.18 | 13.64 | B1 |
| Circ12 | 22 | 63.64 | 75.00 | 18.18 | 13.64 | B1 |
| Circ13 | 22 | 54.55 | 35.71 | 63.64 | 27.27 | D2 |
| Circ14 | 22 | 27.27 | 25.00 | 54.55 | 13.64 | D2 |
| Circ15 | 22 | 18.18 | 50.00 | 27.27 | 13.64 | B1 |
| Circ16 | 22 | 18.18 | 75.00 | 18.18 | 13.64 | B1 |
| Circ17 | 22 | 54.55 | 75.00 | 18.18 | 13.64 | B1 |
| Circ18 | 22 | 63.64 | 25.00 | 54.55 | 13.64 | D2 |
| Circ19 | 22 | 18.18 | 21.43 | 63.64 | 13.64 | D1 |
| Circ20 | 22 | 27.27 | 75.00 | 18.18 | 13.64 | B1 |
| Circ21 | 22 | 27.27 | 50.00 | 27.27 | 13.64 | B1 |
| Circ22 | 22 | 27.27 | 50.00 | 27.27 | 13.64 | B1 |
| Circ23 | 22 | 27.27 | 83.33 | 27.27 | 27.27 | B1 |
| Circ24 | 22 | 27.27 | 83.33 | 27.27 | 27.27 | B1 |
| 51 | 16.67 | 40.00 | 56.00 | 60.00 | B2 | |
| MITE16 | 48 | 70.83 | 23.53 | 78.57 | 38.24 | D1 |
| MITE20 | 66 | 48.48 | 28.13 | 58.82 | 28.13 | D1 |
| MITE4 | 27 | 85.19 | 17.39 | 50.00 | 43.48 | D2 |
| MITE21 | 60 | 85.00 | 54.90 | 90.00 | 35.29 | D2 |
| MITE14 | 60 | 66.67 | 32.50 | 70.00 | 32.50 | D2 |
| MITE8 | 57 | 68.42 | 53.85 | 100.00 | 30.77 | D2 |
| MITE2 | 43 | 67.44 | 17.24 | 64.29 | 34.48 | D2 |
| MITE25 | 62 | 54.84 | 38.24 | 64.29 | 20.59 | D2 |
| MITE9 | 66 | 54.55 | 36.11 | 80.77 | 27.78 | D1 |
| MITE7 | 57 | 68.42 | 30.77 | 50.00 | 33.33 | D2 |
| MITE12 | 68 | 64.71 | 22.73 | 87.50 | 31.82 | D1 |
| MITE29 | 62 | 70.97 | 34.09 | 94.44 | 27.27 | D1 |
| MITE28 | 71 | 61.97 | 20.45 | 84.62 | 31.82 | D1 |
| MITE24 | 64 | 59.38 | 23.68 | 50.00 | 28.95 | D2 |
| MITE19 | 67 | 71.64 | 22.92 | 80.00 | 35.42 | D1 |
| MITE17 | 65 | 73.85 | 29.17 | 87.50 | 37.50 | D1 |
| MITE10 | 72 | 52.78 | 47.37 | 82.35 | 36.84 | D1 |
| MITE26 | 62 | 67.74 | 26.19 | 70.00 | 26.19 | D2 |
| MITE13 | 63 | 68.25 | 76.74 | 95.00 | 27.91 | D2 |
| MITE22 | 62 | 64.52 | 32.50 | 75.00 | 35.00 | D1 |
| MITE5 | 45 | 60.00 | 37.04 | 68.75 | 33.33 | D2 |
| MITE11 | 41 | 56.10 | 26.09 | 88.89 | 21.74 | D1 |
| MITE23 | 62 | 70.97 | 34.09 | 83.33 | 34.09 | D1 |
| MITE1 | 61 | 60.66 | 37.84 | 58.33 | 37.84 | D2 |
| MITE15 | 61 | 67.21 | 31.71 | 75.00 | 29.27 | D2 |
| MITE6 | 36 | 61.11 | 36.36 | 57.14 | 40.91 | D2 |
| MITE3 | 57 | 63.16 | 22.22 | 54.55 | 33.33 | D2 |
| MITE18 | 62 | 61.29 | 42.11 | 75.00 | 26.32 | D2 |
| MITE27 | 62 | 64.52 | 25.00 | 60.00 | 32.50 | D2 |
| Mito 5S | 44 | 36.36 | 37.50 | 14.29 | 0.00 | B2 |
| Mito 16S | 46 | 52.17 | 25.00 | 4.46 | 29.17 | D2 |
| Mito 18S | 59 | 66.10 | 28.21 | 45.00 | 20.51 | D2 |
| Mito 23S1 | 375 | 46.93 | 16.48 | 14.82 | 20.46 | D2 |
| Mito 23S2 | 276 | 43.12 | 19.33 | 29.30 | 23.53 | D2 |
| Mito 23S3 | 209 | 37.32 | 24.36 | 23.19 | 19.23 | B2 |
| Mimi 5S | 41 | 75.61 | 25.81 | 10.00 | 16.13 | B2 |
| Mimi 16S | 39 | 45.76 | 33.33 | 16.17 | 14.82 | B2 |
| Mimi 18S | 59 | 52.54 | 12.90 | 17.86 | 12.90 | D2 |
| Mimi 23S1 | 380 | 48.95 | 19.36 | 11.60 | 19.36 | D2 |
| Mimi 23S2 | 349 | 39.26 | 23.36 | 21.91 | 25.55 | B2 |
| Mimi 23S3 | 207 | 34.78 | 22.22 | 18.06 | 12.50 | B2 |
| WNV 3′-5 | 105 | 20.00 | 19.05 | 54.02 | 33.33 | D1 |
| DENV 3′-5 | 105 | 27.62 | 24.14 | 50.00 | 27.59 | D1 |
| JEV 3′-5 | 111 | 23.13 | 13.79 | 51.22 | 27.59 | D1 |
| YFV 3′-5 | 107 | 23.37 | 20.00 | 57.32 | 36.00 | D1 |
| 5′-UTR Dengue | 123 | 49.59 | 34.43 | 26.02 | 45.90 | D2 |
| Surrounding | 92 | 52.17 | 43.75 | 30.68 | 31.25 | B2 |
Variables are: N- sequence length (not used for classifying RNAs in further analyses); Loop-percentage of nucleotides not involved in self-hybridization; eLoop-percentage of nucleotides among those in loops, in closed loops at stem extremity; percentage of GC in stems, and loops. “Cluster” indicates the cluster in Figure 1 with the highest similarity in secondary structure properties. “Circ” indicates sequences generated by simulations attempting to mimic circular RNA genesis (Demongeot and Moreira, 2007); “De novo” is a template-free synthesized sequence (Béguin et al., 2015); MITE1-29 are Pandoravirus'miniature inverted-repeat transposable elements from Submariner family (Sun et al., 2015),
indicates MITE inserted in a protein coding gene; Mito and Mimi are amoeban rRNA sequences aligning with Mimivirus sequences (alignments described in Table .
Figure 2Multivariate comparisons between estimates of secondary structure variables for theoretical sequence Circ1 (from Table 1, x-axis) with corresponding variables for the cloverleaf (filled symbols, continous line) and OL-like (empty symbols, interrupted line) structures formed by the human mitochondrial tRNA Asn. Pearson correlation coefficients r = 0.969, P = 0.0; and r = 0.450, P < 0.05, respectively (values in the figure indicate the square of r). Such comparisons are calculated for each pair of secondary structures that are compared, RNAs are classified into secondary structure clusters (Figure 1) according to highest r.
Secondary structure variables and classification of 10 viroids forming rod-shaped secondary structures.
| CVd IV | 284 | 28.87 | 9.76 | 64.36 | 36.37 | D1 |
| TPMV | 360 | 33.06 | 10.08 | 36.37 | 42.02 | D2 |
| TASV | 360 | 28.33 | 11.77 | 60.02 | 34.31 | D1 |
| ASBV | 359 | 29.81 | 7.48 | 63.49 | 42.06 | D1 |
| PTSV | 247 | 32.39 | 17.50 | 40.12 | 31.25 | D2 |
| CSV | 366 | 28.96 | 6.60 | 55.39 | 41.51 | D2 |
| PBCVd | 315 | 31.43 | 31.32 | 71.76 | 37.37 | D1 |
| ADFVd | 310 | 34.52 | 11.22 | 61.58 | 37.37 | D1 |
| CEVd | 371 | 30.46 | 7.97 | 66.67 | 46.90 | D1 |
| HSVd | 302 | 31.46 | 8.42 | 66.67 | 33.68 | D1 |
Secondary structures are according to corresponding references 1–6. All rod-shaped viroids, independently of GC contents, group with 23S (D1) and 16S (D2) rRNAs, indicating ancient origins.
Citrus viroid IV (Puchta et al., 1991);
Tomato planta macho viroid, tomato apical stunt viroid (Kiefer et al., 1983);
Avocado sunblotch viroid, Potato spindle tuper viroid, Chrysanthemum stunt viroid (Symons, 1981);
Pear blister canker viroid (Hernandez et al., 1992);
Apple dimple fruit viroid, (Chiumenti et al., 2014);
Citrus exocortis viroid, Hop stunt viroid (Lin et al., .
Sequences aligned between Acanthamoeba castellani's mitochondrial rRNA genes and Acanthamoeba polyphaga Mimivirus' genome.
| 5S | 0.001 | 58–101 | 768283–768243 |
| 18S | 0.069 | 104–162 | 23259–23317 |
| 16S | 0.009 | 1103–1148 | 759756–759718 |
| 23S1 | 0.0007 | 3–377 | 500064–500442 |
| 23S2 | 0.002 | 962–1291 | 175514–175166 |
| 23S3 | 0.0003 | 1970–2181 | 89598–89391 |
Figure 3Kinetics of the swinger transcriptome (asymmetric nucleotide exchanges) of Mimivirus. Concentric circles indicate different periods in the cycle of Mimivirus. Bars indicate swinger RNAs detected for a given genomic region, the swinger transformation rule is indicated next to the bar. Swinger RNAs in boxes 1 and 2 are the exact inverse complements of each other, and correspond to the 5′ UTR leading region of Dengue and West Nile viruses.
Swinger transformations of the genome of Mimivirus, and swinger transcripts.
| A<->C | 5 | 51.40 | 7261 | 35.52 |
| A<->G | 8 | 211.63 | 5294 | 36.47 |
| A<->T | 0 | 7234 | 35.99 | |
| C<->G | 173 | 196.74 | 7314 | 35.92 |
| C<->T | 12 | 199.58 | 5800 | 36.29 |
| G<->T | 1 | 56.00 | 5553 | 35.63 |
| A<->C-G<->T | 0 | 2634 | 36.60 | |
| A<->G-C<->T | 1 | 68.00 | 2672 | 36.37 |
| A<->T-C<->G | 0 | 1805 | 37.14 | |
| A->C->G->A | 9 | 62.11 | 6238 | 35.73 |
| A->C->T->A | 1 | 64.00 | 4963 | 36.64 |
| A->G->C->A | 2 | 70.50 | 5013 | 36.68 |
| A->G->T->A | 1 | 79.00 | 6127 | 35.70 |
| A->T->C->A | 0 | 6325 | 35.97 | |
| A->T->G->A | 0 | 4826 | 36.72 | |
| C->G->T->C | 0 | 4856 | 36.68 | |
| C->T->G->C | 9 | 52.67 | 5962 | 35.70 |
| A->C->G->T->A | 4 | 84.50 | 6007 | 36.28 |
| A->C->T->G->A | 1 | 77.00 | 8421 | 35.83 |
| A->G->C->T->A | 2 | 74.50 | 7675 | 35.45 |
| A->G->T->C->A | 0 | 8444 | 35.85 | |
| A->T->C->G->A | 0 | 5289 | 35.71 | |
| A->T->G->C->A | 0 | 5429 | 36.41 | |
Columns are: 1–2. Number and mean length (nucleotides) of swinger RNAs matching the Mimivirus' genome swinger version over>half of its length with >80% identity, 454 sequencing, all times since infection pooled (Legendre et al., .
Figure 4Alignment between sequence in Boxes 1 and 2 (Figure 3) and homologous sequences: Mimivirus reference genome sequence.Underlined are the detected swinger RNA sequences. Small caps indicate neighboring sequences that are not transformed by nucleotide exchange.
Figure 5Secondary structure formed by the swinger C>T/U>G RNA (in box) and its untransformed neighboring regions.