| Literature DB >> 31075860 |
Pablo Catalán1,2, Santiago F Elena3,4, José A Cuesta5,6,7,8, Susanna Manrubia9,10.
Abstract
Viroids are small, non-coding, circular RNA molecules that infect plants. Different hypotheses for their evolutionary origin have been put forward, such as an early emergence in a precellular RNA World or several de novo independent evolutionary origins in plants. Here, we discuss the plausibility of de novo emergence of viroid-like replicons by giving theoretical support to the likelihood of different steps along a parsimonious evolutionary pathway. While Avsunviroidae-like structures are relatively easy to obtain through evolution of a population of random RNA sequences of fixed length, rod-like structures typical of Pospiviroidae are difficult to fix. Using different quantitative approaches, we evaluated the likelihood that RNA sequences fold into a rod-like structure and bear specific sequence motifs facilitating interactions with other molecules, e.g., RNA polymerases, RNases, and ligases. By means of numerical simulations, we show that circular RNA replicons analogous to Pospiviroidae emerge if evolution is seeded with minimal circular RNAs that grow through the gradual addition of nucleotides. Further, these rod-like replicons often maintain their structure if independent functional modules are acquired that impose selective constraints. The evolutionary scenario we propose here is consistent with the structural and biochemical properties of viroids described to date.Entities:
Keywords: RNA secondary structure; computational simulations; modular evolution; population dynamics; simple replicators; structure enumeration; viroid
Mesh:
Substances:
Year: 2019 PMID: 31075860 PMCID: PMC6563258 DOI: 10.3390/v11050425
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Structural properties of viroids of length ℓ. We list their G + C content, the minimal folding energy E in kcal mol (see Materials and Methods), and the number h of hairpins and p of base pairs. For comparison, the expected values of the number of hairpins, , and base pairs, , in exact calculations of structures of each length (cf. Equation (6)) are reported along the actual values.
| Family | Genus | Species |
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|
| Avsunviroidae |
| ASBVd | 247 | 38.1 |
| 3 | 10.0 | 84 | 71.5 |
|
| ELVd | 335 | 54.0 |
| 5 | 13.4 | 118 | 96.7 | |
|
| CChMVd | 399 | 55.4 |
| 12 | 15.8 | 140 | 115.1 | |
| PLMVd | 337 | 52.5 |
| 8 | 13.4 | 121 | 97.3 | ||
| Pospiviroidae |
| ADFVd | 306 | 58.5 |
| 2 | 12.3 | 111 | 88.4 |
| ASSVd | 329 | 60.5 |
| 2 | 13.1 | 110 | 95.0 | ||
| AGVd | 369 | 58.0 |
| 2 | 14.7 | 135 | 106.5 | ||
| CBLVd | 318 | 59.4 |
| 2 | 12.7 | 106 | 91.9 | ||
| CDVd | 294 | 54.4 |
| 2 | 11.8 | 101 | 85.0 | ||
| CVd-V | 294 | 60.2 |
| 2 | 11.8 | 102 | 85.0 | ||
| CVd-VI | 330 | 60.0 |
| 3 | 13.2 | 109 | 95.3 | ||
| GYSVd | 367 | 61.0 |
| 2 | 14.6 | 132 | 105.9 | ||
| GYSVd-1 | 363 | 60.0 |
| 7 | 14.4 | 133 | 104.8 | ||
| GYSVd-2 | 361 | 59.6 |
| 7 | 14.4 | 132 | 104.2 | ||
| PBCVd | 315 | 60.6 |
| 5 | 12.6 | 109 | 91.0 | ||
| PVd-2 | 358 | 59.8 |
| 6 | 14.2 | 128 | 103.3 | ||
|
| CBCVd | 284 | 55.6 |
| 2 | 11.4 | 102 | 82.1 | |
| CCCVd | 246 | 59.3 |
| 2 | 10.0 | 84 | 71.2 | ||
| CTiVd | 254 | 60.2 |
| 2 | 10.3 | 81 | 73.5 | ||
| HLVd | 256 | 57.0 |
| 2 | 10.4 | 85 | 74.1 | ||
|
| CBVd | 295 | 52.9 |
| 2 | 11.9 | 107 | 85.3 | |
| CBVd-1 | 248 | 55.2 |
| 2 | 10.1 | 90 | 71.8 | ||
| CBVd-2 | 301 | 60.1 |
| 4 | 12.1 | 114 | 87.0 | ||
| CBVd-3 | 361 | 55.7 |
| 2 | 14.4 | 134 | 104.2 | ||
|
| DLVd | 342 | 58.8 |
| 2 | 13.6 | 120 | 98.7 | |
| HSVd | 297 | 56.2 |
| 2 | 11.9 | 101 | 85.9 | ||
|
| CSVd | 354 | 52.8 |
| 2 | 14.1 | 125 | 102.2 | |
| CEVd | 371 | 60.4 |
| 2 | 14.7 | 135 | 107.1 | ||
| CLVd | 370 | 58.1 |
| 2 | 14.7 | 128 | 106.8 | ||
| IrVd | 370 | 61.6 |
| 2 | 14.7 | 131 | 106.8 | ||
| MPVd | 360 | 58.9 |
| 2 | 14.3 | 126 | 103.9 | ||
| PCFVd | 348 | 59.2 |
| 2 | 13.9 | 125 | 100.5 | ||
| PSTVd | 359 | 58.2 |
| 2 | 14.3 | 129 | 103.6 | ||
| TASVd | 360 | 55.6 |
| 2 | 14.3 | 133 | 103.9 | ||
| TCDVd | 360 | 56.7 |
| 2 | 14.3 | 127 | 103.9 | ||
| TPMVd | 360 | 57.5 |
| 2 | 14.3 | 123 | 103.9 | ||
| Unclassified | AFCVd | 372 | 56.7 |
| 2 | 14.8 | 132 | 107.3 | |
| CVd-VII | 368 | 52.2 |
| 3 | 14.6 | 128 | 106.2 | ||
| GLVd | 328 | 58.2 |
| 4 | 13.1 | 109 | 94.7 | ||
| PVd | 396 | 57.8 |
| 5 | 15.7 | 143 | 114.2 | ||
| RVd-I | 361 | 58.4 |
| 2 | 14.4 | 126 | 104.2 |
Number of circular RNA structures of length ℓ, exhibiting h hairpins and p base pairs.
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
| 10 |
| 1 | |||||||||
| 11 |
| 1 | |||||||||
| 12 |
| 2 | 1 | ||||||||
| 13 |
| 2 | 1 | ||||||||
| 14 |
| 3 | 2 | 1 | |||||||
| 15 |
| 3 | 2 | 2 | |||||||
| 16 |
| 4 | 3 | 6 | 1 | ||||||
| 17 |
| 4 | 3 | 10 | 3 | ||||||
| 18 |
| 5 | 4 | 19 | 9 | 1 | |||||
| 19 |
| 5 | 4 | 28 | 18 | 4 | |||||
| 20 |
| 6 | 5 | 44 | 33 | 16 | 1 | ||||
| 21 |
| 6 | 5 | 60 | 53 | 36 | 5 | ||||
|
| 1 | ||||||||||
| 22 |
| 7 | 6 | 85 | 81 | 82 | 23 | 1 | |||
|
| 2 | ||||||||||
| 23 |
| 7 | 6 | 110 | 116 | 151 | 64 | 6 | |||
|
| 7 | 1 | |||||||||
| 24 |
| 8 | 7 | 146 | 161 | 276 | 157 | 34 | 1 | ||
|
| 20 | 6 | |||||||||
| 25 |
| 8 | 7 | 182 | 215 | 452 | 322 | 106 | 7 | ||
|
| 42 | 21 | 2 | ||||||||
| 26 |
| 9 | 8 | 231 | 281 | 731 | 614 | 294 | 45 | 1 | |
|
| 84 | 56 | 14 | ||||||||
| 27 |
| 9 | 8 | 280 | 358 | 1106 | 1068 | 665 | 166 | 8 | |
|
| 156 | 126 | 57 | 4 | |||||||
| 28 |
| 10 | 9 | 344 | 449 | 1652 | 1773 | 1401 | 507 | 60 | 1 |
|
| 264 | 252 | 176 | 28 | |||||||
|
| 1 | ||||||||||
| 29 |
| 10 | 9 | 408 | 553 | 2360 | 2783 | 2668 | 1279 | 248 | 9 |
|
| 429 | 462 | 456 | 130 | 5 | ||||||
|
| 2 |
Figure 1Summary of the results for the evolution of populations of circular RNA sequences of fixed length. Histograms in color correspond to numerical simulations with parameters specified in the legend. Lines are kernel density estimates of the underlying distribution. Grey bars correspond to viroids in Table 1. (a) Number of hairpins h; (b) number of base pairs p; (c) minimum free energy of the secondary structure; (d) G + C content.
Figure 2Evolutionary dynamics of RNA populations with fixed (a,c) and variable (b,d) length. In the former case, simulations start with a random sequences of length 300 nt; in the latter, of length 30 nt. (a) The number of hairpins decreased initially, but rapidly froze around six; the number of base pairs increased until a value near the maximum possible for that number of hairpins. The most abundant structure is shown at three different time points, as indicated by arrows. (b) As in (a), for sequences growing in length. (c) Dynamics of energy and G + C content for the same run shown in (a). (d) As in (c), for the run shown in (b).
Figure 3Effects of modular evolution in the structure of rod-like folds. Small arrows signal the recombination site; large arrows indicate the transformation of the structure upon recombination with the hammerhead. (a) Structure of the example hammerhead to be recombined with evolved sequences. (b) Part of the structure is maintained and part disrupted. The hammerhead structure is lost. (c) Both recombining structures are preserved. (d) The hammerhead structure is preserved, but the rod-like fold is partly disrupted.
Figure 4Schematics of a parsimonious scenario leading to the emergence of viroid-like replicons de novo. Structures in black represent circulating RNAs with various structures and functions; structures in red stand for hairpins with RNA ligase activity; stars indicate positions in the sequence that are fixed, different colors corresponding to possible motifs that interact with different molecules. (a) Circulating pool. There is a pool of RNA sequences of various origins. Short random sequences spontaneously fold into hairpin structures. (b) Circularization. Hairpins are able to catalyze ligation reactions, so a fraction of open chains would close in the presence of hairpins. (c) Modular growth. Independent RNAs can ligate through reactions analogous to those causing circularization. Both reactions in (b,c) yield novel molecules that add to the circulating pool. (d) A non-negligible fraction of circulating RNAs might have specific sequence positions that promote interactions with a polymerase. Once this process starts, selection for improved replication is triggered. Those RNAs would become more prevalent in the circulating pool. (e) Minimal circular replicons might grow in length through the random addition of nucleotides to their sequences. Sequence motifs with no specific function can evolve to improve the replicative ability of the molecule (e.g., by increasing mobility or selecting additional positions to interact with the polymerase). (f) New functions can be acquired through recombination of functional RNAs in the circulating pool. (g) Sufficiently long replicons that may arise from processes as those in (e) and/or (f) might respond to a variety of selection pressures. In practice, these replicons occupy a niche in the molecular ecology equivalent to that of viroids.