| Literature DB >> 36048827 |
Okiemute Beatrice Omoru1, Filipe Pereira2,3, Sarath Chandra Janga1,4,5, Amirhossein Manzourolajdad1,6.
Abstract
SARS-CoV-2 has affected people worldwide as the causative agent of COVID-19. The virus is related to the highly lethal SARS-CoV-1 responsible for the 2002-2003 SARS outbreak in Asia. Research is ongoing to understand why both viruses have different spreading capacities and mortality rates. Like other beta coronaviruses, RNA-RNA interactions occur between different parts of the viral genomic RNA, resulting in discontinuous transcription and production of various sub-genomic RNAs. These sub-genomic RNAs are then translated into other viral proteins. In this work, we performed a comparative analysis for novel long-range RNA-RNA interactions that may involve the Spike region. Comparing in-silico fragment-based predictions between reference sequences of SARS-CoV-1 and SARS-CoV-2 revealed several predictions amongst which a thermodynamically stable long-range RNA-RNA interaction between (23660-23703 Spike) and (28025-28060 ORF8) unique to SARS-CoV-2 was observed. The patterns of sequence variation using data gathered worldwide further supported the predicted stability of the sub-interacting region (23679-23690 Spike) and (28031-28042 ORF8). Such RNA-RNA interactions can potentially impact viral life cycle including sub-genomic RNA production rates.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36048827 PMCID: PMC9436084 DOI: 10.1371/journal.pone.0260331
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Predicted long-range RNA-RNA base-pairing interactions between Spike and the full genomic RNA.
Spike sequence segments of length 500nt and overlap of 50nt were queried against the full genomes using IntaRNA software package. Each individual test resulted in at most five hits. All hits are summarized for both SARS-CoV-1 and SARS-CoV-2 (See Materials and Methods for details).
Top quantile predicted long-range RNA-RNA base-pairing interactions between the Spike region the full genome for both SARS-CoV-1 and SARS-CoV-2 using IntaRNA software package.
| Rank | SARS-CoV | Hit Start | Hit End | Target Start | Target End | Total Length | Energy | Residual | Target Gene |
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| 2 | 1 | 22604 | 22631 | 12507 | 12532 | 54 | -21.23 | -7.1602061 | ORF1a |
|
|
|
|
|
|
|
|
|
|
|
| 4 | 1 | 24396 | 24414 | 25582 | 25602 | 40 | -18.19 | -4.5667802 | ORF3a |
| 5 | 1 | 24841 | 24877 | 2247 | 2288 | 79 | -19.26 | -4.3927522 | ORF1a |
| 6 | 1 | 25198 | 25239 | 17014 | 17053 | 82 | -19.1 | -4.1370578 | ORF1b |
|
|
|
|
|
|
|
|
|
|
|
| 8 | 1 | 23698 | 23734 | 26957 | 27000 | 81 | -18.87 | -3.9389559 | M |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14 | 1 | 21523 | 21559 | 2321 | 2354 | 71 | -17.53 | -2.9179375 | ORF1a |
| 15 | 2 | 25331 | 25358 | 18595 | 18620 | 54 | -16.79 | -2.7202061 | ORF1b |
| 16 | 1 | 24667 | 24677 | 24104 | 24114 | 22 | -15.37 | -2.320947 | Spike |
|
|
|
|
|
|
|
|
|
|
|
| 18 | 1 | 22530 | 22558 | 20039 | 20077 | 68 | -16.67 | -2.1536319 | ORF1b |
See Materials and Methods for details. There was a total of 69 independent hits across both genomes. Complete results included as S2 Table. Column SARS-CoV denotes the strain. Column TotalLength denotes length of the interacting regions (query + target). Ranking is according to residual values against the generalized linear model where length of interaction was used to estimate interaction energy. The built-in function glm(energy~length, data = data, family = "gaussian")in R programming language was used to fit the model. Length coefficient = -0.03190. Length was a significant factor in the model. (Pr(>|t|) for length = 0.00067. Median of residuals = -0.2287). 1-Quantile of residuals = -2.1536. SARS-CoV-2 hits are shown as bold. Rank 11 also shown with * denotes the SARS-CoV-2 Spike-ORF8 interaction.
Fig 2Long-range RNA-RNA interaction between Spike and ORF8 regions of SARS-CoV-2 genome.
Interacting intervals are (23660–23703 Spike) and (28025–28060 ORF8). Prediction done via IntraRNA software. Base pairs with ‘plus’ notation denote stable sub-interactions. The stable sub-interaction is shown within the red rectangle in and denoted as the Core interacting region: (23679–23690 Spike) and (28031–28042 ORF8).
Coordinates of interacting base pairs between (23660–23703 Spike) and (28025–28060 ORF8) for which nucleotide variations were observed.
|
|
|
|
| 23660 | 28060 | 0 |
| 23661 | 28059 | 0 |
| 23662 | 28058 | 0 |
|
|
|
|
|
|
|
|
| 23671 | 28050 | 0 |
| 23672 | 28049 | 0 |
|
|
|
|
| 23674 | 28046 | 0 |
|
|
|
|
| 23676 | 28044 | 0.04 |
| 23677 | 28043 | 0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Total number of sequences was 206,745. Column power is an output of the R-scape software that is proportional to the statistical power of substitutions. Mutations in coordinates in black bold are shown Fig 3. Base pairs within the Core interacting region (Fig 2) are shown in cells shaded red.
Fig 3Consensus structure of the predicted Spike-ORF8 RNA-RNA interaction.
RNA-RNA interaction coordinates were (23660–23703 Spike) and (28025–28060 ORF8). Total number of sequences was 206,745. Number of mutations observed for four locations with highest power are shown. The Core interacting region is shown by the transparent red rectangle.