| Literature DB >> 36232353 |
Matthew Dukeshire1, David Schaeper1, Pravina Venkatesan1, Amirhossein Manzourolajdad1,2.
Abstract
Since the start of the COVID-19 pandemic, understanding the pathology of the SARS-CoV-2 RNA virus and its life cycle has been the priority of many researchers. Currently, new variants of the virus have emerged with various levels of pathogenicity and abundance within the human-host population. Although much of viral pathogenicity is attributed to the viral Spike protein's binding affinity to human lung cells' ACE2 receptor, comprehensive knowledge on the distinctive features of viral variants that might affect their life cycle and pathogenicity is yet to be attained. Recent in vivo studies into the RNA structure of the SARS-CoV-2 genome have revealed certain long-range RNA-RNA interactions. Using in silico predictions and a large population of SARS-CoV-2 sequences, we observed variant-specific evolutionary changes for certain long-range RRIs. We also found statistical evidence for the existence of one of the thermodynamic-based RRI predictions, namely Comp1, in the Beta variant sequences. A similar test that disregarded sequence variant information did not, however, lead to significant results. When performing population-based analyses, aggregate tests may fail to identify novel interactions due to variant-specific changes. Variant-specific analyses can result in de novo RRI identification.Entities:
Keywords: COVID-19; RNA structure; RNA-RNA interaction; SARS-CoV-2; compensatory mutations; viral evolution
Mesh:
Substances:
Year: 2022 PMID: 36232353 PMCID: PMC9570297 DOI: 10.3390/ijms231911050
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
List of experimentally reported and computationally predicted long-range RRIs used in the analysis. The first four are reported from a COMRADES experiment, the last four were predicted using IntaRNA, both with and without SHAPE reactivities as a constraint.
| NAME | 5′ START | 5′ END | 5′ GENE | 3′START | 3′END | 3′ GENE | ENERGY | # NEARBY LOCAL STRUCTURES/ |
|---|---|---|---|---|---|---|---|---|
| EXP1 | 15,424 | 15,433 | orf1ab | 17,442 | 17,452 | orf1ab | −7.4 | 6 |
| EXP2 | 18,541 | 18,554 | orf1ab | 21,107 | 21,119 | orf1ab | −12.3 | 1 |
| EXP3 | 2070 | 2081 | orf1a | 5688 | 5699 | orf1a | −20.5 | 4 |
| EXP4 | 5652 | 5661 | orf1a | 9076 | 9084 | orf1a | −11 | 2 |
| COMP1 | 1394 | 1411 | orf1a | 10,951 | 10,968 | orf1a | −21.55 | −6.94 |
| COMP2 | 11,912 | 11,950 | orf1a | 18,905 | 18,944 | orf1ab | −21.88 | −8.97 |
| COMP3 | 3047 | 3073 | orf1a | 28,809 | 28,836 | N | −20.67 | −8.74 |
| COMP4 | 3177 | 3194 | orf1a | 25,546 | 25,563 | orf3a | −20 | −8.89 |
| FSE | 13,462 | - | orf1a | - | 13,545 | orf1ab | −27.2 | FUNCTIONAL IMPORTANCE |
Dates of sequences from each variant as well as calculated background mutation rate. Previous reports are 2.38E-06 per base per day.
| Variant | Oldest Sequence | Most Recent Sequence | Background/Genome-Wide Mutation Rate (per Base, per Day) |
|---|---|---|---|
|
| 8 November 2021 | 17 May 2022 | 2.51 × 10−6 |
|
| 22 September 2020 | 15 August 2021 | 1.96 × 10−6 |
|
| 18 February 2020 | 5 November 2021 | 2.03 × 10−6 |
|
| 1 December 2020 | 20 April 2021 | 2.43 × 10−6 |
Pairwise distances between variants (mean number of differing bases for all sequences). Calculated from the GISAID data using the MEGA software [32]. Number of sequences: Omicron-8457, Delta-10000, Beta-4237, Alpha-10000.
| Pairwise Distances | ||||
|---|---|---|---|---|
|
| ||||
|
| 86.88513 | |||
|
| 79.98906 | 56.01551 | ||
|
| 78.80249 | 60.95150 | 49.47034 | |
|
|
|
|
| |
Analysis of 8 RRI candidates across 32,714 SARS-CoV-2 genome sequences. Mutations per sequence were calculated and normalized by interacting sequence length.
| Name | Mutations per Base per Sequence | Relative Variation | Ratio of Compensatory Mutations |
|---|---|---|---|
| Exp1 | 4.29 × 105 | 0.016 | 0.654 |
| Exp2 | 3.96 × 104 | 1.209 | 0.947 |
| Exp3 | 1.28 × 104 | 0.307 | 0.440 |
| Exp4 | 6.31 × 105 | 0.059 | 0.725 |
| Comp1 | 1.51 × 104 | 1.510 | 0.924 |
| Comp2 | 1.08 × 104 | 1.440 | 0.521 |
| Comp3 | 9.36 × 105 | 0.009 | 0.315 |
| Comp4 | 7.62 × 103 | 2.203 | 0.125 |
| FSE | 1.35 × 104 | 1.383 | 0.882 |
Figure 1Survival Plots of two of the 8 RRI candidate regions. Each plot has two lines: Black indicates a null distribution; Blue is all base pairs in the interaction. Exp4 is shown on Panel (a), with Comp2 on Panel (b). Significance cutoff used was 0.05.
Variant-specific analysis of the 8 RRI candidates. Relative variation is normalized by the number of sequences (value of 1 indicates an equal per base variation in the surrounding region, 100nt in each direction). E-value reports the significance value of the R-scape covariation analysis, with the level of power categorized as low if it is reported to be <10% (power is fraction of base pairs expected to covary). Dispersion represents the standard deviation of % compensatory mutations for all variants.
| Name | Variant | Variation Relative to Context | Ratio of Compensatory Mutations | E-Value of Significantly Covarying Base Pairs (Low/High Power) | Dispersion |
|---|---|---|---|---|---|
| Exp1 | Alpha | 0.409 | 0.500 | NA (low) | 0.208 |
| Beta | 0.935 | 0.571 | NA (low) | ||
| Delta | 0.011 | 0.833 | NA (low) | ||
| Omicron | 0.006 | 0.333 | NA (high) | ||
| Exp2 | Alpha | 0.180 | 0.684 | NA (low) | 0.140 |
| Beta | 3.916 | 0.993 | NA (low) | ||
| Delta | 1.919 | 0.943 | NA (low) | ||
| Omicron | 0.281 | 0.800 | NA (high) | ||
| Exp3 | Alpha | 0.141 | 0.714 | NA (low) | 0.200 |
| Beta | 1.070 | 0.250 | NA (low) | ||
| Delta | 0.499 | 0.377 | NA (low) | ||
| Omicron | 0.160 | 0.545 | NA (high) | ||
|
| Alpha | 0.344 | 0.600 | NA (low) |
|
|
|
|
|
| ||
| Delta | 0.025 | 0.750 | NA (high) | ||
| Omicron | 0.765 | 0.846 | NA (high) | ||
|
| Alpha | 0.277 | 0.714 | NA (high) |
|
|
|
|
|
| ||
| Delta | 2.377 | 0.978 | NA (low) | ||
| Omicron | 0.667 | 0.357 | NA (high) | ||
| Comp2 | Alpha | 0.673 | 0.722 | NA (low) | 0.229 |
| Beta | 0.505 | 0.909 | NA (low) | ||
| Delta | 1.982 | 0.360 | NA (low) | ||
| Omicron | 1.932 | 0.708 | NA (low) | ||
| Comp3 | Alpha | 0.007 | 0.208 | NA (low) | 0.157 |
| Beta | 0.016 | 0.400 | NA (low) | ||
| Delta | 0.016 | 0.309 | NA (low) | ||
| Omicron | 0.004 | 0.577 | NA (high) | ||
| Comp4 | Alpha | 3.307 | 0.898 | NA (low) | 0.357 |
| Beta | 6.028 | 0.033 | NA (low) | ||
| Delta | 1.329 | 0.500 | NA (low) | ||
| Omicron | 0.051 | 0.424 | NA (low) | ||
| FSE | Alpha | 1.626 | 0.968 | NA (low) | 0.208 |
| Beta | 0.668 | 0.571 | NA (low) | ||
| Delta | 0.982 | 0.887 | NA (low) | ||
| Omicron | 1.383 | 0.571 | NA (low) |
Figure 2Survival plots from the variant-specific covariation analysis. Displayed are the most significant results. Beta showed significant covariation for Exp4 panel (a) and Comp1 panel (b). Significance cutoff E-value used was 0.05.
List of computationally predicted RRIs with SHAPE data used in the analysis.
| NAME | 5′ START | 5′ END | 5′ GENE | 3′START | 3′END | 3′ GENE | ENERGY | RESIDUAL |
|---|---|---|---|---|---|---|---|---|
| SHAPE1 | 15,893 | 15,970 | orf1ab | 28,383 | 28,454 | N | −25.77 | −8.40 |
| SHAPE2 | 20,068 | 20,193 | orf1ab | 22,327 | 22,433 | S | −27.22 | −8.32 |
| SHAPE3 | 9077 | 9222 | orf1a | 12,748 | 12,876 | orf1a | −27.86 | −8.19 |
| SHAPE4 | 5367 | 5402 | orf1a | 24,114 | 24,157 | S | −24.21 | −8.14 |
Figure 3Survival plots from the variant-specific covariation analysis. Displayed are the most significant results. Omicron Shape1 is shown in panel (a) Delta Shape 3 is shown in panel (b). Significance cutoff E-value used was 0.05.