| Literature DB >> 32511381 |
Ryan J Andrews1, Jake M Peterson1, Hafeez S Haniff2, Jonathan Chen2, Christopher Williams2, Maison Grefe2, Matthew D Disney2, Walter N Moss1.
Abstract
SARS-CoV-2 is a positive-sense single-stranded RNA virus that has exploded throughout the global human population. This pandemic coronavirus strain has taken scientists and public health researchers by surprise and knowledge of its basic biology (e.g. structure/function relationships in its genomic, messenger and template RNAs) and modes for therapeutic intervention lag behind that of other human pathogens. In this report we used a recently-developed bioinformatics approach, ScanFold, to deduce the RNA structural landscape of the SARS-CoV-2 transcriptome. We recapitulate known elements of RNA structure and provide a model for the folding of an essential frameshift signal. Our results find that the SARS-CoV-2 is greatly enriched in unusually stable and likely evolutionarily ordered RNA structure, which provides a huge reservoir of potential drug targets for RNA-binding small molecules. Our results also predict regions that are accessible for intermolecular interactions, which can aid in the design of antisense therapeutics. All results are made available via a public database (the RNAStructuromeDB) where they may hopefully drive drug discovery efforts to inhibit SARS-CoV-2 pathogenesis.Entities:
Year: 2020 PMID: 32511381 PMCID: PMC7263510 DOI: 10.1101/2020.04.17.045161
Source DB: PubMed Journal: bioRxiv
Figure 1.Overview of RNA folding analysis of the SARS-CoV-2 genome. a) Genomic features and coordinates are labeled at the top and annotated based on reference sequence NC_045512.2. The FSE element has been highlighted to show its location within two overlapping coding sequences in ORF1. The results of a ScanFold-Scan analysis (Methods) are shown below, mapped to the corresponding regions of the genome. The MFE and z-score are depicted using a 120 nt moving averages of values, raw values can be seen in Table S1. b) Overall distribution of raw z-score values calculated across the genome are shown alongside two other positive strand RNA genomes, ZIKV and HIV-1, which were analyzed using the same parameters as SARS-CoV-2. c) Generic model of the first four stem loops found in the 5′ UTR of Betacoronavirus genomes[7,18](based on the Rfam entry for SL1-2; RF02910). d) Generic model showing the general architecture of the frameshift element (FSE) found in the similar coronavirus genomes[11] (based on Rfam entry RF00507).
Figure 2.Full analysis of the 5′ UTR of SARS-CoV-2. a) The results of the full ScanFold pipeline are shown. ScanFold metrics and base pairs have been loaded into the IGV desktop browser[36]. Metric type and ranges are shown on the left side of the panel (metric descriptions can be found in methods). Here the start codon has been highlighted with a green bar and structures which correspond to previously named elements have been annotated. b) ScanFold RNA 2D structures are shown for the 5′UTR. All base pairs shown are consistent between SARS-CoV and SARS-CoV-2, and nucleotide variations which are present within structures have been highlighted with green circles. Structures have been visualized here using VARNA[37].
Figure 3.Full analysis of the FSE region from SARS-CoV-2. a) The results of the full ScanFold pipeline are shown. ScanFold metrics and base pairs have been loaded into the IGV desktop browser[36]. Metric type and ranges are listed on the left side of the panel (metric descriptions can be found in the Methods section). Arc diagrams for ScanFold model base pairs are colored based on their average z-score (blue for <= −2, green for <= −1, and yellow for <= 0). b) The ScanFold RNA 2D structures for the full FSE region are shown. All differences between the SARS-CoV and SARS-CoV-2 primary structure are reported directly next to the corresponding nucleotide of SARS-CoV-2 (with the corresponding SARS-CoV nucleotide in a red circle). Structures have been visualized here using VARNA[37].
Figure 4.ScanFold results of the region encompassing the Ecds. a) The full ScanFold-Scan metrics are shown. b) ScanFold-Fold predicted model for the region is shown here. Nucleotides here are colored according to their average z-score from Table S2. The TRS nucleotides have been highlighted and labeled with a dark blue outline, and the start and stop codons have been highlighted and labeled green and red respectively. Differences between SARS-CoV and SARS-CoV-2 have been annotated as described in Figures 2 and 3.