| Literature DB >> 32511373 |
Kevin Wu1,2,3, James Zou1,2, Howard Y Chang3,4.
Abstract
The SARS-CoV-2 coronavirus is driving a global pandemic, but its biological mechanisms are less well understood. SARS-CoV-2 is an RNA virus whose multiple genomic and subgenomic RNA (sgRNA) transcripts hijack the host cell's machinery, located across distinct cytotopic locations. Subcellular localization of its viral RNA could play important roles in viral replication and host antiviral immune response. Here we perform computational modeling of SARS-CoV-2 viral RNA localization across eight subcellular neighborhoods. We compare hundreds of SARS-CoV-2 genomes to the human transcriptome and other coronaviruses and perform systematic sub-sequence analyses to identify the responsible signals. Using state-of-the-art machine learning models, we predict that the SARS-CoV-2 RNA genome and all sgRNAs are enriched in the host mitochondrial matrix and nucleolus. The 5' and 3' viral untranslated regions possess the strongest and most distinct localization signals. We discuss the mitochondrial localization signal in relation to the formation of double-membrane vesicles, a critical stage in the coronavirus life cycle. Our computational analysis serves as a hypothesis generation tool to suggest models for SARS-CoV-2 biology and inform experimental efforts to combat the virus.Entities:
Year: 2020 PMID: 32511373 PMCID: PMC7263502 DOI: 10.1101/2020.04.28.065201
Source DB: PubMed Journal: bioRxiv
Figure 1:Depictions of the SARS-CoV-2 genome (a), the eight compartments that RNA-GPS predicts transcript localization to (b), and the predicted localizations for SARS-CoV-2 sgRNAs (c, d) and its individual 5’/CDS/3’ sequence segments (e).
The SARS-CoV-2 genome produces a series of sub-genomic RNAs (sgRNAs), each encoding one or more genes/proteins (a). These sgRNAs share a common leader 5’ sequence and a common trailing 3’ UTR sequence (arrow blocks). For each sgRNA, RNA-GPS predicts localization to each compartment shown in (b) (figure reproduced from (Wu et al., 2020b)), the results of which are shown in (c). This heatmap shows rank scores, indicating how strongly each sgRNA (rows) localizes to each compartment (columns), compared to a typical endogenous human transcript localizing to that compartment. Colors directly correlate with indicated rank scores. Most sgRNAs exhibit similar localization patterns, with a general enrichment towards the mitochondrial matrix and nucleolus. We also computed these rank scores against a baseline of other coronavirus localization signals in (d). SARS-CoV-2 exhibits a stronger mitochondrial matrix localization signal than most other coronaviruses, along with greater overall nuclear localization, particularly at the nucleolus. Interestingly, localization to the ER membrane is not present as a particularly strong signal in either context. (e) Shows the predicted localization rank scores for shared 5’ and 3’ segments, and an averaged localization rank score for CDS segments. Even on their own, the short ~90–250 base pair 5’ and 3’ segments carry the mitochondrial and nucleolar localization signals.