| Literature DB >> 32446559 |
Hiroyasu Wakida1, Kentaro Kawata2, Yuta Yamaji1, Emi Hattori3, Takaho Tsuchiya4, Youichiro Wada2, Haruka Ozaki4, Nobuyoshi Akimitsu5.
Abstract
Most viruses inhibit the innate immune system and/or the RNA degradation processes of host cells to construct an advantageous intracellular environment for their survival. Characteristic RNA sequences within RNA virus genomes or RNAs transcribed from DNA virus genomes contribute toward this inhibition. In this study, we developed a method called "Fate-seq" to comprehensively identify the RNA sequences derived from RNA and DNA viruses, contributing RNA stability in the cells. We examined the stabilization activity of 5,924 RNA fragments derived from 26 different viruses (16 RNA viruses and 10 DNA viruses) using next-generation sequencing of these RNAs fused 3' downstream of GFP reporter RNA. With the Fate-seq approach, we detected multiple virus-derived RNA sequences that stabilized GFP reporter RNA, including sequences derived from severe acute respiratory syndrome-related coronavirus (SARS-CoV). Comparative genomic analysis revealed that these RNA sequences and their predicted secondary structures are highly conserved between SARS-CoV and the novel coronavirus, SARS-CoV-2, which is responsible for the global outbreak of the coronavirus-associated disease that emerged in December 2019 (COVID-19). These sequences have the potential to enhance the stability of viral RNA genomes, thereby augmenting viral replication efficiency and virulence.Entities:
Keywords: COVID-19; Functional sequence; RNA stability; SARS-CoV; SARS-CoV-2; Virus
Mesh:
Substances:
Year: 2020 PMID: 32446559 PMCID: PMC7200376 DOI: 10.1016/j.bbrc.2020.05.008
Source DB: PubMed Journal: Biochem Biophys Res Commun ISSN: 0006-291X Impact factor: 3.575
Fig. 1Schematic illustration of Fate-seq. (A) Design of the viral sequences. The viral sequences of 260 nt in length were generated by sliding the capture region at 100-nt intervals along each viral genome. A total of 26 different viruses were used to generate the viral sequences. (B) Structure of the vector library. The viral sequences were inserted immediately downstream of the EGFP coding sequence (CDS) in a reporter vector harboring the T7 promoter, the EGFP CDS, and a polyA stretch (A60). (C) The mRNA library. In vitro transcription (IVT) of the vector library was used to produce the mRNA library, in which each mRNA contained the EGFP CDS, a viral sequence, as well as a polyA stretch. (D) Transfection and retrieval of the mRNA library. The mRNA library was transfected into HeLa cells using electroporation, and the transfected cells were then divided into two wells. The mRNA retrieved from the cells immediately after the transfection was termed the “0 h sample”. The mRNA retrieved from the cells 6 h after the transfection was termed the “6 h sample”. (E) The MA plot comparing read counts between 0 h and 6 h samples. The x-axis average of log2 read counts in 0 h and 6 h. The y-axis represents log2 ratio of 6 h–0 h. The red and black dots represent viral sequences showing significant increases in 6 h compared to 0 h (adjusted p-value<0.05) or not, respectively. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
Fig. 2Identification of genomic sequences contributing to the stability of the SARS-CoV genome. (A) Fold changes in the CPM values of individual viral sequences. The x-axis indicates the location of individual viral sequences along the SARS-CoV genome from which these viral sequences were derived. The y-axis indicates the fold change (%) in the counts per million mapped read (CPM) values between the RNA samples extracted at 0 h and 6 h after transfection into HeLa cells. Gray dots indicate viral sequences with a CPM >5 and coefficient of variation (CV) < 10% at both 0 h and 6 h after transfection. Red dots indicate viral sequences that displayed significantly increased abundance in the 6 h sample compared with the 0 h sample (adjusted p-value<0.05). (B) Conservation rate of each base along the SARS-CoV genome. The x-axis indicates the locations of individual bases, while the y-axis indicates the conservation rate of each base compared with other viruses within the Coronaviridae. (C and D) Sequence alignment between 16,901 and 17,160 nt (COV001) (C) and 20,901–21,160 nt (COV002) (D) of the SARS-CoV genome with corresponding regions along the SARS-CoV-2 genome. Vertical lines indicate base homology. Red squares indicate mutations critical for the formation of the stem-loops specific to SARS-CoV-2 (further described in Fig. 3). (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
Fig. 3Predicted secondary structure of viral sequences highly conserved between SARS-CoV and SARS-CoV-2. (A) Secondary structure of the COV002 region of SARS-CoV (left) and the corresponding region of the SARS-CoV-2 genome (right). (B) Secondary structure of the COV001 region of SARS-CoV (left) and the corresponding region of the SARS-CoV-2 genome (right). (C) The secondary structure of the COV001 region of SARS-CoV was mutated in silico. The putative stem-loop structure caused by in silico mutation at the positions indicated by the black arrows suggest that these mutations are critical for the formation of a stem-loop similar to that of Stem #1 observed for the corresponding region of SARS-CoV-2. Inferred stem-loops are indicated with red boxes (Stem #1–4), and base coloring was used to indicate the base-pairing probability. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)