| Literature DB >> 34834915 |
João H C Campos1, Juliana T Maricato2, Carla T Braconi2, Fernando Antoneli1, Luiz Mario R Janini2, Marcelo R S Briones1.
Abstract
The causative agent of COVID-19 pandemic, SARS-CoV-2, has a 29,903 bases positive-sense single-stranded RNA genome. RNAs exhibit about 150 modified bases that are essential for proper function. Among internal modified bases, the N6-methyladenosine, or m6A, is the most frequent, and is implicated in SARS-CoV-2 immune response evasion. Although the SARS-CoV-2 genome is RNA, almost all genomes sequenced thus far are, in fact, reverse transcribed complementary DNAs. This process reduces the true complexity of these viral genomes because the incorporation of dNTPs hides RNA base modifications. Here, we present an initial exploration of Nanopore direct RNA sequencing to assess the m6A residues in the SARS-CoV-2 sequences of ORF3a, E, M, ORF6, ORF7a, ORF7b, ORF8, N, ORF10 and the 3'-untranslated region. We identified fifteen m6A methylated positions, of which, six are in ORF N. Additionally, because m6A is associated with the DRACH motif, we compared its distribution in major SARS-CoV-2 variants. Although DRACH is highly conserved among variants, we show that variants Beta and Eta have a fourth position C > U change in DRACH at 28,884b that could affect methylation. This is the first report of direct RNA sequencing of a Brazilian SARS-CoV-2 sample coupled with the identification of modified bases.Entities:
Keywords: COVID-19; Epitranscriptomics; RNA methylation; SARS-CoV-2; direct RNA sequencing; m6A
Mesh:
Substances:
Year: 2021 PMID: 34834915 PMCID: PMC8620083 DOI: 10.3390/v13112108
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Assembly of Nanopore direct RNA reads to the reference sequence (A) and map of m6A methylation probability along the SARS-CoV-2 RNA (B). In (A), green horizontal bars indicate the ORFs, blue horizontal bars of decreasing size indicate the Nanopore reads and the red area at the top indicates the log scale coverage from 1× to 1600×. In (B), blue vertical bars indicate the DRACH motifs, red vertical bars indicate m6A (>50% probability) and the yellow vertical bars indicate two potential m6A with probabilities 0.38 and 0.44 in the 3′-untranslated region. The G + C content is indicated in the plot just above the annotation.
Distribution and sequencing coverage of potential methylated adenosines in SARS-CoV-2 RNA genome. Coverage > 100× is underlined and probability > 80 is in boldface as calculated by m6Anet [34]. After m6Anet analysis, only sites with coverage above 60 were considered. Position numbering according to Wuhan reference sequence (GenBank NC_045512). In Coverage and Probability columns, the results for two technical replicates are shown. The first numbers on these columns represent the experiment shown in Figure 1.
| # | RNA Id—ORF | Position | Coverage | Probability |
|---|---|---|---|---|
| 1 | EPI_ISL_413016—3a | 25,935 | 65/75 | 0.5472/0.2749 |
| 2 | EPI_ISL_413016—3a | 25,940 | 67/61 | 0.5094/0.5865 |
| 3 | EPI_ISL_413016—3a | 26,070 | 72/83 | 0.6659/0.2497 |
| 4 | EPI_ISL_413016—3a/E | 26,241 | 83/100 | 0.5480/0.3461 |
| 5 | EPI_ISL_413016—M | 26,933 | 162/295 | 0.5102/ |
| 6 | EPI_ISL_413016—7a | 27,562 | 266/679 | |
| 7 | EPI_ISL_413016—7b | 27,764 | 309/849 | |
| 8 | EPI_ISL_413016—7b | 27,854 | 318/839 | |
| 9 | EPI_ISL_413016—7b/8 | 27,892 | 357/859 | 0.5770/0.4420 |
| 10 | EPI_ISL_413016—N | 28,616 | 699/812 | |
| 11 | EPI_ISL_413016—N | 28,633 | 314/364 | 0.5022/0.4134 |
| 12 | EPI_ISL_413016—N | 28,766 | 794/817 | 0.5403/0.4098 |
| 13 | EPI_ISL_413016—N | 28,886 | 871/835 | 0.7020/0.2212 |
| 14 | EPI_ISL_413016—N | 29,450 | 922/803 | 0.5523/0.2701 |
| 15 | EPI_ISL_413016—N | 29,517 | 887/783 | 0.5705/0.3598 |
Figure 2Variability of DRACH sequences among SARS-CoV-2 variants. In (A), the C > U change at 28,886 disrupts the m6A (marked red) DRACH motif (D1) in five variant Beta sequences and four variant Eta sequences, while a single variant Zeta has a C > U mutation that disrupts the DRACH motif (D2). In (B), a four bases insertion in the intergenic region ORF8/N does not disrupt DRACH (D3), while (D4) shows a fully conserved DRACH.