| Literature DB >> 33225958 |
Wai Sing Chan1, Chun Hang Au1, Ho Yin Lam1, Candy Ling Na Wang1, Dona Ngar-Yin Ho1, Yuk Man Lam1, Daniel Ka Wing Chu2, Leo Lit Man Poon2, Tsun Leung Chan1, Jonpaul Sze-Tsing Zee1, Edmond Shiu Kwan Ma1, Bone Siu Fai Tang3.
Abstract
<span class="Disease">Coronavirus disease 2019 (<span class="Disease">COVID-19) pandemic has been a catastrophic burden to global healthcare systems. The fast spread of the etiologic agent, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), highlights the need to identify unknown coronaviruses rapidly for prompt clinical and public health decision making. Moreover, owing to the high mutation rate of RNA viruses, periodic surveillance on emerging variants of key virus components is essential for evaluating the efficacy of antiviral drugs, diagnostic assays and vaccines. These 2 knowledge gaps formed the basis of this study. In the first place, we evaluated the feasibility of characterizing coronaviruses directly from respiratory specimens. We amplified partial RdRP gene, a stable genetic marker of coronaviruses, from a collection of 57 clinical specimens positive for SARS-CoV-2 or other human coronaviruses, and sequenced the amplicons with Nanopore Flongle and MinION, the fastest and the most scalable massively-parallel sequencing platforms to-date. Partial RdRP sequences were successfully amplified and sequenced from 82.46% (47/57) of specimens, ranging from 75 to 100% by virus type, with consensus accuracy of 100% compared with Sanger sequences available (n = 40). In the second part, we further compared 19 SARS-CoV-2 RdRP sequences collected from the first to third waves of COVID-19 outbreak in Hong Kong with 22,173 genomes from GISAID EpiCoV™ database. No single nucleotide variants (SNVs) were found in our sequences, and 125 SNVs were observed from global data, with 56.8% being low-frequency (n = 1-47) missense mutations affecting the rear part of RNA polymerase. Among the 9 SNVs found on 4 conserved domains, the frequency of 15438G > T was highest (n = 34) and was predominantly found in Europe. Our data provided a glimpse into the sequence diversity of a primary antiviral drug and diagnostic target. Further studies are warranted to investigate the significance of these mutations.Entities:
Keywords: COVID-19; Coronavirus; Flongle; MinION; Missense mutation; Nanopore; RdRP; SARS-CoV-2
Mesh:
Substances:
Year: 2020 PMID: 33225958 PMCID: PMC7681180 DOI: 10.1186/s12985-020-01454-3
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Fig. 1An overview of this study. The flowchart summarizes the workflow of this study. First, we assessed the feasibility of amplifying and sequencing partial RdRP gene directly from a collection of 61 clinical specimens. Second, we performed sequence analysis on 19 SARS-CoV-2 RdRP sequences from our study and 22,173 GISAID genomes, using SARS-CoV-2 reference genome Wuhan-Hu-1 (NC_045512.2:15309-15702) as reference
Results of routine RT-PCR assays and Nanopore sequencing
| Specimen types | Routine RT-PCR test results | Nanopore sequencing results | |||||
|---|---|---|---|---|---|---|---|
| HCoV | Ct value¶ | Flow cell type | Run time | Consensus sequence length | Consensus accuracy | ||
| 1* | VC | SARS-CoV-2 | N: 29.65 orf1b: ND | Flongle | 23 m | 394 bp (268) | 100% (394 bp) |
| 2* | VC | SARS-CoV-2 | N: 35 orf1b: ND | Flongle | 32 m | 394 bp (99) | 100% (394 bp) |
| 3* | NPS-TS | SARS-CoV-2 | N: 29.8 orf1b: 30.42 | Flongle | 29 m | 394 bp (248) | 100% (297 bp) |
| 4* | SP | SARS-CoV-2 | N: 35 orf1b: 35 | Sequencing was not performed due to absence of PCR target band | |||
| 5# | pOS | SARS-CoV-2 | N: 28.8 orf1b: 28.66 | Flongle | 22 m | 394 bp (895) | 100% (394 bp) |
| 6# | pOS | SARS-CoV-2 | N: 28.1 orf1b: 28.41 | Flongle | 16 m | 394 bp (932) | 100% (394 bp) |
| 7# | pOS | SARS-CoV-2 | N: 32.96 orf1b: 33.94 | MinION | 3 h 18 m | 394 bp (36) | 100%‡ |
| 8# | pOS | SARS-CoV-2 | N: 33.5 orf1b: 33.5 | Sequencing was not performed due to absence of PCR target band | |||
| 9# | pOS | SARS-CoV-2 | N: > 35 orf1b: > 35 | Sequencing was not performed due to absence of PCR target band | |||
| 10# | pOS | SARS-CoV-2 | N: > 35 orf1b: > 35 | Sequencing was not performed due to absence of PCR target band | |||
| 11# | pOS | SARS-CoV-2 | N: > 35 orf1b: > 35 | MinION | Not detected | ||
| 12† | pOS | SARS-CoV-2 | N: 14.5 orf1b: 15.48 | MinION | 17 m | 394 bp (3420) | 100% (394 bp) |
| 13† | pOS | SARS-CoV-2 | N: 15.6 orf1b: 16.67 | MinION | 31 m | 394 bp (3036) | 100% (394 bp) |
| 14† | pOS | SARS-CoV-2 | N: 19.2 orf1b: 19.92 | MinION | 31 m | 394 bp (3449) | 100% (394 bp) |
| 15† | pOS | SARS-CoV-2 | N: 19.52 orf1b: 21.2 | MinION | 22 m | 394 bp (2176) | 100% (394 bp) |
| 16† | pOS | SARS-CoV-2 | N: 21.1 orf1b: 21.76 | MinION | 23 m | 394 bp (2617) | 100% (394 bp) |
| 17† | pOS | SARS-CoV-2 | N: 22.1 orf1b: 23.27 | MinION | 17 m | 394 bp (2484) | 100% (394 bp) |
| 18† | pOS | SARS-CoV-2 | N: 22.6 orf1b: 22.77 | MinION | 7 m | 394 bp (99) | 100%‡ |
| 19† | pOS | SARS-CoV-2 | N: 22.67 orf1b: 22.86 | MinION | 16 m | 394 bp (3826) | 100% (394 bp) |
| 20† | pOS | SARS-CoV-2 | N: 24.23 orf1b: 24.81 | MinION | 43 m | 394 bp (58) | 100%‡ |
| 21† | pOS | SARS-CoV-2 | N: 24.5 orf1b: 25.19 | MinION | 31 m | 394 bp (126) | 100%‡ |
| 22† | pOS | SARS-CoV-2 | N: 24.7 orf1b: 24.9 | MinION | 17 m | 394 bp (2362) | 100% (394 bp) |
| 23† | pOS | SARS-CoV-2 | N: 24.94 orf1b: 26.07 | MinION | 34 m | 394 bp (54) | 100%‡ |
| 24† | pOS | SARS-CoV-2 | N: 27.76 orf1b: 28.9 | MinION | 1 h | 394 bp (1)§ | 97.21%‡§ |
| 25† | pOS | SARS-CoV-2 | N: 30.47 orf1b: 30.6 | MinION | 33 m | 394 bp (1846) | 100% (326 bp) |
| 26† | pOS | SARS-CoV-2 | N: 31.26 orf1b: 31.61 | MinION | 1 h | 394 bp (2)§ | 99.24%‡§ |
| 27† | pOS | SARS-CoV-2 | N: 31.68 orf1b: 32.23 | MinION | Not detected | ||
| 28† | pOS | SARS-CoV-2 | N: 35 orf1b: 35 | MinION | Not detected | ||
| 29 | NPS | HKU1 | N/A | MinION | Not detected | ||
| 30 | NPS | HKU1 | N/A | MinION | 52 m | 394 bp (3804) | 100% (394 bp) |
| 31 | NPS | HKU1 | N/A | MinION | 5 m | 394 bp (3572) | 100% (394 bp) |
| 32 | NPS | HKU1 | N/A | MinION | 6 m | 394 bp (3877) | 100% (394 bp) |
| 33 | NPA | 229E | N/A | MinION | 53 m | 394 bp (4045) | 100% (394 bp) |
| 34 | NPS | 229E | N/A | MinION | 51 m | 394 bp (3908) | 100% (394 bp) |
| 35 | NPS | 229E | N/A | MinION | 40 m | 394 bp (4044) | 100% (394 bp) |
| 36 | NPS | 229E | N/A | MinION | 1 h 27 m | 394 bp (3882) | 100% (394 bp) |
| 37 | NPS | 229E | N/A | MinION | 53 m | 394 bp (4076) | 100% (394 bp) |
| 38 | NPS | 229E | N/A | MinION | 46 m | 394 bp (3971) | 100% (394 bp) |
| 39 | NPS | 229E | N/A | MinION | 36 m | 394 bp (4062) | 100% (394 bp) |
| 40 | NPS | OC43 | N/A | MinION | 52 m | 394 bp (3556) | 100% (394 bp) |
| 41 | TS | OC43 | N/A | Sequencing was not performed due to absence of PCR target band | |||
| 42 | NPS | OC43 | N/A | MinION | 56 m | 394 bp (4126) | 100% (394 bp) |
| 43 | TS | OC43 | N/A | MinION | 1 h 8 m | 394 bp (3977) | 100% (394 bp) |
| 44 | NS | OC43 | N/A | MinION | 2 m | 394 bp (3845) | 100% (394 bp) |
| 45 | NS | OC43 | N/A | MinION | 46 m | 394 bp (4120) | 100% (394 bp) |
| 46 | NPS | OC43 | N/A | MinION | 1 m | 394 bp (85) | 100% (394 bp) |
| 47 | NPS | OC43 | N/A | MinION | 1 h 26 m | 394 bp (1543) | 100% (364 bp) |
| 48 | NPA | OC43 | N/A | MinION | 1 h 14 m | 394 bp (4066) | 100% (394 bp) |
| 49 | NPS | OC43 | N/A | MinION | 44 m | 394 bp (3998) | 100% (394 bp) |
| 50 | NPS | OC43 | N/A | MinION | 43 m | 394 bp (4096) | 100% (394 bp) |
| 51 | NPS | OC43 | N/A | MinION | 40 m | 394 bp (4015) | 100% (394 bp) |
| 52 | NPS | NL63 | N/A | MinION | 54 m | 394 bp (4029) | 100% (394 bp) |
| 53 | NPS | NL63 | N/A | MinION | 35 m | 394 bp (4019) | 100% (394 bp) |
| 54 | NPS | NL63 | N/A | Flongle | 47 m | 394 bp (851) | 100% (298 bp) |
| 55 | NPS | NL63 | N/A | MinION | 37 m | 394 bp (4030) | 100% (394 bp) |
| 56 | NPS | NL63 | N/A | Sequencing was not performed due to absence of PCR target band | |||
| 57 | NPS | NL63 | N/A | MinION | 1 h 11 m | 394 bp (3898) | 100% (394 bp) |
| 58 | SP | Not detected | MinION | Not detected | |||
| 59 | SP | Not detected | MinION | Not detected | |||
| 60 | NPS | Not detected | MinION | Not detected | |||
| 61 | pOS | Not detected | MinION | Not detected | |||
¶N and orf1b correspond to nucleocapsid gene and open reading frame 1b, respectively. The results of BioFire® FilmArray® Respiratory 2 Panel were qualitative and therefore Ct values were not available
*Collected during the first wave of COVID-19 outbreak (January to early March, 2020, n = 4)
#Collected during the second wave of COVID-19 outbreak (mid-March to May, 2020, n = 7)
†Collected during the third wave of COVID-19 outbreak (late June onwards, 2020, n = 17)
‡Sanger sequence was not available due to high level of background noise. The Nanopore read/ consensus sequence was compared to SARS-CoV-2 reference genome (NC_045512.2)
§The number of Nanopore reads was insufficient for generating accurate consensus sequence (< 30×)
Ct threshold cycle, HCoV human coronavirus, N/A not available, ND not done, NPA nasopharyngeal aspirate, NPS nasopharyngeal swab, NS nasal swab, pOS posterior oropharyngeal saliva, RT-PCR reverse transcription polymerase chain reaction, SP sputum, TS throat swab, VC virus culture
Published primers and nested RT-PCR conditions
| Primer sequences | ||
|---|---|---|
| Forward | 5′-GGN TGG GAY TAY CCN AAR TGY GA-3′ | 760 bp amplicon |
| Reverse | 5′-RHG GRT ANG CRT CWA TDG C-3′ | |
| Forward | 5′-GGT TGG GAC TAT CCT AAG TGT GA-3′ | 440 bp amplicon |
| Reverse | 5′-CCA TCA TCA GAT AGA ATC ATC AT-3′ | |
In-house developed primers and RT-PCR protocol
| Primer sequences | ||
|---|---|---|
| Forward 1 | 5′-ATG GGN TGG GAY TAY CC-3′ | ~ 440 bp amplicon |
| Forward 2 | 5′-GGA YTA YCC NAA RTG YGA-3′ | |
| Reverse | 5′-CCA TCA TCA SWN ARN ATS AT-3′ | |
The primers were designed with reference to 56 coronavirus reference genomes
Coronavirus reference genomes considered for designing primers
NC_028752.1, NC_002645.1, NC_005831.2, NC_032107.1, NC_028824.1, NC_009988.1, NC_010437.1, NC_010438.1, NC_028814.1, NC_018871.1, NC_003436.1, NC_009657.1, NC_022103.1, NC_028811.1, NC_028833.1, NC_028806.1, NC_038861.1, NC_002306.3, NC_030292.1, NC_023760.1, NC_034972.1, NC_032730.1, NC_035191.1, NC_038294.1, NC_019843.3, NC_034440.1, NC_009020.1, NC_009019.1, NC_039207.1, NC_004718.3, NC_045512.2, NC_014470.1, NC_025217.1, NC_030886.1, NC_009021.1, NC_003045.1, NC_006213.1, NC_017083.1, NC_026011.1, NC_001846.1, AC_000192.1, NC_012936.1, NC_006577.2, NC_011547.1, NC_011549.1, NC_016993.1, NC_011550.1, NC_039208.1, NC_016992.1, NC_016991.1, NC_016996.1, NC_016994.1, NC_016995.1, NC_001451.1, NC_010800.1, NC_010646.1
Number of GISAID sequences by time, geographical regions and SNV status
| Time | SNV | Africa | America | Asia/Middle East | Europe | Oceania | Subtotal | Total |
|---|---|---|---|---|---|---|---|---|
| Dec 2019 | Absent | 0 | 0 | 17 | 0 | 0 | 17 | 17 |
| Present | 0 | 0 | 0 | 0 | 0 | 0 | ||
| Jan 2020 | Absent | 0 | 18 | 318 | 24 | 9 | 369 | 384 |
| Present | 0 | 0 | 15 | 0 | 0 | 15 (3.91%) | ||
| Feb 2020 | Absent | 1 | 108 | 418 | 122 | 17 | 666 | 688 |
| Present | 0 | 2 | 18 | 1 | 1 | 22 (3.20%) | ||
| Mar 2020 | Absent | 72 | 319 | 660 | 10,816 | 1006 | 12,873 | 13,435 |
| Present | 22 | 63 | 21 | 407 | 49 | 562 (4.18%) | ||
| Apr 2020 | Absent | 27 | 1,803 | 455 | 4233 | 219 | 6,737 | 7082 |
| Present | 30 | 90 | 25 | 194 | 6 | 345 (4.87%) | ||
| May 2020 | Absent | 0 | 118 | 87 | 330 | 15 | 550 | 567 |
| Present | 1 | 2 | 6 | 6 | 2 | 17 (3.00%) | ||
| Subtotal | Absent | 100 | 2366 | 1955 | 15,525 | 1266 | 21,212 | 22,173 |
| Present | 53 | 157 | 85 | 608 | 58 | 961 | ||
| Total | 153 | 2523 | 2,040 | 16,133 | 1324 | Global | ||
| % with SNV | 34.64 | 6.22 | 4.17 | 3.77 | 4.38 | 4.33 | ||
Fig. 2Missense mutations in partial RdRP gene of SARS-CoV-2 from Dec 2019 to May 2020. The number of sequences contributed by different a geographical regions (normalized) and b months are represented by different colours. The top 4 nucleotide positions with SNVs are labelled
Summary of missense mutations on partial gene of SARS-CoV-2 and corresponding amino acid changes. Nucleotide and amino acid positions correspond to reference sequences NC_045512.2 and YP_009725307.1, respectively
| SNVs | Amino acid changes | No. of sequences | Remarks |
|---|---|---|---|
| 15319C > T | P627S | 1 | §End of conserved polymerase motif A (17/17), containing the classic divalent-cation-binding residue D618, related to RdRP fidelity |
| 15327G > A | M629I | 2 | |
| 15327G > C | M629I | 1 | |
| 15327G > T | M629I | 13 | |
| 15328C > T | L630F | 2 | |
| 15338T > C | M633T | 1 | |
| 15346C > A | L636I | 2 | |
| 15346C > T | L636F | 4 | |
| 15349G > C | V637L | 1 | |
| 15352C > T | L638F | 1 | |
| 15356C > T | A639V | 2 | |
| 15358C > A | R640S | 1 | |
| 15359G > A | R640H | 1 | |
| 15364C > A | H642N | 1 | §Zn binding site (6/8) |
| 15368C > T | T643I | 4 | |
| 15371C > T | T644M | 4 | |
| 15380G > T | S647I | 47 | |
| 15384G > T | L648F | 1 | |
| 15392G > A | R651H | 1 | |
| 15406G > A | A656T | 17 | |
| 15406G > T | A656S | 1 | |
| 15407C > T | A656V | 1 | |
| 15412G > T | E658* | 1 | |
| 15418G > T | A660S | 3 | |
| 15438G > T | M666I | 34 | §End of nsp8 interaction site (52/52) |
| 15439G > A | V667I | 5 | |
| 15444G > T | M668I | 3 | |
| 15448G > A | G670S | 1 | |
| 15448G > T | G670C | 1 | |
| 15451G > A | G671S | 2 | |
| 15452G > T | G671V | 2 | |
| 15460T > C | Y674H | 2 | |
| 15463G > T | V675F | 2 | |
| 15472G > T | G678C | 1 | |
| 15488G > T | G683V | 1 | §Conserved polymerase motif B (4/30), related to RdRP fidelity |
| 15491A > G | D684G | 1 | §Conserved polymerase motif B (5/30), related to RdRP fidelity |
| 15535G > T | A699S | 3 | §Conserved polymerase motif B (20/30), related to RdRP fidelity |
| 15538G > A | V700I | 1 | §Conserved polymerase motif B (21/30), related to RdRP fidelity |
| 15539T > C | V700A | 1 | §Conserved polymerase motif B (21/30), related to RdRP fidelity |
| 15553A > G | N705D | 1 | §Conserved polymerase motif B (26/30), related to RdRP fidelity |
| 15571G > T | D711Y | 1 | |
| 15574G > A | G712S | 1 | |
| 15575G > T | G712V | 1 | |
| 15586G > T | A716S | 1 | |
| 15589G > T | D717Y | 1 | |
| 15593A > G | K718R | 2 | |
| 15594G > T | K718N | 1 | |
| 15596A > G | Y719C | 2 | |
| 15598G > T | V720F | 1 | |
| 15602G > A | R721H | 1 | |
| 15613C > T | H725Y | 2 | |
| 15619C > T | L727F | 1 | |
| 15627G > T | E729D | 1 | |
| 15636T > A | Y732* | 1 | |
| 15638G > A | R733K | 1 | |
| 15640A > T | N734Y | 1 | |
| 15641A > C | N734T | 1 | |
| 15647A > G | D736G | 4 | |
| 15652G > T | D738Y | 2 | |
| 15656C > T | T739I | 1 | |
| 15664G > A | V742M | 1 | |
| 15665T > C | V742A | 1 | |
| 15665T > G | V742G | 1 | |
| 15668A > G | N743S | 1 | |
| 15672G > T | E744D | 8 | |
| 15682T > A | Y748N | 1 | |
| 15683A > T | Y748F | 1 | |
| 15685T > A | L749M | 1 | |
| 15688C > T | R750C | 1 | |
| 15689G > A | R750H | 1 | |
| 15696T > G | H752Q | 1 |
§Depicted from NCBI Conserved Domain database [14]
Fig. 3Second PCR amplicon flanking the codons for amino acid 624–754 of SARS-CoV-2 RNA-dependent RNA polymerase (YP_009725307.1). Related conserved domains are as shown (the image was
adopted from NCBI Conserved Domains database [15])