| Literature DB >> 33060796 |
Jun Li1, Haoqiu Wang2, Lingfeng Mao3, Hua Yu2, Xinfen Yu2, Zhou Sun4, Xin Qian2, Shi Cheng2, Shuchang Chen2, Junfang Chen4, Jingcao Pan2, Jueliang Shi3, Xuchu Wang5,6.
Abstract
The novel SARS-CoV-2 outbreak has swiftly spread worldwide. The rapid genome sequencing of SARS-CoV-2 strains has become a helpful tool for better understanding the genomic characteristics and origin of the virus. To obtain virus whole-genome sequences directly from clinical specimens, we performed nanopore sequencing using a modified ARTIC protocol in a portable nanopore sequencer and validated a routine 8-h workflow and a 5-h rapid pipeline. We conducted some optimization to improve the genome sequencing workflow. The sensitivity of the workflow was also tested by serially diluting RNA from clinical samples. The optimized pipeline was finally applied to obtain the whole genomes of 29 clinical specimens collected in Hangzhou from January to March 2020. In the 29 obtained complete genomes of SARS-CoV-2, 33 variations were identified and analyzed. The genomic variations and phylogenetic analysis hinted at multiple sources and different transmission patterns during the COVID-19 epidemic in Hangzhou, China. In conclusion, the genomic characteristics and origin of the virus can be quickly determined by nanopore sequencing following our workflows.Entities:
Mesh:
Year: 2020 PMID: 33060796 PMCID: PMC7566627 DOI: 10.1038/s41598-020-74656-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Overview of two nanopore sequencing workflows drawn by an online website (https://www.processon.com/). The white boxes represent the series of tasks that are the components of the 8-h routine workflow and the 5-h rapid workflow. The numbers in the colored boxes indicate the time required to complete the tasks.
Figure 2Analysis of the genome sequencing data for the SARS-CoV-2 viruses obtained with Oxford Nanopore Technologies using a MinION sequencer. (A) Trends of the depth data appearing over 16 h of sequencing using a Ligation Sequencing Kit 109. (B) The genome-wide depth of coverage using the Rapid Barcoding Kit 004. (C) Comparative analysis of the average depth and genome-wide coverage between the routine workflow and the rapid workflow. (D–G) Method optimization for nucleic acid extraction (magnetic bead or spin column method) and PCR amplification. (H–J) Repeated nanopore sequencing assays of viral RNA at a tenfold dilution. (K) The depth of genome-wide coverage appearing when the optimized methods were applied to 29 clinical specimens.
Information for 29 Hangzhou SARS-CoV-2 viruses from COVID-19 patients in this study.
| Strain number | Source | Gender | Age | Date of onset | Collection date | History | qRT-PCR (Ct) | GISAID accession number | Average depth | Completeness (%) | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ORF1a/b | N | ||||||||||
| HZCDC0001 | Sputum | Male | 31 | 2020-01-18 | 2020-01-19 | Infected in Wuhan | 26.51 | 27.03 | EPI_ISL_407313 | 754.88 | 100.00 |
| HZCDC0012 | Nasal, oropharyngeal swab | Male | 47 | 2020-01-18 | 2020-01-20 | Spouse of HZCDC0013 | 22.2 | 21.67 | EPI_ISL_421236 | 326.64 | 100.00 |
| HZCDC0013 | Nasal, oropharyngeal swab | Female | 45 | 2020-01-15 | 2020-01-20 | Infected in Wuhan | 22.34 | 21.88 | EPI_ISL_421235 | 418.66 | 100.00 |
| HZCDC0025 | Nasal, oropharyngeal swab | Male | 51 | 2020-01-21 | 2020-01-21 | Infected in Wuhan | 35.42 | 34.83 | EPI_ISL_421234 | 509.46 | 99.99 |
| HZCDC0048 | Nasal, oropharyngeal swab | Male | 35 | 2020-01-16 | 2020-01-21 | Contact with a patient from Wuhan | 27.95 | 27.72 | EPI_ISL_421233 | 506.14 | 100.00 |
| HZCDC0048L | Tracheal aspirate sample | 19.7 | 19.58 | EPI_ISL_421232 | 385.17 | 100.00 | |||||
| HZCDC0049L | Tracheal aspirate sample | Female | 40 | 2020-01-17 | 2020-01-21 | Contact with a patient from Wuhan | 25.11 | 24.64 | EPI_ISL_421231 | 413.09 | 100.00 |
| HZCDC0090 | Nasal, oropharyngeal swab | Female | 34 | 2020-01-17 | 2020-01-21 | Contact with a patient from Wuhan | Neg | Neg | EPI_ISL_421230 | 233.75 | 99.15 |
| HZCDC0090L | Bronchoalveolar-lavage fluid | 30.29 | 29.54 | EPI_ISL_421229 | 555.60 | 100.00 | |||||
| HZCDC0091 | Nasal, oropharyngeal swab | Male | 31 | 2020-01-17 | 2020-01-21 | Contact with a patient from Wuhan | Neg | 37.14 | EPI_ISL_421228 | 264.45 | 99.16 |
| HZCDC0091L | Bronchoalveolar-lavage fluid | 33.09 | 32.95 | EPI_ISL_421227 | 535.46 | 99.97 | |||||
| HZCDC0119 | Nasal, oropharyngeal swab | Female | 41 | 2020-01-21 | 2020-01-22 | Contact with a patient from Wuhan | 26.2 | 25.99 | EPI_ISL_421226 | 464.11 | 100.00 |
| HZCDC0135 | Nasal, oropharyngeal swab | Male | 62 | 2020-01-15 | 2020-01-22 | Infected in Wuhan | 35.5 | 34.79 | EPI_ISL_421225 | 509.26 | 99.95 |
| HZCDC0162 | Nasal, oropharyngeal swab | Male | 46 | 2020-01-21 | 2020-01-23 | – | 35.93 | 34.96 | EPI_ISL_421224 | 488.18 | 99.61 |
| HZCDC0167 | Nasal, oropharyngeal swab | Female | 53 | 2020-01-21 | 2020-01-23 | – | 33.33 | 32.85 | EPI_ISL_421223 | 464.55 | 98.08 |
| HZCDC6111 | Nasal, oropharyngeal swab | Female | 39 | 2020-03-04 | 2020-03-05 | Contact with imported cases | 28.51 | 28.53 | EPI_ISL_482575 | 576.70 | 100.00 |
| HZCDC6224 | Nasal, oropharyngeal swab | Female | 39 | 2020-03-04 | 2020-03-06 | Contact with imported cases | 29.47 | 28.79 | EPI_ISL_482576 | 558.21 | 100.00 |
| HZCDC6228 | Nasal, oropharyngeal swab | Male | 29 | 2020-03-06 | 2020-03-07 | Infected in France | 31.74 | 31.17 | EPI_ISL_482577 | 459.86 | 100.00 |
| HZCDC6445 | Nasal, oropharyngeal swab | Male | 46 | 2020-03-08 | 2020-03-12 | Contact with imported cases | 26.97 | 27.67 | EPI_ISL_482578 | 550.81 | 100.00 |
| HZCDC6706 | Nasal, oropharyngeal swab | Female | 37 | 2020-03-12 | 2020-03-14 | Infected in U.S.A | 26.71 | 28.10 | EPI_ISL_421222 | 507.37 | 100.00 |
| HZCDC6789 | Nasal, oropharyngeal swab | Male | 21 | 2020-03-13 | 2020-03-15 | Infected in U.K | 22.79 | 22.41 | EPI_ISL_421221 | 365.88 | 100.00 |
| HZCDC6948 | Nasal, oropharyngeal swab | Male | 17 | 2020-03-16 | 2020-03-17 | Infected in U.K | 17.79 | 18.29 | EPI_ISL_482579 | 434.33 | 100.00 |
| HZCDC7019 | Nasal, oropharyngeal swab | Female | 43 | 2020-03-17 | 2020-03-18 | Mother of HZCDC6948 | 26.12 | 26.41 | EPI_ISL_482580 | 588.17 | 100.00 |
| HZCDCp145 | Nasal, oropharyngeal swab | Male | 12 | 2020-03-19 | 2020-03-21 | Infected in Switzerland | 33.99 | 36.89 | EPI_ISL_482581 | 577.25 | 100.00 |
| HZCDC7155 | Nasal, oropharyngeal swab | Female | 26 | 2020-03-19 | 2020-03-21 | Infected in Switzerland | 24.44 | 25.82 | EPI_ISL_482582 | 351.02 | 100.00 |
| HZCDC7184F | Feces | Male | 23 | 2020-03-07 | 2020-03-22 | Contact with imported cases | 19.83 | 20.89 | EPI_ISL_482583 | 560.85 | 100.00 |
| HZCDC7328 | Nasal, oropharyngeal swab | Female | 21 | 2020-03-22 | 2020-03-24 | Infected in France | 23.02 | 25.03 | EPI_ISL_482584 | 562.57 | 100.00 |
| HZCDC7363 | Nasal, oropharyngeal swab | Male | 21 | 2020-03-24 | 2020-03-25 | Infected in France | 17.77 | 17.19 | EPI_ISL_482585 | 590.53 | 100.00 |
| HZCDC7378 | Nasal, oropharyngeal swab | Female | 15 | 2020-03-25 | 2020-03-25 | Infected in U.S.A | 29.11 | 28.74 | EPI_ISL_482586 | 541.30 | 100.00 |
Figure 3Phylogenetic analysis of 29 Hangzhou SARS-CoV-2 strains with reference genome sequences. (A) The phylogenetic relationships of the 29 Hangzhou SARS-CoV-2 genomes with 196 reference sequences from GISAID. The genomes collected in January (blue) or March (red) are indicated by solid circles with different colors, and the genetic clusters are differentiated according to which clade they belong to. Signature nucleotide substitutions and amino acid changes are annotated with their positions. (B) The condensed phylogenetic tree with the matrix of variations plotted in a heat map drawn with phyD3 version 1.3 (https://phyd3.bits.vib.be/). The missense and synonymous variants are indicated with dark blue and light blue, respectively. The numbers listed on the side are genome positions according to the reference SARS-CoV-2 genome (GenBank accession number MN908947).
The list of substitutions in all 29 obtained complete genomes of SARS-CoV-2.
| Site | Variations | Variation type | Gene | Amino acid change | Frequency |
|---|---|---|---|---|---|
| 241 | C → T | Intergenic region | 5′UTR | None | 1 (3.45%) |
| 241 | C → T | Intergenic region | 5′UTR | None | 11 (37.93) |
| 1059 | C → T | Missense variant | orf1ab | Thr265Ile | 2 (6.90%) |
| 1917 | C → T | Missense variant | orf1ab | Thr551Ile | 1 (3.45%) |
| 2480 | A → G | Missense variant | orf1ab | Ile739Val | 2 (6.90%) |
| 2558 | C → T | Missense variant | orf1ab | Pro765Ser | 2 (6.90%) |
| 3037 | C → T | Synonymous variant | orf1ab | None | 10 (34.48%) |
| 6187 | C → T | Synonymous variant | orf1ab | None | 1 (3.45%) |
| 6689 | G → T | Missense variant | orf1ab | Ala2142Ser | 1 (3.45%) |
| 9190 | G → T | Synonymous variant | orf1ab | None | 1 (3.45%) |
| 11,083 | G → T | Missense variant | orf1ab | Leu3606Phe | 6 (20.69%) |
| 14,034 | T → C | Missense variant | orf1ab | Met4590Thr | 1 (3.45%) |
| 14,248 | T → C | Synonymous variant | orf1ab | None | 1 (3.45%) |
| 14,408 | C → T | Missense variant | RdRp | Pro323Leu | 8 (27.59) |
| 14,616 | C → T | Missense variant | orf1ab | Ala4784Val | 1 (3.45%) |
| 14,805 | C → T | Missense variant | orf1ab | Thr4847Ile | 3 (10.34%) |
| 15,324 | C → T | Missense variant | orf1ab | Thr5020Ile | 1 (3.45%) |
| 16,468 | C → T | Synonymous variant | orf1ab | None | 1 (3.45%) |
| 17,247 | T → C | Missense variant | orf1ab | Val5661Ala | 1 (3.45%) |
| 19,646 | T → C | Synonymous variant | orf1ab | None | 1 (3.45%) |
| 19,839 | T → C | Missense variant | orf1ab | Ile6525Thr | 1 (3.45%) |
| 21,282 | A → G | Missense variant | orf1ab | Asp7006Gly | 2 (6.90%) |
| 21,850 | G → T | Missense variant | S | Glu96Asp | 1 (3.45%) |
| 23,403 | A → G | Missense variant | S | Asp614Gly | 10 (34.48%) |
| 23,575 | C → T | Synonymous variant | S | None | 5 (17.24%) |
| 24,133 | C → T | Synonymous variant | S | None | 1 (3.45%) |
| 25,563 | G → T | Missense variant | ORF3a | Gln57His | 2 (6.90%) |
| 26,144 | G → T | Missense variant | ORF3a | Gly251Val | 5 (17.24%) |
| 26,939 | A → G | Synonymous variant | M | None | 1 (3.45%) |
| 28,881 | G → A | Missense variant | N | Arg203Lys | 1 (3.45%) |
| 28,882 | G → A | Synonymous variant | N | None | 1 (3.45%) |
| 28,883 | G → C | Missense variant | N | Gly204Arg | 1 (3.45%) |
| 28,977 | C → T | Missense variant | N | Ser235Phe | 1 (3.45%) |