| Literature DB >> 35328676 |
Yu-Chieh Liao1, Feng-Jui Chen2,3, Min-Chieh Chuang4, Han-Chieh Wu2, Wan-Chen Ji4, Guann-Yi Yu2, Tsi-Shu Huang5.
Abstract
For tiling of the SARS-CoV-2 genome, the ARTIC Network provided a V4 protocol using 99 pairs of primers for amplicon production and is currently the widely used amplicon-based approach. However, this technique has regions of low sequence coverage and is labour-, time-, and cost-intensive. Moreover, it requires 14 pairs of primers in two separate PCRs to obtain spike gene sequences. To overcome these disadvantages, we proposed a single PCR to efficiently detect spike gene mutations. We proposed a bioinformatic protocol that can process FASTQ reads into spike gene consensus sequences to accurately call spike protein variants from sequenced samples or to fairly express the cases of missing amplicons. We evaluated the in silico detection rate of primer sets that yield amplicon sizes of 400, 1200, and 2500 bp for spike gene sequencing of SARS-CoV-2 to be 59.49, 76.19, and 92.20%, respectively. The in silico detection rate of our proposed single PCR primers was 97.07%. We demonstrated the robustness of our analytical protocol against 3000 Oxford Nanopore sequencing runs of distinct datasets, thus ensuring high-integrity sequencing of spike genes for variant SARS-CoV-2 determination. Our protocol works well with the data yielded from versatile primer designs, making it easy to determine spike protein variants.Entities:
Keywords: SARS-CoV-2; nanopore sequencing; spike gene; variant
Mesh:
Substances:
Year: 2022 PMID: 35328676 PMCID: PMC8954144 DOI: 10.3390/ijms23063257
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Basic descriptions and primer evaluations of amplicon tiling methods for SARS-CoV-2 genomes.
| Name | ARTIC nCoV-2019 V4, | SARS-Cov-2-Midnight, | 14 × 2.5 kb Amplicons, |
|---|---|---|---|
| Amplicon length | 400 bp | 1200 bp | 2500 bp |
| Primer pairs | 99 | 29 | 14 |
| Protocol/Reference | [ | ||
| Primer sequences | [ | ||
| Projects in SRA | PRJEB37886 [ | PRJNA645718 [ | PRJNA675364 [ |
| Percentage of genome containing all primer pairs (%) | V4: 9.58; V4.1: 14.53 (spanning from 344 to 29,512) | 28.30 (spanning from 1128 to 29,790) | 58.53 (spanning from 1897 to 28,145) |
| Primer covering spike gene | 71–84 | 22–25 | A6 and B6 |
| Percentage of genome containing spike gene primers (%) | V4: 57.87 and V4.1: 59.49 (spanning from 21,316 to 25,438) | 76.19 (spanning from 21,562 to 25,790) | 92.20 (spanning from 21,386 to 25,646) |
Figure 1In silico evaluation of primer sets across the SARS-CoV-2 genome. (A) Set I: ARTIC nCoV-2019 V4, updated primers (primer pairs 10, 23, 27, 76, 79, 88, 89, and 90 were updated in V4.1) in black lines, (B) set II: SARS-CoV-2-Midnight, and (C) set III: 1.4 × 2.5 kb amplicons. An amplicon-wise detection rate corresponding to each primer pair was determined: No. of genome containing the sequences of primer pair/Total number of genomes 848,003 SARS-CoV-2 genome sequences downloaded from NCBI on 14 February 2022.
Figure 2Coordinates of primer sets and the mutation rate of spike protein based on the hCov19 Mutation Dashboard on 12 February 2022. Green, blue, red, and black horizontal lines represent primer set I, set II, set III, and our primer set (set IV) covering the spike gene of SARS-CoV-2, respectively.
Figure 3Sample count profiles with sequencing depth <25 across the spike gene. A total of 3000 Nanopore sequencing runs of SARS-CoV-2 samples downloaded from Sequence Read Archive (SRA). One thousand sequencing runs of (A) set I with AvgSpotLength < 600 bp; (B) set II, 600–1600 bp; and (C) set III, ≥1600 bp.