| Literature DB >> 34757201 |
Kevin S Kuchinski1, Jason Nguyen2, Tracy D Lee2, Rebecca Hickman2, Agatha N Jassem3, Linda M N Hoang3, Natalie A Prystajecky3, John R Tyson4.
Abstract
Mutations in emerging severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) lineages can interfere with laboratory methods used to generate viral genome sequences for public health surveillance. We identified 20 mutations that are widespread in variant of concern lineages and affect widely used sequencing protocols by the ARTIC network and Freed et al. Three of these mutations disrupted sequencing of P.1 lineage specimens during a recent outbreak in British Columbia, Canada. We provide laboratory validation of protocol modifications that restored sequencing performance. The study findings indicate that genomic sequencing protocols require immediate updating to address emerging mutations. This work also suggests that routine monitoring and protocol updates will be necessary as SARS-CoV-2 continues to evolve. The bioinformatic and laboratory approaches used here provide guidance for this kind of assay maintenance.Entities:
Keywords: Amplicon sequencing; COVID-19; Genomic surveillance; SARS-CoV-2; Variant of concern; Viral genomics
Mesh:
Year: 2021 PMID: 34757201 PMCID: PMC8555373 DOI: 10.1016/j.ijid.2021.10.050
Source DB: PubMed Journal: Int J Infect Dis ISSN: 1201-9712 Impact factor: 12.074
Nucleotide mismatches between SARS-CoV-2 variant of concern lineages and primers from the ARTIC version 3 and Freed et al. protocols. Global sequences from variant of concern lineages were downloaded from GISAID, comprising 8353 B.1.1.7 sequences, 3025 B.1.351 sequences, 9202 P.1 sequences, and 6664 B.1.617+ sequences. All sequences were categorized as complete and high-coverage by GISAID (>29 kb, <1% Ns, <0.05% novel amino acid substitutions, and verified insertion/deletions only). All sequences were submitted before May 22, 2021. Lineage B.1.351, B.1.617+, and P.1+ sequences were collected after April 1, 2021, and lineage B.1.1.7 sequences were collected after May 7, 2021. Sequence details and submitting laboratories, who are gratefully acknowledged for their contributions, are provided in File S1. Primer site variants were identified using the PCR_strainer pipeline with default parameters. Primer site variants were only reported if they were present in at least 15% of their lineage's sequences. A longer table is provided as Table S2, which reports all variants present in at least 1% of genomes within a lineage. Primer site variant sequences are provided in ‘oligo sense’, i.e. the reverse complement of the primer site and the sequence that the perfectly identical primer would have for the targeted location. Lower case bases in the primer site variant sequences indicate mismatches with the assay's original primer sequence.
| Protocol | Lineage | Primer name | Primer site variant sequence (5′ to 3′) | Prevalence within lineage (%) |
|---|---|---|---|---|
| ARTIC v3 | B.1.1.7 | nCoV-2019_93_LEFT | TGAGGCTGGTTCTtAATCACCCA | 47.11 |
| ARTIC v3 | B.1.351 | nCoV-2019_76_LEFT | AGGGCAAACTGGAAAtATTGCT | 97.09 |
| ARTIC v3 | B.1.351 | nCoV-2019_76_LEFT_alt3 | GGGCAAACTGGAAAtATTGCTGA | 97.09 |
| ARTIC v3 | B.1.351 | nCoV-2019_86_LEFT | TtAGGTGATGGCACAACAAGTC | 96.86 |
| ARTIC v3 | B.1.351 | nCoV-2019_74_LEFT | ACATCACTAGGTTTCAAACTTTACaTag | 92.99 |
| ARTIC v3 | B.1.617 | nCoV-2019_93_RIGHT | AGGcCTTCCTTGCCATGTTGAG | 87.03 |
| ARTIC v3 | B.1.617 | nCoV-2019_81_LEFT | GCACTTGGAAAACTTCAAaATGTGG | 85.98 |
| ARTIC v3 | B.1.617 | nCoV-2019_72_RIGHT | gaataaActcCACTTTCCATCCAAC | 82.28 |
| ARTIC v3 | B.1.617 | nCoV-2019_73_LEFT | CAATTTTGTAATGATCCATTTTTGGaTGT | 63.61 |
| ARTIC v3 | B.1.617 | nCoV-2019_64_LEFT | TCGATAGATATCCTGtTAATTCCATTGT | 49.29 |
| ARTIC v3 | P.1 | nCoV-2019_12_RIGHT | TTCACTCTTCATTTCCAAAAAGCTTaA | 99.54 |
| ARTIC v3 | P.1 | nCoV-2019_92_RIGHT | AGGTTtCTGGCAATTAATTGTAAAAGG | 99.41 |
| ARTIC v3 | P.1 | nCoV-2019_73_LEFT | CAATTTTGTAATtATCCATTTTTGGGTGT | 97.14 |
| ARTIC v3 | P.1 | nCoV-2019_76_LEFT_alt3 | GGGCAAACTGGAAcGATTGCTGA | 94.27 |
| ARTIC v3 | P.1 | nCoV-2019_76_LEFT | AGGGCAAACTGGAAcGATTGCT | 94.19 |
| Freed | B.1.617 | SARSCoV_1200_5_LEFT | ACCTACTAAAAAGtCTGGTGGC | 49.32 |
| Freed | B.1.617 | SARSCoV_1200_27_RIGHT | TGTTCGTTTAGGCGTGACAAaT | 49.2 |
| Freed | P.1 | SARSCoV_1200_24_LEFT | GCTGAAtATGTCAACAACTCATATGA | 99.47 |
| Freed | P.1 | SARSCoV_1200_21_RIGHT | GCAGaGGGTAATTGAGTTCTGt | 99.25 |
| Freed | P.1 | SARSCoV_1200_25_LEFT | TGCTGCTAtTAAAATGTCAGAGTGT | 98.87 |
Figure 1(A) Predominant mutations in the P.1 lineage reduced sequencing depth across three amplicons. A total 373 clinical specimens, collected during March and April 2021 in British Columbia, Canada, were sequenced using the Freed et al. primers. Normalized mean depths of coverage are shown for three amplicons affected by mutations in primer sites (the mean depth of coverage across each library's most deeply sequenced amplicon was used for normalization). (B) Amplicon drop-out was corrected by spiking in supplemental primers. Twenty-four clinical specimens were sequenced with non-spiked and spiked primer pools, then changes in normalized mean depth of coverage were calculated for the three affected amplicons. Supplemental primers were spiked in at the same molarity as the original primers. Primer 24_LEFT_P.1 was also assayed at two-times and four-times molarity after one-times molarity did not significantly improve performance. (C) Spiked primer pools had no detrimental effects on non-P1 lineages. A total 373 clinical specimens were sequenced with non-spiked and spiked primer pools containing primers 21_RIGHT_P.1 and 25_LEFT_P.1 at one-times molarity and primer 24_LEFT_P.1 at four-times molarity. Changes in normalized mean depth of coverage were calculated for the three affected amplicons.