Literature DB >> 34757201

Mutations in emerging variant of concern lineages disrupt genomic sequencing of SARS-CoV-2 clinical specimens.

Kevin S Kuchinski1, Jason Nguyen2, Tracy D Lee2, Rebecca Hickman2, Agatha N Jassem3, Linda M N Hoang3, Natalie A Prystajecky3, John R Tyson4.   

Abstract

Mutations in emerging severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) lineages can interfere with laboratory methods used to generate viral genome sequences for public health surveillance. We identified 20 mutations that are widespread in variant of concern lineages and affect widely used sequencing protocols by the ARTIC network and Freed et al. Three of these mutations disrupted sequencing of P.1 lineage specimens during a recent outbreak in British Columbia, Canada. We provide laboratory validation of protocol modifications that restored sequencing performance. The study findings indicate that genomic sequencing protocols require immediate updating to address emerging mutations. This work also suggests that routine monitoring and protocol updates will be necessary as SARS-CoV-2 continues to evolve. The bioinformatic and laboratory approaches used here provide guidance for this kind of assay maintenance.
Copyright © 2021 The Authors. Published by Elsevier Ltd.. All rights reserved.

Entities:  

Keywords:  Amplicon sequencing; COVID-19; Genomic surveillance; SARS-CoV-2; Variant of concern; Viral genomics

Mesh:

Year:  2021        PMID: 34757201      PMCID: PMC8555373          DOI: 10.1016/j.ijid.2021.10.050

Source DB:  PubMed          Journal:  Int J Infect Dis        ISSN: 1201-9712            Impact factor:   12.074


Genomic sequencing of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has played a crucial role in managing the ongoing coronavirus disease 2019 (COVID-19) pandemic. This is especially true for variant of concern (VOC) lineages that have emerged globally since December 2020 (Chand et al., 2020; Cherian et al., 2021; Faria et al., 2021; Tegally et al., 2021a, 2021b; Rambault et al., 2020). Genomic sequencing has been instrumental in detecting and characterizing these lineages, tracking their global spread, and identifying local cases to control transmission. Due to low quantities of viral genomic material in typical clinical specimens, the SARS-CoV-2 genome must be enriched for high-throughput sequencing. This is commonly done by multiplex PCR with numerous primer pairs tiled across the viral genome (Freed et al., 2020; Quick et al., 2017; Tyson et al., 2020). The performance of these primer schemes can be disrupted by mutations arising in emerging viral lineages. These mutations inhibit the amplification of certain amplicons and reduce the amount of sequencing data generated across affected areas of the genome. At best, this necessitates over-sequencing of specimens to compensate for areas with reduced sequencing depth, thereby increasing costs and reducing throughput. At worst, this causes inaccurate nucleotide base calls or gaps in the genome sequence. This undermines public health surveillance by interfering with lineage identification and obscuring genomic linkages between outbreak specimens. A previously published bioinformatic pipeline called PCR_strainer was used to assess whether current VOC lineages contain mutations impacting two widely used sequencing primer schemes (Kuchinski et al., 2020). A total of 27 244 recent, complete, high-quality genome sequences from lineages B.1.1.7, B.1.351, P.1, and B.1.617+ submitted to GISAID from laboratories around the world were analyzed. All primer site variants present in at least 15% of their lineage's genome sequences are reported in Table 1 . Fifteen primer site variants affecting the popular, commercially available ARTIC version 3 protocol (Tyson et al., 2020) were identified. Five mutated primer sites affecting the protocol by Freed et al., favored by some laboratories for its longer amplicon size (Freed et al., 2020), were also identified. The B.1.1.7 lineage had the fewest primer site variants (n = 1), followed by the B.1.351 lineage (n = 4), then the B.1.617+ lineage (n = 7), and the P.1 lineage (n = 8). Many of these primer site variants were predominant globally within their lineage, with 14 of them being present in at least 85% of their lineage's sequences.
Table 1

Nucleotide mismatches between SARS-CoV-2 variant of concern lineages and primers from the ARTIC version 3 and Freed et al. protocols. Global sequences from variant of concern lineages were downloaded from GISAID, comprising 8353 B.1.1.7 sequences, 3025 B.1.351 sequences, 9202 P.1 sequences, and 6664 B.1.617+ sequences. All sequences were categorized as complete and high-coverage by GISAID (>29 kb, <1% Ns, <0.05% novel amino acid substitutions, and verified insertion/deletions only). All sequences were submitted before May 22, 2021. Lineage B.1.351, B.1.617+, and P.1+ sequences were collected after April 1, 2021, and lineage B.1.1.7 sequences were collected after May 7, 2021. Sequence details and submitting laboratories, who are gratefully acknowledged for their contributions, are provided in File S1. Primer site variants were identified using the PCR_strainer pipeline with default parameters. Primer site variants were only reported if they were present in at least 15% of their lineage's sequences. A longer table is provided as Table S2, which reports all variants present in at least 1% of genomes within a lineage. Primer site variant sequences are provided in ‘oligo sense’, i.e. the reverse complement of the primer site and the sequence that the perfectly identical primer would have for the targeted location. Lower case bases in the primer site variant sequences indicate mismatches with the assay's original primer sequence.

ProtocolLineagePrimer namePrimer site variant sequence (5′ to 3′)Prevalence within lineage (%)
ARTIC v3B.1.1.7nCoV-2019_93_LEFTTGAGGCTGGTTCTtAATCACCCA47.11
ARTIC v3B.1.351nCoV-2019_76_LEFTAGGGCAAACTGGAAAtATTGCT97.09
ARTIC v3B.1.351nCoV-2019_76_LEFT_alt3GGGCAAACTGGAAAtATTGCTGA97.09
ARTIC v3B.1.351nCoV-2019_86_LEFTTtAGGTGATGGCACAACAAGTC96.86
ARTIC v3B.1.351nCoV-2019_74_LEFTACATCACTAGGTTTCAAACTTTACaTag92.99
ARTIC v3B.1.617nCoV-2019_93_RIGHTAGGcCTTCCTTGCCATGTTGAG87.03
ARTIC v3B.1.617nCoV-2019_81_LEFTGCACTTGGAAAACTTCAAaATGTGG85.98
ARTIC v3B.1.617nCoV-2019_72_RIGHTgaataaActcCACTTTCCATCCAAC82.28
ARTIC v3B.1.617nCoV-2019_73_LEFTCAATTTTGTAATGATCCATTTTTGGaTGT63.61
ARTIC v3B.1.617nCoV-2019_64_LEFTTCGATAGATATCCTGtTAATTCCATTGT49.29
ARTIC v3P.1nCoV-2019_12_RIGHTTTCACTCTTCATTTCCAAAAAGCTTaA99.54
ARTIC v3P.1nCoV-2019_92_RIGHTAGGTTtCTGGCAATTAATTGTAAAAGG99.41
ARTIC v3P.1nCoV-2019_73_LEFTCAATTTTGTAATtATCCATTTTTGGGTGT97.14
ARTIC v3P.1nCoV-2019_76_LEFT_alt3GGGCAAACTGGAAcGATTGCTGA94.27
ARTIC v3P.1nCoV-2019_76_LEFTAGGGCAAACTGGAAcGATTGCT94.19
FreedB.1.617SARSCoV_1200_5_LEFTACCTACTAAAAAGtCTGGTGGC49.32
FreedB.1.617SARSCoV_1200_27_RIGHTTGTTCGTTTAGGCGTGACAAaT49.2
FreedP.1SARSCoV_1200_24_LEFTGCTGAAtATGTCAACAACTCATATGA99.47
FreedP.1SARSCoV_1200_21_RIGHTGCAGaGGGTAATTGAGTTCTGt99.25
FreedP.1SARSCoV_1200_25_LEFTTGCTGCTAtTAAAATGTCAGAGTGT98.87
Nucleotide mismatches between SARS-CoV-2 variant of concern lineages and primers from the ARTIC version 3 and Freed et al. protocols. Global sequences from variant of concern lineages were downloaded from GISAID, comprising 8353 B.1.1.7 sequences, 3025 B.1.351 sequences, 9202 P.1 sequences, and 6664 B.1.617+ sequences. All sequences were categorized as complete and high-coverage by GISAID (>29 kb, <1% Ns, <0.05% novel amino acid substitutions, and verified insertion/deletions only). All sequences were submitted before May 22, 2021. Lineage B.1.351, B.1.617+, and P.1+ sequences were collected after April 1, 2021, and lineage B.1.1.7 sequences were collected after May 7, 2021. Sequence details and submitting laboratories, who are gratefully acknowledged for their contributions, are provided in File S1. Primer site variants were identified using the PCR_strainer pipeline with default parameters. Primer site variants were only reported if they were present in at least 15% of their lineage's sequences. A longer table is provided as Table S2, which reports all variants present in at least 1% of genomes within a lineage. Primer site variant sequences are provided in ‘oligo sense’, i.e. the reverse complement of the primer site and the sequence that the perfectly identical primer would have for the targeted location. Lower case bases in the primer site variant sequences indicate mismatches with the assay's original primer sequence. The corollary is that some region-specific primer site variants exist within VOC sub-lineages, and public health laboratories should assess their sequencing protocols against locally circulating VOC sequences. This can direct limited laboratory resources towards correcting the most locally relevant primer site variants, which is especially important when global databases are biased by differences in genomics capacity and surveillance priorities between submitting jurisdictions. This kind of assessment was done by the British Columbia Centre for Disease Control Public Health Laboratory. While investigating P.1 lineage outbreaks in March and April of this year, significantly reduced depths of coverage were observed across three amplicons covering parts of the orf1ab and spike genes (Figure 1 A). The affected amplicons were amplicon 21, amplicon 24, and amplicon 25 from the Freed et al. primer scheme, which all had mutated primer sites identified by PCR_strainer in global P.1 lineage sequences (Table 1). Compared to non-P.1 lineage specimens, median depths of coverage across these amplicons were reduced up to 32-fold. Genome sequences from 907 local P.1 lineage specimens were analyzed with PCR_strainer, confirming that the same mutations identified above in primers 21_LEFT, 24_RIGHT, and 25_RIGHT were present in 99.4%, 100%, and 95.8% of local sequences, respectively.
Figure 1

(A) Predominant mutations in the P.1 lineage reduced sequencing depth across three amplicons. A total 373 clinical specimens, collected during March and April 2021 in British Columbia, Canada, were sequenced using the Freed et al. primers. Normalized mean depths of coverage are shown for three amplicons affected by mutations in primer sites (the mean depth of coverage across each library's most deeply sequenced amplicon was used for normalization). (B) Amplicon drop-out was corrected by spiking in supplemental primers. Twenty-four clinical specimens were sequenced with non-spiked and spiked primer pools, then changes in normalized mean depth of coverage were calculated for the three affected amplicons. Supplemental primers were spiked in at the same molarity as the original primers. Primer 24_LEFT_P.1 was also assayed at two-times and four-times molarity after one-times molarity did not significantly improve performance. (C) Spiked primer pools had no detrimental effects on non-P1 lineages. A total 373 clinical specimens were sequenced with non-spiked and spiked primer pools containing primers 21_RIGHT_P.1 and 25_LEFT_P.1 at one-times molarity and primer 24_LEFT_P.1 at four-times molarity. Changes in normalized mean depth of coverage were calculated for the three affected amplicons.

(A) Predominant mutations in the P.1 lineage reduced sequencing depth across three amplicons. A total 373 clinical specimens, collected during March and April 2021 in British Columbia, Canada, were sequenced using the Freed et al. primers. Normalized mean depths of coverage are shown for three amplicons affected by mutations in primer sites (the mean depth of coverage across each library's most deeply sequenced amplicon was used for normalization). (B) Amplicon drop-out was corrected by spiking in supplemental primers. Twenty-four clinical specimens were sequenced with non-spiked and spiked primer pools, then changes in normalized mean depth of coverage were calculated for the three affected amplicons. Supplemental primers were spiked in at the same molarity as the original primers. Primer 24_LEFT_P.1 was also assayed at two-times and four-times molarity after one-times molarity did not significantly improve performance. (C) Spiked primer pools had no detrimental effects on non-P1 lineages. A total 373 clinical specimens were sequenced with non-spiked and spiked primer pools containing primers 21_RIGHT_P.1 and 25_LEFT_P.1 at one-times molarity and primer 24_LEFT_P.1 at four-times molarity. Changes in normalized mean depth of coverage were calculated for the three affected amplicons. Supplemental primers were designed by copying the three mutant primer site sequences identified by PCR_strainer: 5′-GCAGAGGGTAATTGAGTTCTGT-3′, 5′-GCTGAATATGTCAACAACTCATATGA-3′, and 5′-TGCTGCTATTAAAATGTCAGAGTGT-3′, which we called 21_RIGHT_P.1, 24_LEFT_P.1, and 25_LEFT_P.1 respectively. Primers 21_RIGHT_P.1 and 25_LEFT_P.1 were spiked into existing Freed et al. primer pools at the same molarity as the rest of the primers. Primer 24_LEFT_P.1 was spiked in at four-times molarity after a titration experiment on 24 clinical specimens showed that spiking in at one-times molarity did not significantly improve performance (Figure 1B). We then sequenced 373 clinical specimens with both non-spiked and spiked primer pools. Using these paired data, changes in depth of coverage across the affected amplicons were calculated for each specimen. It was observed that the spiked primers pools significantly improved depth of coverage for all three impacted amplicons without detrimental effects on non-P.1 lineages (Figure 1C). From these analyses, it was concluded that the established amplicon sequencing protocols, like ARTIC version 3 and Freed et al., need immediate updates to address numerous prevalent mutations in VOC lineages. We have shown that spike-in primers can restore performance for impacted amplicons, but we caution that these spike-in primers are likely a temporary measure. As new SARS-CoV-2 lineages continue to emerge, multiplex PCR amplification-based sequencing protocols will need to evolve alongside their viral target. This principle was demonstrated in the months following the work presented above. B.1.617+ lineages, collectively renamed ‘delta’ by the World Health Organization, went from representing only a handful of cases in British Columbia to being predominant. We assessed 388 specimens from local delta lineage specimens with PCR_strainer ( Table S1), which confirmed the presence of the two Freed et al. primer site mutations previously identified in B.1.617+ sequences from GISAID (Table 1). It was also noted that an additional, less prevalent primer site mutation had emerged in delta lineages in the intervening months. The mutation in the left primer of amplicon 5 had no meaningful impact on depth of coverage, which was not surprising given its location in the middle of the oligo ( Figure S1). The mutations in the left and right primers of amplicon 27 both caused reduced depth of coverage for these amplicons, but not to the same magnitude as the primer site mutations impacting P.1 lineages discussed above ( Figure S1). Emerging lineages must be monitored for disruptive mutations so that primer schemes can be updated routinely. As demonstrated here, spike-in primers can be effective stopgaps to maintain performance between major redesigns. The PCR_strainer pipeline can facilitate both tasks by screening tens-of-thousands of genome sequences and identifying relevant mutations.

Acknowledgements

We thank the dedicated staff at the British Columbia Centre for Disease Control Public Health Laboratory for processing and sequencing SARS-CoV-2 clinical specimens, especially the Molecular Microbiology and Genomics program for optimizing genomics methods, and the Bacteriology and Mycology program for routine sequencing of clinical specimens. We also thank the analytical staff for routine bioinformatic analysis.

Declarations

Funding source

This work was funded in part by the Canadian COVID genomics network (CanCOGeN), and a CIHR grant (OV4-170361).

Ethical approval

This work was conducted under a surveillance mandate, authorized by the Provincial Health Officer of British Columbia under the Health Act, without requirement for research ethics board review.

Conflict of interest

The authors have no conflicts of interest to declare.
  7 in total

1.  Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples.

Authors:  Joshua Quick; Nathan D Grubaugh; Steven T Pullan; Ingra M Claro; Andrew D Smith; Karthik Gangavarapu; Glenn Oliveira; Refugio Robles-Sikisaka; Thomas F Rogers; Nathan A Beutler; Dennis R Burton; Lia Laura Lewis-Ximenez; Jaqueline Goes de Jesus; Marta Giovanetti; Sarah C Hill; Allison Black; Trevor Bedford; Miles W Carroll; Marcio Nunes; Luiz Carlos Alcantara; Ester C Sabino; Sally A Baylis; Nuno R Faria; Matthew Loose; Jared T Simpson; Oliver G Pybus; Kristian G Andersen; Nicholas J Loman
Journal:  Nat Protoc       Date:  2017-05-24       Impact factor: 13.491

2.  Sixteen novel lineages of SARS-CoV-2 in South Africa.

Authors:  Houriiyah Tegally; Eduan Wilkinson; Richard J Lessells; Jennifer Giandhari; Sureshnee Pillay; Nokukhanya Msomi; Koleka Mlisana; Jinal N Bhiman; Anne von Gottberg; Sibongile Walaza; Vagner Fonseca; Mushal Allam; Arshad Ismail; Allison J Glass; Susan Engelbrecht; Gert Van Zyl; Wolfgang Preiser; Carolyn Williamson; Francesco Petruccione; Alex Sigal; Inbal Gazy; Diana Hardie; Nei-Yuan Hsiao; Darren Martin; Denis York; Dominique Goedhals; Emmanuel James San; Marta Giovanetti; José Lourenço; Luiz Carlos Junior Alcantara; Tulio de Oliveira
Journal:  Nat Med       Date:  2021-02-02       Impact factor: 53.440

3.  Detection of a SARS-CoV-2 variant of concern in South Africa.

Authors:  Houriiyah Tegally; Eduan Wilkinson; Marta Giovanetti; Arash Iranzadeh; Vagner Fonseca; Jennifer Giandhari; Deelan Doolabh; Sureshnee Pillay; Emmanuel James San; Nokukhanya Msomi; Koleka Mlisana; Anne von Gottberg; Sibongile Walaza; Mushal Allam; Arshad Ismail; Thabo Mohale; Allison J Glass; Susan Engelbrecht; Gert Van Zyl; Wolfgang Preiser; Francesco Petruccione; Alex Sigal; Diana Hardie; Gert Marais; Nei-Yuan Hsiao; Stephen Korsman; Mary-Ann Davies; Lynn Tyers; Innocent Mudau; Denis York; Caroline Maslo; Dominique Goedhals; Shareef Abrahams; Oluwakemi Laguda-Akingba; Arghavan Alisoltani-Dehkordi; Adam Godzik; Constantinos Kurt Wibmer; Bryan Trevor Sewell; José Lourenço; Luiz Carlos Junior Alcantara; Sergei L Kosakovsky Pond; Steven Weaver; Darren Martin; Richard J Lessells; Jinal N Bhiman; Carolyn Williamson; Tulio de Oliveira
Journal:  Nature       Date:  2021-03-09       Impact factor: 49.962

4.  Assessing oligonucleotide designs from early lab developed PCR diagnostic tests for SARS-CoV-2 using the PCR_strainer pipeline.

Authors:  Kevin S Kuchinski; Agatha N Jassem; Natalie A Prystajecky
Journal:  J Clin Virol       Date:  2020-08-21       Impact factor: 3.168

5.  Improvements to the ARTIC multiplex PCR method for SARS-CoV-2 genome sequencing using nanopore.

Authors:  John R Tyson; Phillip James; David Stoddart; Natalie Sparks; Arthur Wickenhagen; Grant Hall; Ji Hyun Choi; Hope Lapointe; Kimia Kamelian; Andrew D Smith; Natalie Prystajecky; Ian Goodfellow; Sam J Wilson; Richard Harrigan; Terrance P Snutch; Nicholas J Loman; Joshua Quick
Journal:  bioRxiv       Date:  2020-09-04

6.  SARS-CoV-2 Spike Mutations, L452R, T478K, E484Q and P681R, in the Second Wave of COVID-19 in Maharashtra, India.

Authors:  Sarah Cherian; Varsha Potdar; Santosh Jadhav; Pragya Yadav; Nivedita Gupta; Mousumi Das; Partha Rakshit; Sujeet Singh; Priya Abraham; Samiran Panda; Nic Team
Journal:  Microorganisms       Date:  2021-07-20

7.  Rapid and inexpensive whole-genome sequencing of SARS-CoV-2 using 1200 bp tiled amplicons and Oxford Nanopore Rapid Barcoding.

Authors:  Nikki E Freed; Markéta Vlková; Muhammad B Faisal; Olin K Silander
Journal:  Biol Methods Protoc       Date:  2020-07-18
  7 in total
  6 in total

Review 1.  Advances and Trends in Omics Technology Development.

Authors:  Xiaofeng Dai; Li Shen
Journal:  Front Med (Lausanne)       Date:  2022-07-01

2.  Spike Mutation Profiles Associated With SARS-CoV-2 Breakthrough Infections in Delta Emerging and Predominant Time Periods in British Columbia, Canada.

Authors:  Chad D Fibke; Yayuk Joffres; John R Tyson; Caroline Colijn; Naveed Z Janjua; Chris Fjell; Natalie Prystajecky; Agatha Jassem; Hind Sbihi
Journal:  Front Public Health       Date:  2022-07-04

3.  Future-proofing and maximizing the utility of metadata: The PHA4GE SARS-CoV-2 contextual data specification package.

Authors:  Emma J Griffiths; Ruth E Timme; Catarina Inês Mendes; Andrew J Page; Nabil-Fareed Alikhan; Dan Fornika; Finlay Maguire; Josefina Campos; Daniel Park; Idowu B Olawoye; Paul E Oluniyi; Dominique Anderson; Alan Christoffels; Anders Gonçalves da Silva; Rhiannon Cameron; Damion Dooley; Lee S Katz; Allison Black; Ilene Karsch-Mizrachi; Tanya Barrett; Anjanette Johnston; Thomas R Connor; Samuel M Nicholls; Adam A Witney; Gregory H Tyson; Simon H Tausch; Amogelang R Raphenya; Brian Alcock; David M Aanensen; Emma Hodcroft; William W L Hsiao; Ana Tereza R Vasconcelos; Duncan R MacCannell
Journal:  Gigascience       Date:  2022-02-16       Impact factor: 6.524

4.  Development and validation of a high throughput SARS-CoV-2 whole genome sequencing workflow in a clinical laboratory.

Authors:  Sun Hee Rosenthal; Anna Gerasimova; Rolando Ruiz-Vega; Kayla Livingston; Ron M Kagan; Yan Liu; Ben Anderson; Renius Owen; Laurence Bernstein; Alla Smolgovsky; Dong Xu; Rebecca Chen; Andrew Grupe; Pranoot Tanpaiboon; Felicitas Lacbawan
Journal:  Sci Rep       Date:  2022-02-08       Impact factor: 4.379

5.  Optimization of the SARS-CoV-2 ARTIC Network V4 Primers and Whole Genome Sequencing Protocol.

Authors:  Arnold W Lambisia; Khadija S Mohammed; Timothy O Makori; Leonard Ndwiga; Maureen W Mburu; John M Morobe; Edidah O Moraa; Jennifer Musyoki; Nickson Murunga; Jane N Mwangi; D James Nokes; Charles N Agoti; Lynette Isabella Ochola-Oyier; George Githinji
Journal:  Front Med (Lausanne)       Date:  2022-02-17

6.  ProbeTools: designing hybridization probes for targeted genomic sequencing of diverse and hypervariable viral taxa.

Authors:  Kevin S Kuchinski; Jun Duan; Chelsea Himsworth; William Hsiao; Natalie A Prystajecky
Journal:  BMC Genomics       Date:  2022-08-12       Impact factor: 4.547

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.