Looking for a needle in a haystack. SARS-CoV-2 variant characterization in sewage.

Marta Itarte1,2, Sílvia Bofill-Mas1,2, Sandra Martínez-Puchol1,2, Helena Torrell3, Adrià Ceretó3, Marina Carrasco1, Eva Forés1,2, Núria Canela3, Rosina Girones1,2, Marta Rusiñol4.   

Abstract

SARS-CoV-2 variants are emerging worldwide, and monitoring them is key in providing early warnings. Here, we summarize the different analytical approaches currently used to study the dissemination of SARS-CoV-2 variants in wastewater and discuss their advantages and disadvantages. We also provide preliminary results of two sensitive and cost-effective approaches: variant-specific reverse transcription-nested PCR assays and a nonvariant-specific amplicon deep sequencing strategy that targets three key regions of the viral spike protein. Next-generation sequencing approaches enable the simultaneous detection of signature mutations of different variants of concern in a single assay and may be the best option to explore the real picture at a particular time. Targeted PCR approaches focused on specific signature mutations will need continuous updating but are sensitive and cost-effective.
© 2021 The Authors.

Keywords:  Next-generation sequencing (NGS); SARS-CoV-2; Signature mutations; Variants of concern (VOCs); Variants of interest (VOIs); Wastewater-based epidemiology (WBE)

Year:  2021        PMID: 34849439      PMCID: PMC8621506          DOI: 10.1016/j.coesh.2021.100308

Source DB:  PubMed          Journal:  Curr Opin Environ Sci Health        ISSN: 2468-5844


Introduction

Wastewater surveillance for SARS-CoV-2 has proved to be useful in monitoring the evolution of the COVID-19 pandemic. However, new emerging variants are posing new challenges. The SARS-CoV-2 variants α, β, γ and δ (also known as lineages B.1.1.7, B.1.351, P.1 and B.1.617.2, respectively) were first detected in the United Kingdom, South Africa, Brazil and India, respectively, and were immediately considered to be variants of concern (VOCs). Such variants, which have been associated with the fluctuations seen with the pandemic waves, possess mutations that affect viral infectivity and antigenicity. These mutations are mainly located in the gene encoding the viral spike (S) protein. In particular, mutations leading to the E484K and N501Y substitutions within the receptor-binding domain of the S protein have been demonstrated to give the S protein a greater affinity for the human ACE2 receptor [13]. The commonly applied PCR methods used to quantify the concentration of the virus in environmental samples use specific primers and probes targeting the nucleocapsid (N), envelope (E) or RNA-dependent RNA polymerase (RdRp) regions. However, as stated above, the VOCs and the new variants of interest (VOIs) have most of their signature mutations within the S gene. Figure 1 summarizes the signature mutations identified in each VOC and VOI.
Figure 1

Spike protein mutations that can affect both tropism (receptor binding) and immune evasion and are therefore the focus of surveillance. All mutations indicated are related to the reference sequence (NC_045512). Variants of concern correspond to: α, β, γ, and δ. To date (15 July 2021), the rest are variants of interest. Orange ticks indicate deletions and yellow ticks amino acid mutations.

Although the combination of genome sequence analysis of samples from COVID-19 patients with epidemiological datasets has produced reliable assessments of the extent of SARS-CoV-2 transmission in the community [22], the time lag between infection and symptoms and the future decrease in sequencing will add further delays compared to the expected immediacy of the results from wastewater surveillance. At the beginning of October 2020, several new SARS-CoV-2 variants started to circulate globally [7]. At that time, the minimum number of clinical samples that had to be sequenced to find the α variant was 400, assuming that only 5% of the positive clinical samples had been sequenced and that the prevalence of this VOC in the population was 5% [20]. Thus, the analysis of SARS-CoV-2 genomes sequenced from clinical samples is limited to the fraction of the clinical samples subjected to whole-genome sequencing. Monitoring the circulation of variants in wastewater has its own caveats when dealing with mixtures of variants and/or the presence of inhibitors. Although the environmental surveillance of other epidemic viruses (such as noroviruses) has been shown to be sensitive in detecting variants [17], the consensus sequences obtained from wastewater samples might lead to artificial genomes that do not represent an existing virus. However, single nucleotide polymorphisms (SNPs) can be linked to particular variant clusters or clades and give information about SARS-CoV-2 variants circulating in a region [15].
Thus, the study of the viral RNA sequences found in wastewater is important to understand viral transmission patterns and to establish an alert system for new SARS-CoV-2 variants.
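
The figure of 400 quoted above can be read as simple arithmetic. Under one plausible interpretation (an assumption, since the text does not spell out the calculation), if 5% of positive samples are sequenced and 5% of infections are caused by the variant, one α sequence is expected for every 1/(0.05 × 0.05) = 400 positive samples. The following Python sketch illustrates this reading; the function names are hypothetical, and sequencing is assumed to be independent of variant status.

```python
def positives_per_expected_detection(sequenced_fraction, variant_prevalence):
    """Positive samples needed so that, on average, one sequenced sample carries the variant.

    Assumes a sample is sequenced independently of whether it carries the variant.
    """
    if not (0 < sequenced_fraction <= 1 and 0 < variant_prevalence <= 1):
        raise ValueError("fractions must be in (0, 1]")
    return 1.0 / (sequenced_fraction * variant_prevalence)


def detection_probability(n_positives, sequenced_fraction, variant_prevalence):
    """Probability that at least one of n positives is both sequenced and a variant case."""
    p = sequenced_fraction * variant_prevalence
    return 1.0 - (1.0 - p) ** n_positives


# Values taken from the text: 5% of positives sequenced, 5% variant prevalence.
print(round(positives_per_expected_detection(0.05, 0.05)))
print(round(detection_probability(400, 0.05, 0.05), 2))
```

Under the same assumptions, 400 positive samples give an expected count of one variant sequence but only about a 63% chance of finding at least one, which is one way to see why clinical sequencing alone can miss a variant at low prevalence.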

Recent trends in studies on SARS-CoV-2 variants in wastewater samples

A recently published study using the EU Sewage Sentinel System for SARS-CoV-2 provided an extensive report of ‘The HERA Incubator’ [10], with next-generation sequencing (NGS) information about the diversity of SARS-CoV-2 variants and their associated mutations at the community level. It determined the relative abundance of each VOC based on the abundance of reads associated with certain amino acid mutations [11]. The categorization of the mutations as unique or shared was based on the percentage of the sequences for associated mutations submitted to GISAID.

Quantitative RT-PCR based approaches

New quantitative reverse transcription PCR (RT-qPCR) protocols targeting specific mutations or deletions have been described to differentiate between SARS-CoV-2 variants. The first published multiplex RT-qPCR assay [26] uses the deletion within the ORF1a gene (which exists in most of the VOCs) and the HV69/70 deletion (present in the α variant) to differentiate this variant from the rest. Other research groups have developed allele-specific RT-qPCRs for the α variant [5,12,19,29] or multiplex assays for specific S protein mutations (L452R, E484K and N501Y) [27]. These RT-qPCR strategies can be used when there is already a high prevalence of the VOC in the community, or in other words, when SARS-CoV-2 RNA levels (measured, for example, with assays targeting the N gene) are high. On the same basis, reverse transcription droplet digital PCR (RT-ddPCR) is an alternative that might be more sensitive and allows the discrimination of closely related sequences [1,6,14]. An RT-ddPCR assay using two different probes has been designed to discriminate between wild-type sequences and sequences containing the N501Y signature mutation (present in the α, β, γ and θ variants) in wastewater [Abachin et al., 2017].
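
For a two-probe RT-ddPCR assay of the kind described above, the proportion of mutant to wild-type sequences can be estimated from droplet counts. The sketch below is illustrative only: it assumes the standard Poisson correction for droplet partitioning (which the text does not detail), the droplet counts are hypothetical, and the function names are invented for this example.

```python
import math


def copies_per_droplet(positive_droplets, total_droplets):
    """Poisson-corrected mean number of target copies per droplet."""
    if total_droplets <= 0 or not 0 <= positive_droplets < total_droplets:
        raise ValueError("need 0 <= positive_droplets < total_droplets")
    return -math.log(1.0 - positive_droplets / total_droplets)


def mutant_fraction(mutant_positive, wildtype_positive, total_droplets):
    """Estimated share of mutant copies among mutant + wild-type copies."""
    lam_mut = copies_per_droplet(mutant_positive, total_droplets)
    lam_wt = copies_per_droplet(wildtype_positive, total_droplets)
    if lam_mut + lam_wt == 0:
        raise ValueError("no positive droplets for either probe")
    return lam_mut / (lam_mut + lam_wt)


# Hypothetical counts: 150 mutant-positive and 1350 wild-type-positive
# droplets out of 15000 accepted droplets.
print(f"{mutant_fraction(150, 1350, 15000):.1%}")
```

Because the estimate is a ratio, the droplet volume cancels out; it would only be needed to convert the per-droplet values into absolute concentrations.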

Amplicon sequencing based approaches

Reverse transcription-nested PCR (RT-nPCR) assays followed by Sanger sequencing and/or NGS analysis have been published for SARS-CoV-2 characterization. In October 2020, Martin and collaborators designed an RT-nPCR approach followed by Sanger sequencing and NGS analysis of the amplified products from five different regions of the viral genome, which demonstrated changes in the predominance of the virus variants [20]. La Rosa and coworkers [25] adopted a similar approach involving conventional Sanger sequencing of the amplicon but focusing only on key mutations of the S gene, which allowed rapid screening of the SARS-CoV-2 variants [La Rosa et al., 2021]. Recently, another group from the United Kingdom used two different RT-nPCR assays targeting the RdRP and ORF8b gene regions for diagnostics and two primer sets targeting the S gene regions to discriminate between the α, β and γ variants [28]. Sequencing amplicons using NGS, commonly known as amplicon deep sequencing (ADS), has been applied not only to selected parts of the SARS-CoV-2 genome but also to the whole genome as an informative method for detecting and identifying SARS-CoV-2 variants. Several custom enrichment strategies based on designing primer sets coupled with Illumina-compatible library preparation kits have been used to sequence amplified fragments spanning the whole or near-complete genome of SARS-CoV-2 from environmental samples [2,15,18,20,28]. Other studies have used the open-source ARTIC protocol [3,16,23]. This protocol, released in March 2020 and designed to sequence the virus from clinical samples, uses 98 multiplexed PCR primer pairs to amplify the whole genome of the virus [24]. Similarly, the commercial AmpliSeq SARS-CoV-2 Research Panel (Thermo Fisher Scientific) consists of two pools with amplicons ranging from 125 bp to 275 bp that cover >99% of the SARS-CoV-2 genome and is compatible with either Illumina or Ion Torrent sequencing platforms [2].
Another strategy based on NGS is the use of a commercial oligo-capture approach, like the Illumina Respiratory Virus Oligo Panel (Illumina, Inc.) or the VirCapSeq Enrichment Kit (Roche), which are designed to enrich the sequences of human respiratory or vertebrate viruses, respectively, and both have been applied to complex environmental samples prior to massive sequencing [8,21]. Based on the findings of available studies, the most abundant single nucleotide variations (SNVs) that have been identified in wastewater to date correspond to the most abundant SNVs in clinical samples [8]. The identification of an individual or several signature mutations (Figure 1) located in close proximity to one another within the sample amplicon can help identify new SNVs in the population being analyzed. When using these approaches in environmental samples containing a mixture of variant sequences, there is a possibility of generating artificial genome reconstructions or artefacts during sequence assembly, which could result in unreliable VOC or VOI assignations. The ADS of selected regions provides a more robust characterization of genomic variants compared to broader genome reconstructions within individual samples. When applied to clinical samples, long-read sequencing platforms have been proven to be efficient in obtaining highly accurate consensus-level sequences despite the higher error rates [4]. However, to our knowledge, this approach has not been applied in the study of SARS-CoV-2 variants in sewage.
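
The risk of artificial genome reconstructions mentioned above can be illustrated with a toy example. The sketch below is not the assembly method of any cited study: it assumes a simple per-position majority consensus, and the three haplotypes and their read counts are hypothetical ('M' marks a mutated site, 'r' the reference state).

```python
from collections import Counter


def majority_consensus(reads):
    """Per-position majority call across equal-length, pre-aligned reads."""
    length = len(reads[0])
    if any(len(read) != length for read in reads):
        raise ValueError("reads must be aligned to the same length")
    return "".join(Counter(column).most_common(1)[0][0] for column in zip(*reads))


# Hypothetical mixture of three co-circulating haplotypes over three sites,
# each contributing the same number of reads.
haplotypes = {"MMr": 10, "MrM": 10, "rMM": 10}
reads = [hap for hap, count in haplotypes.items() for _ in range(count)]

consensus = majority_consensus(reads)
print(consensus, consensus in haplotypes)  # prints "MMM False"
```

Each site is mutated in two thirds of the reads, so the consensus carries all three mutations even though no haplotype in the mixture does. This is why the linkage of mutations on individual reads or amplicons is more informative than a sample-wide consensus when variants are mixed.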

Specific regions for the characterization of SARS-CoV-2 genomic variants

Approaches targeting selected regions of the SARS-CoV-2 genome in which signature mutations are located are of greater interest than the sequencing of other regions that are more conserved and less informative about genomic variants. For discriminating between variants, European authorities have established that sequencing should cover at least the S gene, particularly the part encoding the entire N-terminal region and the receptor-binding domain (RBD), corresponding to amino acids 1 to 541 [9]. Preliminary data obtained from two different approaches developed by our research group are detailed below. These approaches involved specific RT-nPCR assays targeting the signature mutations of the main VOCs and VOIs followed by Sanger sequencing (assays A and B) and an ADS strategy targeting three different regions of the S gene (assays A1, A2 and A3). Both approaches were tested in parallel in samples collected from February to May 2021 from wastewater treatment plants (WWTPs) of different sizes located in Catalonia, northeast Spain. More information about the methodology is provided in the Supplementary Material. The results obtained are summarized in Table 1, and the datasets generated are available in Zenodo under the DOI https://doi.org/10.5281/zenodo.5497909.
Table 1

Summary of SARS-CoV-2 concentrations (GC/L) detected using RT-qPCR and signature mutations detected using RT-nPCR and Sanger sequencing or ADS in a MiSeq platform. ND: not detected.

As indicated in Table 1, Sanger sequencing allowed the identification of signature mutations in the samples, in which the following were predominant: Del69/70, Del144, K417N and E484K. The ADS approach gave information about the genomic diversity in each sample, showing different signature mutation combinations that are compatible with different variants, as expected in mixtures coming from wastewater samples. Interestingly, ADS indicated the moment when the α variant probably became predominant in Catalonia. Among all the sequences obtained from the NGS analysis of the samples collected on 2 February 2021, the Del69/70 mutation was present in 0.1%, 64.1% and 0% of the sequences obtained from WWTP1, WWTP2 and WWTP3, respectively. One week later, these percentages had increased to 99.5%–100%, an increase that was also observed for other signature mutations of the α variant (N501Y, A570D and D614G). These ADS results associated with α variant predominance agree with the information obtained from the Sanger sequencing. The detection of the signature mutations compatible with the α variant with Sanger sequencing was only possible in samples that showed a high percentage of the signature mutations of the α variant by ADS, or in other words, when these mutations were predominant among the mixture of sequences. The other signature mutation identified by ADS was S477N, which is characteristic of the ι variant.
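
The percentages reported above are per-sample read fractions, and the calculation can be sketched as follows. The read counts are hypothetical (only percentages are reported in the text; the counts were chosen to reproduce the 2 February values for Del69/70), the 50% predominance threshold is an assumption, and the function names are illustrative.

```python
def mutation_percentage(reads_with_mutation, total_reads):
    """Percentage of reads covering a site that carry the signature mutation."""
    if total_reads <= 0 or not 0 <= reads_with_mutation <= total_reads:
        raise ValueError("need 0 <= reads_with_mutation <= total_reads and total_reads > 0")
    return 100.0 * reads_with_mutation / total_reads


def is_predominant(percentage, threshold=50.0):
    """Assumed rule: a mutation is predominant if it exceeds the threshold."""
    return percentage > threshold


# Hypothetical (reads with Del69/70, total reads) per plant.
counts = {"WWTP1": (1, 1000), "WWTP2": (641, 1000), "WWTP3": (0, 1000)}

for plant, (with_mutation, total) in counts.items():
    pct = mutation_percentage(with_mutation, total)
    print(f"{plant}: {pct:.1f}% predominant={is_predominant(pct)}")
```

Under this assumed rule only WWTP2 would be flagged on that date, which is consistent with the observation that Sanger sequencing detected the α signature mutations only where ADS showed them at a high percentage.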

Variant study approaches: the pros and cons

Different analytical approaches for the study of SARS-CoV-2 variants in wastewater samples have been developed, each one providing different types of information. Table 2 lists the pros and cons of the different methodologies that have been used to date and suggests a suitable application for each according to its intrinsic properties.
Table 2

List of pros and cons of the different methodologies used in the study of SARS-CoV-2 variants in sewage samples.

Method: RT-qPCR
Pros: Low cost; approximation of the specific signature mutation vs. WT proportion in a mixture; fast turnaround of results; detects rare mutations and discriminates closely related sequences
Cons: Different target sensitivities when multiplexing; designed to detect a signature mutation of a specific variant only, thus not giving information about other possible variants also present in the sample
Applicability: Monitoring of a specific variant in a region where it has spread

Method: RT-ddPCR
Pros: Fast turnaround of results; approximation of the specific signature mutation vs. WT proportion in a mixture; detects rare mutations and discriminates closely related sequences; more sensitive than RT-qPCR
Cons: Designed to detect a signature mutation of a specific variant only, thus not giving information about other possible variants also present in the sample; more expensive than RT-qPCR
Applicability: More sensitive monitoring of a specific variant in a region where it has spread

Method: RT-nPCR + Sanger sequencing
Pros: Low cost; fast turnaround of results; easy interpretation of the results; may use primers specifically targeting defined signature mutations
Cons: Detects only the predominant variant in the mixture; not quantitative; cannot be effectively performed in conditions of low virus titers
Applicability: Fast elucidation of the predominant variant circulating in a region

Method: NGS
Pros: Shows the diversity of variants circulating; more extensive information about mutations in a larger range of the genome
Cons: Expensive; extensive bioinformatics analysis; not quantitative; labour intensive; time consuming; might lead to artificial consensus genomes; cannot be effectively performed in conditions of low virus titers
Applicability: Characterization of variant diversity circulating in a region

RT-qPCR and RT-ddPCR are designed to detect a signature mutation of a particular variant and are the fastest at providing results. Both methodologies are often designed as duplex or multiplex assays, allowing the simultaneous detection of other variants and giving an estimation of their percentages among other simultaneously occurring variants. Thus, they are appropriate for monitoring a specific variant in a region where it has spread and become established, because the target variant must reach a certain proportion relative to the others in order to be detected. RT-ddPCR might be more sensitive and precise than RT-qPCR, but it is also more expensive [1,6,14] [Abachin et al., 2017]. However, wastewater is a complex sample, and it is likely to contain a mixture of variants. In a region where the predominant variant circulating within the population is not clear or where the situation is constantly changing, non-variant-specific methodologies might be more suitable since they do not need continuous updating of the assay. In such cases, RT-nPCR assays followed by Sanger sequencing of specific regions containing signature mutations would be highly informative and would identify the predominant variant circulating in the population, as this type of sequencing gives information about the most abundant sequence amplified. Furthermore, RT-nPCR can use specific primers for a defined mutation that can target specific variants and regions where other mutations may occur. By contrast, if the objective is to perform an accurate characterization of the diversity present in wastewater, or in other words, to identify the different variants present in a mixture, NGS analysis would be more appropriate. NGS techniques, which are considered expensive, provide extensive information but require exhaustive bioinformatics analysis and expertise.

Conclusions

Monitoring SARS-CoV-2 variants in wastewater is important for epidemiological surveillance in a community. Different analytical approaches have been developed to identify and study the dissemination of SARS-CoV-2 variants in wastewater samples, including RT-qPCR, RT-nPCR, and NGS approaches. Due to its intrinsic nature, each method has pros and cons and provides different types of information, which is important to consider when selecting the appropriate method for a specific objective. In a postpandemic scenario, when PCR-based assays and sequencing of clinical samples decrease, the sequencing of a subset of wastewater samples may be enough to monitor the circulation of different VOCs and VOIs in a community. A representative sample needs to be collected regularly from a given region to accurately estimate and monitor the prevalence of SARS-CoV-2 variants. Nonvariant-specific techniques may be the best option to explore the real picture of all the circulating variants at a particular time, providing broader information that can contribute to community surveillance. This study provides guidance on available approaches for detecting and identifying circulating SARS-CoV-2 variants considering different scenarios. Further work on the massive sequencing of SARS-CoV-2 from environmental samples is needed, both to produce longer fragments that avoid overlapping and chimeric constructions and to shorten bioinformatic processing for an effective early warning.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References

1.  Rapid SARS-CoV-2 whole-genome sequencing and analysis for informed public health decision-making in the Netherlands.

Authors:  Aura Timen; Marion Koopmans; Bas B Oude Munnink; David F Nieuwenhuijse; Mart Stein; Áine O'Toole; Manon Haverkate; Madelief Mollers; Sandra K Kamga; Claudia Schapendonk; Mark Pronk; Pascal Lexmond; Anne van der Linden; Theo Bestebroer; Irina Chestakova; Ronald J Overmars; Stefan van Nieuwkoop; Richard Molenkamp; Annemiek A van der Eijk; Corine GeurtsvanKessel; Harry Vennema; Adam Meijer; Andrew Rambaut; Jaap van Dissel; Reina S Sikkema
Journal:  Nat Med       Date:  2020-07-16       Impact factor: 53.440

2.  Comparison of reverse-transcriptase qPCR and droplet digital PCR for the quantification of dengue virus nucleic acid.

Authors:  Eric Abachin; Samantha Convers; Stephanie Falque; Raphaël Esson; Laurent Mallet; Nolwenn Nougarede
Journal:  Biologicals       Date:  2018-02-03       Impact factor: 1.856

3.  Monitoring Emergence of the SARS-CoV-2 B.1.1.7 Variant through the Spanish National SARS-CoV-2 Wastewater Surveillance System (VATar COVID-19).

Authors:  Albert Carcereny; Adán Martínez-Velázquez; Albert Bosch; Ana Allende; Pilar Truchado; Jenifer Cascales; Jesús L Romalde; Marta Lois; David Polo; Gloria Sánchez; Alba Pérez-Cataluña; Azahara Díaz-Reolid; Andrés Antón; Josep Gregori; Damir Garcia-Cehic; Josep Quer; Margarita Palau; Cristina González Ruano; Rosa M Pintó; Susana Guix
Journal:  Environ Sci Technol       Date:  2021-08-16       Impact factor: 11.357

4.  Multiplex SARS-CoV-2 Genotyping Reverse Transcriptase PCR for Population-Level Variant Screening and Epidemiologic Surveillance.

Authors:  Hannah Wang; Jacob A Miller; Michelle Verghese; Mamdouh Sibai; Daniel Solis; Kenji O Mfuh; Becky Jiang; Naomi Iwai; Marilyn Mar; ChunHong Huang; Fumiko Yamamoto; Malaya K Sahoo; James Zehnder; Benjamin A Pinsky
Journal:  J Clin Microbiol       Date:  2021-07-19       Impact factor: 5.948

5.  Spatial and temporal distribution of SARS-CoV-2 diversity circulating in wastewater.

Authors:  Alba Pérez-Cataluña; Álvaro Chiner-Oms; Enric Cuevas-Ferrando; Azahara Díaz-Reolid; Irene Falcó; Walter Randazzo; Inés Girón-Guzmán; Ana Allende; María A Bracho; Iñaki Comas; Gloria Sánchez
Journal:  Water Res       Date:  2021-12-24       Impact factor: 11.236

6.  Tracking SARS-CoV-2 in Sewage: Evidence of Changes in Virus Variant Predominance during COVID-19 Pandemic.

Authors:  Javier Martin; Dimitra Klapsa; Thomas Wilton; Maria Zambon; Emma Bentley; Erika Bujaki; Martin Fritzsche; Ryan Mate; Manasi Majumdar
Journal:  Viruses       Date:  2020-10-09       Impact factor: 5.048

7.  Rapid Increase of SARS-CoV-2 Variant B.1.1.7 Detected in Sewage Samples from England between October 2020 and January 2021.

Authors:  Thomas Wilton; Erika Bujaki; Dimitra Klapsa; Manasi Majumdar; Maria Zambon; Martin Fritzsche; Ryan Mate; Javier Martin
Journal:  mSystems       Date:  2021-06-15       Impact factor: 6.496

8.  Assessing sensitivity and reproducibility of RT-ddPCR and RT-qPCR for the quantification of SARS-CoV-2 in wastewater.

Authors:  Mark Ciesielski; Denene Blackwood; Thomas Clerkin; Raul Gonzalez; Hannah Thompson; Allison Larson; Rachel Noble
Journal:  J Virol Methods       Date:  2021-07-09       Impact factor: 2.014
