| Literature DB >> 34438144 |
Candice L Swift1, Mirza Isanovic1, Karlen E Correa Velez1, R Sean Norman2.
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus responsible for causing the COVID-19 pandemic, can be detected in untreated wastewater. Wastewater surveillance of SARS-CoV-2 complements clinical data by offering earlier community-level detection, removing underlying factors such as access to healthcare, sampling asymptomatic patients, and reaching a greater population. Here, we compare 24-hour composite samples from the influents of two different wastewater treatment plants (WWTPs) in South Carolina, USA: Columbia and Rock Hill. The sampling intervals span the months of July 2020 and January 2021, which cover the first and second waves of elevated SARS-CoV-2 transmission and COVID-19 clinical cases in these regions. We identify four signature mutations in the surface glycoprotein (spike) gene that are associated with the following variants of interest or concern, VOI or VOC (listed in parenthesis): S477N (B.1.526, Iota), T478K (B.1.617.2, Delta), D614G (present in all VOC as of May 2021), and H655Y (P.1, Gamma). The N501Y mutation, which is associated with three variants of concern, was identified in samples from July 2020, but not detected in January 2021 samples. Comparison of mutations identified in viral sequence databases such as NCBI Virus and GISAID indicated that wastewater sampling detected mutations that were present in South Carolina, but not reflected in the clinical data deposited into databases.Entities:
Keywords: SARS-CoV-2; Sewage surveillance; Variant detection; Wastewater-based epidemiology
Mesh:
Substances:
Year: 2021 PMID: 34438144 PMCID: PMC8372435 DOI: 10.1016/j.scitotenv.2021.149691
Source DB: PubMed Journal: Sci Total Environ ISSN: 0048-9697 Impact factor: 7.963
Fig. 1COVID-19 clinical cases peaked in Columbia (orange) and Rock Hill (blue), South Carolina, USA during July 2020 and January 2021. Stacked area chart depicting the seven-day moving average of clinical COVID-19 case counts reported for the USA postal zip codes served by the Columbia and Rock Hill WWTPs. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Non-synonymous mutations with >100× position coverage that are present in January 2021, but absent in July 2020. RdRp = RNA-dependent RNA polymerase. The entirety of the GISAID database was searched on June 17, 2021. Acknowledgement for contributions to GISAID are included in Supplementary Dataset S8.
| Nucleotide | Mutation in product | Coverage | SRA counts | GISAID | GISAID filtered |
|---|---|---|---|---|---|
| 6032 | NSP3_A1105T | 129 | 0 | 797 | 5 |
| 8083 | NSP3_M1788I | 116 | 120 | >10 k | 128 |
| 9928 | NSP4_M458I | 163 | 42 | 8321 | 61 |
| 11,085 | NSP6_Y38F | 124 | 0 | 218 | 1 |
| 12,986 | NSP9_M101V | 106 | 0 | 430 | 1 |
| 14,805 | RdRp_T455I | 102 | 0 | 0 | 0 |
| 16,175 | RdRp_L912F | 138 | 0 | 0 | 0 |
| Other | |||||
| 21,707 | Surface glycoprotein | 217 | 51 | 4923 | 1 |
| 27,964 | ORF8 protein | 107 | 2544 | 0 | 0 |
Location filtered by South Carolina and Collection date between January 1 and January 31, 2021, inclusive.
Absent from database search results, but present in the literature (Bui et al., 2021; Zeng et al., 2020).
GISAID not searchable by ORF8 substitutions.
Spike gene mutations identified in this work and associated references. NCBI SRA and GISAID were searched on June 16, 2021. High-confidence mutations are located in positions with at least 100× coverage, whereas medium-confidence mutations are located in positions with at least 50× coverage. Literature references are not comprehensive. Acknowledgement for contributions from GISAID are included in Supplementary Dataset S7.
| Nucleotide position | Mutation | Sample set | References | SRA Counts | GISAID | GISAID filtered | |
|---|---|---|---|---|---|---|---|
| High-confidence, >100× coverage | |||||||
| 21,707 | H49Y | January 2021 | ( | 51 | 4919 | 1 | |
| 23,063 | N501Y | July 2020 | ( | 49 | >10 k | 0 | |
| 24,319 | Q920K | July 2020 | ( | 0 | 2 | 0 | |
| Medium-confidence, >50× coverage | |||||||
| 21,614 | L18F | January 2021 | ( | 59 | >10 k | 4 | |
| 23,525 | H655Y | July 2020 | NA | 35 | 69 | 0 | |
| 24,007 | L822F | January 2021 | ( | 52 | 1523 | 1 | |
| 24,023 | L858F | January 2021 | NA | 0 | 197 | 0 | |
| 24,026 | S884F | January 2021 | ( | 0 | 438 | 0 | |
| 24,134 | S939F | January 2021 | ( | 119 | 6254 | 0 | |
| 24,213 | V1228L | January 2021 | ( | 45 | 4633 | 11 | |
| 24,733 | H1058Y | July 2020 | NA | 0 | 285 | 1 | |
Location filtered by South Carolina and Collection date between January 1 and January 31, 2021, and/or June 28, 2020 and July 31, 2020, depending on the sample sets where the mutation was observed.
Filtering by Collection Date of June 28 to July 31, 2020, but unrestricted location, resulted in 34 viruses with the N501Y mutation, 26 of which originated from samples in the USA.
Fig. 2Spike gene mutations from the Wuhan SARS-CoV-2 reference genome (accession MN908947.3) detected in Columbia (orange) and Rock Hill (blue) WWTP influents during the month of January 2021. Bubble sizes represent the variant nucleotide frequency. Data labels are shown only for mutations identified in one or more variants of concern, but all detected mutations, both synonymous and nonsynonymous, are plotted (Supplementary Datasets S1 and S2). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Comparison of signature mutations detected in wastewater to sequenced Variants of Interest (VOI) or Variants of Concern (VOC) from clinical samples collected during January 2021 in South Carolina. Substitutions in bold were detected in WWTP influent samples. Clinical sequence data was taken from GISAID, and acknowledgements are included Supplementary Dataset S5. Spike protein substitutions were taken from the Center for Disease Control (Centers for Disease Control and Prevention, 2021). Substitutions shown in parenthesis indicate that the substitution is only observed in some sequences.
| Variant Pango lineage/WHO designation | Number of clinical Sequences (GISAID) | Collection date range | Spike Mutations | WWTP and collection date |
|---|---|---|---|---|
| B.1.351/Beta (VOC) | 8 | 01-08-21 to | D80A, D215G, 241del, 242del, 243del, K417N, E484K, N501Y, | D614G: Columbia 01-10-21 and 01-18-21 |
| B.1.427, B.1.429/Epsilon (VOC) | 35 | 01-03-21 | L452R, | D614G: Columbia 01-10-21 and 01-18-21 |
| P.1/Gamma (VOC) | 0 | N/A | L18F, T20N, P26S, D138Y, R190S, K417T, E484K, N501Y, | H655Y: Rock Hill 01-17-21, D614G: Columbia 01-10-21 and 01-18-21 |
| B.1.617.2/Delta (VOI) | 0 | N/A | Spike: T19R, (G142D), 156del, 157del, R158G, L452R, | T478K: Columbia 01-18-21 |
| B.1.525/Eta (VOI) | 1 | 01-25-21 | A67V, 69del, 70del, 144del, E484K, | D614G: Columbia 01-10-21 and 01-18-21 |
| B.1.526/Iota (VOI) | 1 | (L5F*), T95I, D253G, ( | S477N: Columbia 01-18-21 |
First sequence for this variant in GISAID was submitted from a sample collected on 02-24-21.
First sequence for this variant in GISAID was submitted from a sample collected on 05-05-21.
Fig. 3Non-synonymous shared or unique mutations detected in wastewater from July 2020 and January 2021. Each reported mutation had at least 100× coverage in its position and at least 10 divergent reads. RdRp = RNA-dependent RNA polymerase, 3'-to-5' = 3'-to-5' exonuclease.