| Literature DB >> 35677072 |
Devon A Gregory1, Monica Trujillo2, Clayton Rushford1, Anna Flury3, Sherin Kannoly4, Kaung Myat San4, Dustin Lyfoung5, Roger W Wiseman5, Karen Bromert6, Ming-Yi Zhou6, Ellen Kesler6, Nathan Bivens6, Jay Hoskins7, Chung-Ho Lin8, David H O'Connor5, Chris Wieberg9, Jeff Wenzel10, Rose S Kantor11, John J Dennehy3,4, Marc C Johnson1.
Abstract
Wastewater-based epidemiology (WBE) is an effective way of tracking the appearance and spread of SARS-COV-2 lineages through communities. Beginning in early 2021, we implemented a targeted approach to amplify and sequence the receptor binding domain (RBD) of SARS-COV-2 to characterize viral lineages present in sewersheds. Over the course of 2021, we reproducibly detected multiple SARS-COV-2 RBD lineages that have never been observed in patient samples in 9 sewersheds located in 3 states in the USA. These cryptic lineages contained between 4 to 24 amino acid substitutions in the RBD and were observed intermittently in the sewersheds in which they were found for as long as 14 months. Many of the amino acid substitutions in these lineages occurred at residues also mutated in the Omicron variant of concern (VOC), often with the same substitution. One of the sewersheds contained a lineage that appeared to be derived from the Alpha VOC, but the majority of the lineages appeared to be derived from pre-VOC SARS-COV-2 lineages. Specifically, several of the cryptic lineages from New York City appeared to be derived from a common ancestor that most likely diverged in early 2020. While the source of these cryptic lineages has not been resolved, it seems increasingly likely that they were derived from immunocompromised patients or animal reservoirs. Our findings demonstrate that SARS-COV-2 genetic diversity is greater than what is commonly observed through routine SARS-CoV-2 surveillance. Wastewater sampling may more fully capture SARS-CoV-2 genetic diversity than patient sampling and could reveal new VOCs before they emerge in the wider human population. Author Summary: During the COVID-19 pandemic, wastewater-based epidemiology has become an effective public health tool. Because many infected individuals shed SARS-CoV-2 in feces, wastewater has been monitored to reveal infection trends in the sewersheds from which the samples were derived. Here we report novel SARS-CoV-2 lineages in wastewater samples obtained from 3 different states in the USA. These lineages appeared in specific sewersheds intermittently over periods of up to 14 months, but generally have not been detected beyond the sewersheds in which they were initially found. Many of these lineages may have diverged in early 2020. Although these lineages share considerable overlap with each other, they have never been observed in patients anywhere in the world. While the wastewater lineages have similarities with lineages observed in long-term infections of immunocompromised patients, animal reservoirs cannot be ruled out as a potential source.Entities:
Year: 2022 PMID: 35677072 PMCID: PMC9176656 DOI: 10.1101/2022.06.03.22275961
Source DB: PubMed Journal: medRxiv
Fig. 1RBD amplification. A. Schematic of regions targeted by the RBD and S1 primer sets (see Methods for primer sequences). Overview of the SARS-COV-2 Spike RBD lineages identified in B. the MO33 sewershed and C. the MO45 sewershed. Each row represents a unique lineage and each column is an amino acid position in the Spike protein (left). Amino acid changes similar to (green boxes) or identical to (orange boxes) changes in Omicron (BA.1) are indicated. The heatmap (right) illustrates lineage (row) detection by date (column), colored by the log10 percent relative abundance of that lineage.
| Location | Date range when lineages appeared | Days within range | Number of samples | Number of RBD mutations |
|---|---|---|---|---|
| NY2 | 8/16/21–02/28/22 | 170 | 10 | 4–18 |
| NY3 | 1/31/21 [ | 437 | 7 | 16–24 |
| NY10 | 4/4/21–11/29/21 | 239 | 22 | 4–11 |
| NY11 | 4/19/21–11/22/22 | 217 | 20 | 4–9 |
| NY13 | 10/26/21–2/14/22 | 111 | 5 | 12–15 |
| NY14 | 5/10/21–10/18/21 | 161 | 9 | 8–15 |
| MO33 | 3/15/21–4/27/21 | 43 | 12 | 4–6 |
| MO45 | 6/8/21–2/22/22 | 259 | 3 | 4–5 |
| CA | 11/4/21–12/21/21 | 47 | 3 | 16 |
Fig. 2NY3 and NY14 RBD amplifications. Overview of the SARS-COV-2 Spike RBD lineages identified from the A. NY3 and B. NY14 sewershed. Amino acid changes similar to or identical to changes in Omicron (BA.1) are indicated. The lineage previously referred to as WNY4 in [8] is indicated.
Fig. 3NY10 RBD amplifications. Overview of the SARS-COV-2 Spike RBD lineages identified from the NY10 sewershed. Amino acid changes similar to or identical to changes in Omicron (BA.1) are indicated. Two lineages previously referred to as WNY1 and WNY2 in [8] are indicated.
Fig. 4NY11 RBD amplifications. Overview of the SARS-COV-2 Spike RBD lineages identified from the NY11 sewershed. Lineages designated A and B belong to two lineages groups that appear unrelated. Amino acid changes similar to or identical to changes in Omicron (BA.1) are indicated. The lineage previously referred to as WNY3 in [8] is indicated.
Fig. 5NY2 and NY13 RBD amplifications. Overview of the SARS-COV-2 Spike RBD lineages identified from the A. NY3 and B. NY13 sewershed. Lineages designated A and B belong to two lineages groups that appear unrelated. Amino acid changes similar to or identical to changes in Omicron (BA.1) are indicated.
Fig. 6Overview of the SARS-COV-2 Spike RBD lineages identified from the California sewershed. Amino acid changes similar to or identical to changes in Omicron (BA.1) are indicated.
Fig. 7S1 amplifications. A. Overview of the SARS-COV-2 Spike S1 lineages in the Alpha, Delta, Omicron VOCs and six of the sewersheds with cryptic lineages. S1 amplifications were sequenced by subcloning (SC) and Sanger sequencing, or were sequenced using a PacBio (PB) deep sequencing. B. Plot of the number of synonymous and non-synonymous changes in the S1 sequences shown.
Cryptic lineage whole genome sequences from nationwide surveys.
| SRA Accession | State | Submitter | Sample Date | Percent cryptic lineage | Genome coverage | Sewershed | PANGO assignment | RBD Changes |
|---|---|---|---|---|---|---|---|---|
|
| CA | Aquavitas | 2021-01-04 | 7% | 27,403 | n/a[ | NDb | E484A/Q498H/H519N |
|
| NJ | Aquavitas | 2021-01-18 | 7% | 28,185 | n/a[ | NDb | E484A/Q498H/H519N |
|
| NJ | Aquavitas | 2021-01-18 | 7% | 27,295 | n/a[ | NDb | E484A/Q498H/H519N |
|
| NJ | Aquavitas | 2021-01-04 | 100% | 15,217 | n/a[ | NDb | E484A/Q498H/H519N |
|
| NY | Biobot Analytics | 2021-08-17 | 79% | 28,227 | NY2 | B.1.503 | Y449P/E484A/F490Y/Q498H |
|
| NY | Biobot Analytics | 2021-08-09 | 92% | 24,595 | NY11 | B.1.503 | K417T/K444T/Y449H/N460K/E484A/F490Y/Q498H |
|
| NY | Biobot Analytics | 2021-08-09 | 100% | 11,877 | NY11 | NDb | K417T/K444T/Y449H/N460K/E484A/F490Y/Q498H/A570D |
|
| NY | Biobot Analytics | 2021-07-12 | 99% | 24,001 | NY10 | B.1 | Q493K/Q498Y/H519N/T572N) |
|
| NY | Biobot Analytics | 2021-07-05 | 100% | 22,316 | NY11 | P.1.15 | K417T/K444T/Y449H/E484A/F490Y/Q498H |
|
| NY | Biobot Analytics | 2021-07-04 | 100% | 28,634 | NY10 | B.1 | Q493K/Q498/H519N/T572N |
|
| NY | Biobot Analytics | 2021-06-28 | 100% | 12,209 | NY2 | NDb | K444S/V445K/G446V/Y449R/L452Q/N460K/K462R/S477N/T478E/T478R/DEL483/E484P/F486I/F490P/G496S/Q498Y/P499S/N501T/Y505H/V511I |
|
| NY | Biobot Analytics | 2021-06-28 | 98% | 16,281 | NY14 | NDb | K417T/K444S/DEL445-6/L452R/N460K/S477D/F486V/Q493K/Q498Y/P499S/N501T |
|
| NY | Biobot Analytics | 2021-06-28 | 30% | 21,974 | NY11 | B.1 | N440K/K444S/DEL445-6/L452Q/Y453F/N460K/S477N/D484/F486A/Q493K/Q498K/P499S/N501Y/H519N |
|
| NY | Biobot Analytics | 2021-06-16 | 99% | 21,152 | NY11 | A.29 | K444T/Y449H/E484A/Y489Y/F490Y/Q498H |
|
| NY | Biobot Analytics | 2021-06-16 | 100% | 15,593 | NY10 | NDb | E484A/F486P/S494/Q498Y/H519N |
n/a = not available;
ND = none designated
Fig. 8Polymorphisms from wastewater genomes. Shown are all mutations present in at least three of the whole genome sequences from NYC listed in Table 2 and their corresponding amino acid changes. First column lists the prevalence of each mutation among all patients samples collected in June 2021 from New York. Each other column lists the prevalence of each mutation in each of the genome sequences.