| Literature DB >> 36159190 |
Lenore Pipes1, Zihao Chen2, Svetlana Afanaseva1, Rasmus Nielsen1,3.
Abstract
Wastewater surveillance has become essential for monitoring the spread of SARS-CoV-2. The quantification of SARS-CoV-2 RNA in wastewater correlates with the Covid-19 caseload in a community. However, estimating the proportions of different SARS-CoV-2 haplotypes has remained technically difficult. We present a phylogenetic imputation method for improving the SARS-CoV-2 reference database and a method for estimating the relative proportions of SARS-CoV-2 haplotypes from wastewater samples. The phylogenetic imputation method uses the global SARS-CoV-2 phylogeny and imputes based on the maximum of the posterior probability of each nucleotide. We show that the imputation method has error rates comparable to, or lower than, typical sequencing error rates which substantially improves the reference database and allows for accurate inferences of haplotype composition. Our method for estimating relative proportions of haplotypes uses an initial step to remove unlikely haplotypes and an Expectation-Maximization (EM) algorithm for obtaining maximum likelihood estimates of the proportions of different haplotypes in a sample. Using simulations with a reference database of >3 million SARS-CoV-2 genomes, we show that the estimated proportions reflect the true proportions given sufficiently high sequencing depth.Entities:
Keywords: COVID-19; Expectation-Maximization; Imputation; SARS-CoV-2; Wastewater surveillance; Wastewater-based epidemiology
Year: 2022 PMID: 36159190 PMCID: PMC9485417 DOI: 10.1016/j.crmeth.2022.100313
Source DB: PubMed Journal: Cell Rep Methods ISSN: 2667-2375