| Literature DB >> 35816422 |
Caleb Ki1, Jonathan Terhorst1.
Abstract
The ongoing global pandemic has sharply increased the amount of data available to researchers in epidemiology and public health. Unfortunately, few existing analysis tools are capable of exploiting all of the information contained in a pandemic-scale data set, resulting in missed opportunities for improved surveillance and contact tracing. In this paper, we develop the variational Bayesian skyline (VBSKY), a method for fitting Bayesian phylodynamic models to very large pathogen genetic data sets. By combining recent advances in phylodynamic modeling, scalable Bayesian inference and differentiable programming, along with a few tailored heuristics, VBSKY is capable of analyzing thousands of genomes in a few minutes, providing accurate estimates of epidemiologically relevant quantities such as the effective reproduction number and overall sampling effort through time. We illustrate the utility of our method by performing a rapid analysis of a large number of SARS-CoV-2 genomes, and demonstrate that the resulting estimates closely track those derived from alternative sources of public health data.Entities:
Keywords: birth-death model; pandemic-scale; phylodynamics; phylogenetics
Mesh:
Year: 2022 PMID: 35816422 PMCID: PMC9348775 DOI: 10.1093/molbev/msac154
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 8.800
Fig. 1.Median of the medians and the equal-tailed 95% credible intervals of the posteriors of the effective reproductive number over time of the 10 simulations for each scenario using VBSKY and BEAST. The dotted line is the true effective reproductive number over time.
Prior Distributions Used in Analyses.
| Analysis |
|
|
|
|
|
|---|---|---|---|---|---|
| Uninformative Smoothing | LogN(1,1) | Beta(0.02, 0.98) | Gamma(0.001, 0.001) | Gamma(0.001, 0.001) | LogN(−1.2, 0.1) |
| Less Smoothing | LogN(1,1) | Beta(20, 980) | Gamma(10, 100) | Gamma(10, 100) | LogN(−1.2, 0.1) |
| Biased/Cluster Sampling | LogN(1,1) | Beta(20, 980) | Gamma(0.001, 0.001) | Gamma(0.001, 0.001) | — |
| Multistrain | LogN(0,1) | Beta(2, 98) | Gamma(10000, 0.01) | Gamma(10000, 0.01) | — |
Fig. 2.Posterior of for Florida, Michigan, and the USA using biased sampling and a strong prior on . For each method the posterior median and equal-tailed 95% credible interval are shown. The dotted line is .
Fig. 3.The posterior median and equal-tailed 95% credible interval of for the Alpha, Delta, Omicron variants.
Variational Bayesian Skyline (VBSKY)
|
|
|
|
| Sample with replacement |
| Estimate the tree topology |
|
|
| Initialize |
|
|
| |
| Draw M samples |
| Approximate |
| Update |
| Update |
| |
|
|
|
|