| Literature DB >> 31058982 |
Timothy G Vaughan1,2,3, Gabriel E Leventhal4,5, David A Rasmussen2,6,7, Alexei J Drummond1,8, David Welch1,8, Tanja Stadler2,3.
Abstract
Modern phylodynamic methods interpret an inferred phylogenetic tree as a partial transmission chain providing information about the dynamic process of transmission and removal (where removal may be due to recovery, death, or behavior change). Birth-death and coalescent processes have been introduced to model the stochastic dynamics of epidemic spread under common epidemiological models such as the SIS and SIR models and are successfully used to infer phylogenetic trees together with transmission (birth) and removal (death) rates. These methods either integrate analytically over past incidence and prevalence to infer rate parameters, and thus cannot explicitly infer past incidence or prevalence, or allow such inference only in the coalescent limit of large population size. Here, we introduce a particle filtering framework to explicitly infer prevalence and incidence trajectories along with phylogenies and epidemiological model parameters from genomic sequences and case count data in a manner consistent with the underlying birth-death model. After demonstrating the accuracy of this method on simulated data, we use it to assess the prevalence through time of the early 2014 Ebola outbreak in Sierra Leone.Entities:
Keywords: Bayesian phylogenetics; epidemiology; particle filter; phylodynamics
Mesh:
Year: 2019 PMID: 31058982 PMCID: PMC6681632 DOI: 10.1093/molbev/msz106
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1.The true epidemiological trajectory can be inferred from the reconstructed phylogeny. (a) The trajectory of an epidemic outbreak consists of a sequence of events (infection, sampling, and recovery) e at times t that result in a corresponding sequence of compartment occupancies such as the infectious compartment occupancies I. (b) The full transmission tree contains information on when infections happened and between which lineages (filled squares) and when individuals were removed (filled circles). The sampled transmission tree represents a subset of the full tree (red). The rest of the transmission tree is unobserved (blue). (c) The time ordered observations consist of the events o seen on the tree (infection, sampling with removal, and sampling without removal) at times τ, combined with the number of lineages on the sampled tree in the intervals immediately before each of these events. (d) There is an ensemble of trajectories that are compatible with the sampled transmission tree. Note that the sampled transmission tree contains only a subset of the events represented by the full tree and true trajectory , and each of these “observed” events must be present in every compatible trajectory.
. 2.Comparison between values of the phylodynamic likelihoods computed using the PMMH algorithm with those calculated using other approaches: (a) likelihood of r under the linear birth–death model from PMMH compared with the analytical result (Stadler 2010) and (b) likelihood of β under the stochastic SIS model from PMMH compared with a numerical result from ExpoTree (Leventhal et al. 2014).
. 3.Marginal posteriors for the infection rate as a function of the fraction f of samples regarded as “sequenced” when no data besides the sampling times are available. The invariance of this distribution with respect to f shows that the treatment of unsequenced samples is consistent with the treatment of sequenced samples.
. 4.Inference of prevalence dynamics from sequence data simulated under (a) linear birth–death, (b) stochastic SIS, and (c) stochastic SIR model. Samples from the posterior of the prevalence trajectory are shown in red, whereas the black line represents the truth. The blue lines are prevalence trajectories simulated from the posterior samples of the compartmental model parameters.
Fixed Parameter Values Used for Well-Calibrated Trajectory Inference Validation.
| Model |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Linear birth–death | 0.5 | 0.1 | — | 0.25 | 0.0 | 0.0 | 10.0 |
| SIS | 0.02 | 1.0 | 199 | 0.1 | 0.0 | 0.0 | 5.0 |
| SIR | 0.02 | 1.0 | 199 | 0.1 | 0.0 | 0.0 | 5.0 |
. 5.Proportion of simulated data analyses which included the true prevalence in their % HPD intervals, for alignments simulated under each of the (a) linear birth–death, (b) SIS, and (c) SIR models. Colors represent the distinct times at which the coverage fractions were computed, and the insets indicate where these times fall in relation to the approximate deterministic prevalence curves. The linear relationship between the relative inclusion frequencies and α indicates that the PMMH algorithm is correctly sampling from the posterior prevalence distribution under each of these models.
. 6.(a, b) Jointly inferred posterior distributions (red) and unconditioned simulated distributions (blue) for (a) infected host count and (b) effective reproduction number during the Kailahun EVD outbreak. (c) Posterior distribution of infected host count per 105 hosts (prevalence). (d) Expected number of new EVD infections per susceptible host per week (incidence). (e) Comparison of inferred number of infected hosts using all data (red curves) and only the first 4 weeks of sequence data (brown curves). (f) Temporal distribution of EBOV cases used in the full analysis, both sequenced (turquoise) and unsequenced (orange). The vertical dashed line in (f) indicates the end of the 4-week period of sequence data used to infer the brown trajectories in (e).
Parameter Priors Distributions Used in and 95% HPD Intervals Derived from Our Analysis of EBOV Genomes Sampled from the 2014 EVD Outbreak in Kailahun.
| Parameter | Unit | Prior Distribution | Posterior 95% HPD | |
|---|---|---|---|---|
| Lower | Upper | |||
|
| Year−1 |
|
|
|
|
| — |
| 576 | 1,390 |
|
| Year−1 |
| 16 | 36 |
|
| Year |
| 0.64 (May 5) | 0.83 (Feb 25) |
Note.—While T is the time difference between the start of the outbreak and the end of the observation period, for a given time of cessation of observation it implies the absolute time of the start of the outbreak, which we provide in the bracketed (2014) dates.