| Literature DB >> 29880306 |
Denise Kühnert1, Mireia Coscolla2, Daniela Brites2, David Stucki2, John Metcalfe3, Lukas Fenner4, Sebastien Gagneux2, Tanja Stadler5.
Abstract
The fast evolution of pathogenic viruses has allowed for the development of phylodynamic approaches that extract information about the epidemiological characteristics of viral genomes. Thanks to advances in whole genome sequencing, they can be applied to slowly evolving bacterial pathogens like Mycobacterium tuberculosis. In this study, we investigate and compare the epidemiological dynamics underlying two M. tuberculosis outbreaks using phylodynamic methods. Specifically, we (i) test if the outbreak data sets contain enough genetic variation to estimate short-term evolutionary rates and (ii) reconstruct epidemiological parameters such as the effective reproduction number. The first outbreak occurred in the Swiss city of Bern (1987-2012) and was caused by a drug-susceptible strain belonging to the phylogenetic M. tuberculosis Lineage 4. The second outbreak was caused by a multidrug-resistant (MDR) strain of Lineage 2, imported from the Wat Tham Krabok (WTK) refugee camp in Thailand into California. There is little temporal signal in the Bern data set and moderate temporal signal in the WTK data set. Thanks to its high sampling proportion (90%) the Bern outbreak allows robust estimation of epidemiological parameters despite the poor temporal signal. Conversely, there is much uncertainty in the epidemiological estimates concerning the sparsely sampled (9%) WTK outbreak. Our results suggest that both outbreaks peaked around 1990, although they were only recognized as outbreaks in 1993 (Bern) and 2004 (WTK). Furthermore, individuals were infected for a significantly longer period (around 9 years) in the WTK outbreak than in the Bern outbreak (4-5 years). Our work highlights both the limitations and opportunities of phylodynamic analysis of outbreaks involving slowly evolving pathogens: (i) estimation of the evolutionary rate is difficult on outbreak time scales and (ii) a high sampling proportion allows quantification of the age of the outbreak based on the sampling times, and thus allows for robust estimation of epidemiological parameters.Entities:
Keywords: Epidemic dynamics; Phylodynamic analysis; Transmission dynamics; Tuberculosis outbreak
Mesh:
Year: 2018 PMID: 29880306 PMCID: PMC6227250 DOI: 10.1016/j.epidem.2018.05.004
Source DB: PubMed Journal: Epidemics ISSN: 1878-0067 Impact factor: 4.396
Bayesian prior distributions used for phylodynamic analysis.
| Mean substitution rate θ | Standard deviation σ | Effective reproduction number Re | Recovery rate δ | Exposed rate σ | Sampling proportion s | Origin of sample | Removal (upon sampling) probability r | |
|---|---|---|---|---|---|---|---|---|
| Unif(0,∞) | Exp(0.33) | LogN(0,1) | LogN | – | Beta(45,5) | Unif(0,40) | Unif(0,1) | |
| Unif(0,∞) | Exp(0.33) | LogN(0,1) | Fixed to 2, 4 or 6 | LogN(exp(m),1) | Beta(45,5) | Unif(0,40) | Unif(0,1) | |
| Unif(0,∞) | Exp(0.33) | LogN(0,1) | LogN(exp(0.5) | – | Beta(10,90) | Unif(0,100) | Unif(0,1) |
Mean determined to correspond to an infected duration (i.e. the sum of the exposed and infectious periods) of 2 years.
Fig. 1Plot of phylogenetic Root-to-tip distance relative to sampling time (TempEst). Each dot represents one sample per data set (left: Bern, right: WTK).
Fig. 2Bern reproduction number through time (BDSKY).
The median effective reproduction number (black line) with its 95% highest posterior density (HPD) interval (shaded area). The grey bars display a histogram of the number of cases diagnosed per year.
Bayesian Posterior results of phylodynamic analyses of both outbreaks.
| Mean number of SNPs per genome per year | Standard deviation σ | Effective reproduction number Re | Effective reproduction number Re | Recovery rate δ | Exposed rate σ | Sampling proportion s | Time of epidemic origin of sample | Removal (upon sampling) probability r | |
|---|---|---|---|---|---|---|---|---|---|
| 0.72 | 0.95 | 2.28 | 0.24 | 2 (fixed) | 0.25 | 0.87 | 1986.75 | 0.98 | |
| 0.80 | 1.00 | 2.25 | 0.22 | 4 (fixed) | 0.24 | 0.85 | 1987.08 | 0.98 | |
| 0.83 | 1.01 | 2.23 | 0.24 | 6 (fixed) | 0.24 | 0.83 | 1987.2 | 0.97 | |
| 0.55 | 0.90 | See | 0.20 | NA | 0.90 | 1986.39 | 0.98 | ||
| 0.36 | 0.27 | See | 0.13 | NA | 0.08 | 1975.58 | 0.49 | ||
Fig. 3Bern maximum clade credibility tree.
Fig. 4WTK reproduction number through time (BDSKY).
The median effective reproduction number (black line) with its 95% highest posterior density (HPD) interval (shaded area). The grey bars display a histogram of the number of cases diagnosed per year.
Fig. 5Hmong maximum clade credibility tree.
Overview of marginal likelihood estimates from Path Sampling analyses.
| Data | Demographic Model | Clock Model | Marginal Likelihood |
|---|---|---|---|
| WTK | BDSKY | UCLD | |
| WTK | BSP | UCLD | −5411683.47 |
| WTK | ConstCoal | UCLD | −5411701.334 |
| WTK | BDSKY | SC | −5411679.402 |
| WTK | BDSKY | UCED | −5411681.248 |
| Bern | BDSKY | UCLD | |
| Bern | MTBD | UCLD | −5236177.83 |
| Bern | BSP | UCLD | −5422820.42 |
| Bern | ConstCoal | UCLD | −5422845.13 |
| Bern | BDSKY | SC | −5422841.42 |
| Bern | BDSKY | UCED | −5422827.40 |
The highest marginal likelihood estimate for each data set is shown in bold.