| Literature DB >> 33216809 |
Kristina M Ceres1, Ynte H Schukken2, Yrjö T Gröhn1.
Abstract
Infectious disease management relies on accurate characterization of disease progression so that transmission can be prevented. Slowly progressing infectious diseases can be difficult to characterize because of a latency period between the time an individual is infected and when they show clinical signs of disease. The introduction of Mycobacterium avium ssp. paratuberculosis (MAP), the cause of Johne's disease, onto a dairy farm could be undetected by farmers for years before any animal shows clinical signs of disease. In this time period infected animals may shed thousands of colony forming units. Parameterizing trajectories through disease states from infection to clinical disease can help farmers to develop control programs based on targeting individual disease state, potentially reducing both transmission and production losses due to disease. We suspect that there are two distinct progression pathways; one where animals progress to a high-shedding disease state, and another where animals maintain a low-level of shedding without clinical disease. We fit continuous-time hidden Markov models to multi-year longitudinal fecal sampling data from three US dairy farms, and estimated model parameters using a modified Baum-Welch expectation maximization algorithm. Using posterior decoding, we observed two distinct shedding patterns: cows that had observations associated with a high-shedding disease state, and cows that did not. This model framework can be employed prospectively to determine which cows are likely to progress to clinical disease and may be applied to characterize disease progression of other slowly progressing infectious diseases.Entities:
Mesh:
Year: 2020 PMID: 33216809 PMCID: PMC7678993 DOI: 10.1371/journal.pone.0242683
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1MAP fecal shedding patterns from five seven cows.
Shedding patterns were variable among individuals. The number of samples taken from each cow was also variable, ranging from 2 to 11 samples. The variation in both CFU count and number of samples may be related to disease progression, as animals with high CFU counts may be culled early due to clinical MAP infection related production losses.
Fig 2Diagram of hidden Markov model 1.
This diagram represents a hidden Markov model with three hidden states labeled 0, 1, and 2. The model is shown with initial state probabilities, π0, time dependent transition probabilities, P and emissions, E, which have gamma distributions.
Descriptions and initial values of parameters in three hidden state models.
| Probability of starting in each hidden state k | Unif(0,1) | |
| Gamma distribution shape parameter for state k | Unif(1,20) | |
| Gamma distribution scale parameter for state k | Unif(0.01, 1) | |
| Transition rate from state j to state k | Unif(0.01,5) |
Four hidden Markov model structures with two, three, four or five hidden states were initialized with 250 sets of random parameter values drawn from uniform distributions. We chose parameter values for transition rates and emission distributions that would produce reasonable state transition probabilities and gamma distributions to fit the observed CFU values. We allowed the initial state probabilities to range from 0 to 1 reflecting our limited knowledge on the proportion of animals in each underlying disease state.
Model comparison using AIC.
| 3 State | 14 | -95.83 | 1 | 0.993 | |
| 2 State | 7 | -107.86 | 229.72 | 6.5 E -3 | 0.006 |
| 4 State | 23 | -106.24 | 258.48 | 3.7 E -9 | 3.7 E -9 |
| 5 State | 34 | -104.38 | 276.76 | 4.0 E -13 | 4.0 E -13 |
Rel. L represents the relative log likelihood for model i and is calculated as where is AIC the minimum AIC value in the candidate set. w(AIC) is the AIC weight for model i and is calculated as . p is the number of parameters in model i, and log(L) is the log likelihood for model i.
Transition and emission probability estimates.
| Two state models | |||
| Probability of starting in state 0 | 0.45 | (0.30, 0.64) | |
| Gamma distribution shape parameter for state 0 | 17.56 | (4.81, 26.33) | |
| Gamma distribution shape parameter for state 1 | 17.86 | (13.86, 184.37) | |
| Gamma distribution scale parameter for state 0 | 0.14 | (0.09, 0.82) | |
| Gamma distribution scale parameter for state 1 | 0.39 | (0.04, 0.48) | |
| Gamma distribution mean for state 0 | 2.44 | (2.30, 3.95) | |
| Gamma distribution mean for state 1 | 6.87 | (6.49, 8.03) | |
| Transition rate from state 0 to 1 | 0.03 | (0.01, 2.90) | |
| Transition rate from state 1 to 0 | 2.00e-3 | (1.22e-07, 1.02) | |
| Three state models | |||
| Probability of starting in state 0 | 0.43 | (0.29, 0.62) | |
| Probability of starting in state 1 | 0.21 | (4.11e-36, 0.39) | |
| Gamma distribution shape parameter for state 0 | 19.41 | (4.79, 30.07) | |
| Gamma distribution shape parameter for state 1 | 16.24 | (12.29, 15307.72) | |
| Gamma distribution shape parameter for state 2 | 173.20 | (36.00, 239.35) | |
| Gamma distribution scale parameter for state 0 | 0.12 | (0.08, 0.78) | |
| Gamma distribution scale parameter for state 1 | 0.35 | (3.27e-4,0.39) | |
| Gamma distribution scale parameter for state 2 | 0.05 | (0.03, 0.20) | |
| Gamma distribution mean for state 0 | 2.40 | (2.17, 3.75) | |
| Gamma distribution mean for state 1 | 5.64 | (3.83, 6.18) | |
| Gamma distribution mean for state 2 | 7.80 | (7.14, 8.12) | |
| Transition rate from state 0 to 1 | 0.03 | (0.01, 0.98) | |
| Transition rate from state 0 to 2 | 1.13e-40 | (1.89e-46, 0.48) | |
| Transition rate from state 1 to 0 | 4.65e-39 | (1.86e-131, 1.46) | |
| Transition rate from state 1 to 2 | 0.03 | (0.02, 6.15) | |
| Transition rate from state 2 to 0 | 3.00e-3 | (5.12e-143, 0.60) | |
| Transition rate from state 2 to 1 | 1.50e-06 | (1.50e-8, 1.49) | |
Parameter estimates for the two and three hidden disease models were generated using the modified Baum-Welch Expectation Maximization algorithm and bootstrap confidence intervals, using initial values in S1 Table.
Fig 3Transient distributions and emission distributions in two and three hidden state models.
The transient distribution for each Markov chain was calculated for time t = 0,..,300 weeks (A). Fitted emission distributions are shown in (C), and the observed data are shown in grey (B). The number of random samples used to create each plotted state emission distribution was proportional to the stationary distribution value for that hidden state.
Fig 4Progression patterns through disease states.
Progression patterns of individual cows through disease states estimated using posterior decoding are shown for both the two state and the three state models (A). The color of the line indicates the state with the highest posterior probability at the last observation. The lines were jittered vertically so that overlapping lines could be distinguished. The empirical and predicted log(CFU) MAP value from seven representative cows shows variability in fecal shedding over the sampling period, with some cows shedding very high CFU counts over the sampling period and others maintaining a lower CFU count (B) The median, and 10–90 percentile range for the predicted log(CFU) distributions are shown as points and bars, respectively. The predicted CFU values are colored according to the fitted state emission distribution for each sample. The empirical log(MAP) CFU counts are shown in black.