| Literature DB >> 30296266 |
Edwin Michael1, Swarnali Sharma1, Morgan E Smith1, Panayiota Touloupou2, Federica Giardina3, Joaquin M Prada4, Wilma A Stolk3, Deirdre Hollingsworth5, Sake J de Vlas3.
Abstract
BACKGROUND: Mathematical models are increasingly being used to evaluate strategies aiming to achieve the control or elimination of parasitic diseases. Recently, owing to growing realization that process-oriented models are useful for ecological forecasts only if the biological processes are well defined, attention has focused on data assimilation as a means to improve the predictive performance of these models. METHODOLOGY AND PRINCIPALEntities:
Mesh:
Year: 2018 PMID: 30296266 PMCID: PMC6175292 DOI: 10.1371/journal.pntd.0006674
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Annual mf prevalence survey and MDA data for three LF endemic sites.
| IVM+ALB | DEC | DEC+IVM | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Anopheles | Culex | Anopheles | |||||||
| 2090 | 20000 [ | 8194 | |||||||
| Year (Survey/ | Mf Prev | Total Population MDA Cov. | Year | Mf Prev | Total Population MDA Cov. | Year | Mf Prev | Total Population MDA Cov. | |
| Sept 2004/ Oct 2004 | 26.1% (471) | 72% | Nov 1994 | 17.2% (230) | 48% | 1994 | 66.7% (63) | 50% | |
| Jan 2006/ Feb 2006 | 20.8% (461) | 70% | May 1995 | 18.5% (230) | 48% | 1995 | 61.5% (65) | 78% | |
| Jan 2007/ May 2007 | 15.8% (438) | 62% | Aug 1996 | 14.5% (230) | 48% | 1996 | 20.5% (88) | 75% | |
| Oct 2008/ Feb 2009 | 12.9% (302) | 59% | Nov 1997 | 11.8% (230) | 48% | 1997 | 13.5% (89) | 68% | |
| Oct 2009/ Nov 2009 | 5.0% (259) | 76% | Feb 1999 | 12.2% (230) | 48% | 1998 | 5.4% (92) | 72% | |
| Nov 2010/ Dec 2010 | 4.4% (400) | 60% | April 2000 | 4.9% (230) | 48% | 1999 | 3.7% (109) | - | |
| Nov 2011 | 2.7% (393) | April 2001 | 4.2% (230) | - | - | - | - | ||
aDrug efficacy assumptions are listed as instantaneous mf kill rate/duration of sterilization in months [1]
bTransmission in Kirare is by both Anopheles and Culex mosquitoes, but models based on the dominant species (Anopheles) were used in this study. The ABR represents the combined biting rate [45].
cIn the model simulations, the allowed ABR range was informed by the observed ABRs reported here.
dThe “Mf Prev” columns denote the prevalence for a given year which was surveyed right before the MDA given in that year at the coverage reported in the column “MDA Cov”. Some mid-treatment surveys in Kirare, Tanazania do not follow this pattern exactly, so for that site the time of the mf survey and the time of the treatment of that year are given explicitly. The survey and MDA times are reflected in the model simulations.
eThe number tested represented those tested for CFA. Only those positive for CFA were tested for mf. The expected number of mf positives in the total sample were calculated as [number positive for CFA]x[number positive for mf]/[number of CFA positives examined for mf] as given in [45].
fThe total coverage was calculated using annual population sizes and coverage of the eligible population (≥ 5 years old) given in [45] and the fraction of individuals ≥ 5 years old calculated from [48].
gThe number of individuals sampled is reported as a random 7% of households which we assume here to represent 7% of the total population.
hMDA coverage reported as ranging between 50–71% of the eligible population throughout the programme in [46]. The average total population coverage calculated as [average coverage]x[proportion of the population eligible for treatment] based on figures given in [46] was modelled.
Fig 1Schematic diagram showing the sequential fitting procedure for updating models and predictions by incorporating longitudinal data.
In all scenarios, the initial EPIFIL models were initialized with parameter priors and a chi-square fitting criterion was applied to select those models which represent the baseline mf prevalence data sufficiently well (α = 0.05). The accepted models were then used to simulate the impact of interventions on mf prevalence. The chi-square fitting criterion was sequentially applied to refine the selection of models according to the post-MDA mf prevalence data included in the fitting scenario. The fitted parameters from selection of acceptable models at each data point were used to predict timelines to achieve 1% mf prevalence. The scenarios noted in the blue boxes indicate the final relevant updating step before using the fitted parameters to predict timelines to achieve 1% mf in that data fitting scenario.
Model predictions of timelines to achieve 1% mf prevalence and corresponding information metrics.
| Model | Site | Scenario | No. of accepted models | Median no. of years | Weighted variance (significance | Entropy | Relative |
|---|---|---|---|---|---|---|---|
| EPIFIL | Kirare | 0 | 865 | 9 (6–19)1,2,3,4 | 14.711,2,3,4 | 3.511,2,3,4 | - |
| 1 | 829 | 8 (6–17)0,2,3,4 | 10.370,3,4 | 3.130,2,3,4 | 12.06 | ||
| 2 | 117 | 14 (11–21)0,1 | 8.660,4 | 3.270,1,4 | 6.84 | ||
| 3 | 105 | 14 (11–18)0,1 | 5.820,1 | 3.060,1,4 | 12.82 | ||
| Alagramam | 0 | 15098 | 10 (7–23)1,2,3,4 | 19.621,2,3,4 | 3.691,2,3,4 | - | |
| 1 | 16410 | 9 (7–21) 0,2,3,4 | 14.350,3,4 | 3.530,2,3,4 | 4.34 | ||
| 2 | 11026 | 11 (8–22)0,2,3,4 | 14.440,3,4 | 3.590,1,3,4 | 2.71 | ||
| 3 | 10351 | 11 (8–19)0,1,2,4 | 9.600,1,2,4 | 3.380,1,2,4 | 8.4 | ||
| Peneng | 0 | 4610 | 12 (6–29)1,2,3,4 | 38.241,2,3,4 | 4.291,2,3,4 | - | |
| 1 | 4255 | 10 (6–25)0,2,3,4 | 26.920,2,3,4 | 4.020,2,3,4 | 6.29 | ||
| 2 | 2714 | 10 (7–17)0,1,3,4 | 8.370,1,3,4 | 3.290,1,3,4 | 23.31 | ||
| 4 | 2728 | 8 (6–12)0,1,2,3 | 3.860,1,2,3 | 2.80,1,2,3 | 34.73 | ||
| LYMFASIM | Kirare | 0 | 6471 | 11 (7–28)1,2 | 35.311,2,3,4 | 4.192,3,4 | - |
| 1 | 901 | 10 (6–34)0,2 | 50.910,2,3,4 | 4.202,3,4 | -0.24 | ||
| 2 | 363 | 13 (10–20)0,1,3,4 | 9.500,3,4 | 3.310,1,3,4 | 21.00 | ||
| 4 | 245 | 11 (9–14)2 | 2.020,1,2 | 2.410,1,2 | 42.48 | ||
| Alagramam | 0 | 6903 | 12 (9–21)1,2,3,4 | 15.461,2,3,4 | 3.383,4 | - | |
| 1 | 2906 | 11 (9–22)0,2,3,4 | 20.440,3,4 | 3.373,4 | 0.30 | ||
| 2 | 2148 | 13 (10–24)0,1,3,4 | 22.380,3,4 | 3.453,4 | -2.07 | ||
| 3 | 1966 | 12 (10–19)0,1,2,4 | 11.110,1,2,4 | 2.870,1,2 | 15.09 | ||
| Peneng | 0 | 4195 | 12 (7–26)2,3,4 | 32.022,3,4 | 4.262,3,4 | - | |
| 1 | 3772 | 12 (6–26)2,3,4 | 30.862,3,4 | 4.242,3,4 | 0.47 | ||
| 2 | 1531 | 10 (7–13)0,1 | 2.220,1 | 2.530,1 | 40.61 | ||
| 4 | 1655 | 10 (7–13)0,1 | 2.330,1 | 2.560,1 | 39.91 | ||
| TRANSFIL | Kirare | 0 | 6866 | 13 (7–43)1,2,3,4 | 81.781,2,3,4 | 4.661,2,3,4 | - |
| 1 | 17625 | 11 (7–27)0,2 | 32.620.2,3,4 | 4.000,2,3,4 | 14.16 | ||
| 2 | 6414 | 13 (10–26)0,1,3,4 | 22.260,1,3,4 | 3.500,1,3,4 | 24.89 | ||
| 3 | 2108 | 11 (9–15)2 | 3.190,1,2,4 | 2.560,1,2,4 | 45.06 | ||
| Alagramam | 0 | 9666 | 15 (9–42)2,3,4 | 72.861,2,3,4 | 4.601,2,3,4 | - | |
| 1 | 9109 | 15 (9–50)2,3,4 | 155.570,3,4 | 4.520,2,3,4 | 1.74 | ||
| 2 | 5555 | 18 (11–50)0,1,3,4 | 146.860,3,4 | 4.590,1,3,4 | 0.22 | ||
| 4 | 383 | 11 (10–15)0,1,2 | 5.330,1,2,3 | 2.460,1,2,3 | 46.52 | ||
| Peneng | 0 | 7014 | 21 (8–48)1,2,3,4 | 100.371,2,3,4 | 5.161,2,3,4 | - | |
| 1 | 55425 | 16 (7–41)0,2,3,4 | 70.430,2,3,4 | 4.810.2,3,4 | 6.78 | ||
| 2 | 8892 | 10 (6–22)0,1,3,4 | 15.990,1,3,4 | 3.770,1,3,4 | 26.94 | ||
| 4 | 13922 | 11 (7–22)0,1,2,3 | 14.990,1,2 | 3.700,1,2,3 | 28.29 |
The lowest entropy scenario for each site is bolded and shaded grey. Additional scenarios shaded grey are not significantly different from the lowest entropy scenario.
#Scenario 0: model-only; Scenario 1: baseline data; Scenario 2: baseline + post-MDA 3 data; Scenario 3: baseline + post-MDA 3 + post-MDA 5 data; Scenario 4: baseline + post-MDA 5 data
*For each pair of scenarios, pairwise F-tests for equality of variance were performed to compare the weighted variance, differential Shannon entropy tests were performed to compare the entropy, and Kruskal-Wallis multiple comparison tests were performed to compare medians. Pairwise significance is represented by reporting those scenarios which are statistically significantly different from each other by numbers (0–4) as superscripts. For example, the weighted variance for scenario 0 for Kirare has the superscript numbers (1–4) to indicate that the weighted variance for scenario 0 is significantly different from the weighted variance for scenarios 1–4. Significance was determined using the Benjamini-Hochberg procedure for controlling the false discovery rate (q = 0.05) in all pairwise statistical tests.
+information gained by each data stream (scenario 1–4) are presented in comparison to the information contained in the model-only simulation (scenario 0)
Fig 2Comparison of the distributions of predicted timelines to LF elimination from the three models for Kirare, Tanzania.
This visual comparison shows that the predictions coming from the model-only simulations (scenario 0) have the widest spread in their distributions for all three models compared to model predictions obtained via constraining using subsequent data scenarios. Pairwise Kolmogorov-Smirnov tests for equal distributions were performed on the results from each model to evaluate whether updating the models with sequential data changed the distribution of predictions. Significance was determined using the Benjamini-Hochberg procedure for controlling the false discovery rate (q = 0.05). Apart from scenarios 2 and 3 for EPIFIL and scenarios 3 and 4 for LYMFASIM, all distributions were significantly different from one another (see S2 Supplementary Information for results from the villages of Alagramam and Peneng).
Fig 3Comparison of model-predicted timelines from model-only simulations and the lowest entropy simulations in each site.
The boxplots show that by calibrating the models to data streams, more precise predictions are able to be made regarding timelines to achieve 1% mf prevalence across all models and sites. The results of pairwise F-tests for variance, performed to compare the weighted variance in timelines to achieve 1% mf prevalence between model-only simulations (scenario 0) and the lowest entropy simulations (best scenario) (see Table 2), show that the predictions for the best scenarios are significantly different from the predictions for the model-only simulations. Significance was determined using the Benjamini-Hochberg procedure for controlling the false discovery rate (q = 0.05). For EPIFIL, LYMFASIM and TRANSFIL, the best scenarios are scenarios 4, 3, and 4 for Kirare, scenarios 4, 4, and 3 for Alagramam, and scenarios 3, 3, and 3 for Peneng, respectively.
Fig 4Pooled predictions of the timelines to reach 1% mf from three LF models.
The shaded regions show the weighted 95% percentile interval from the composite predictions of all three models of the timelines required to cross the WHO 1% elimination threshold for all five scenarios. The black dots indicate upper and lower bounds (weighted 2.5th and 97.5th percentiles) of the composite predictions from all three models for each scenario. The range of predictions is tightest when the models were constrained with data from scenarios 3 and 4.
Fig 5Parameter constraint achieved through the coupling of EPIFIL with data.
Overall parameter constraint was measured as the ratio of the mean standard deviation of the fitted parameter distributions to that of the prior parameter distributions. Values < 1 indicate that the fitted parameter space was constrained compared to the prior parameter space. The results show that the fitted parameter space for Kirare and Peneng was more constrained by calibrating the model to data compared to the model-only scenario, but this was not the case for Alagramam.
Spearman parameter correlations for scenarios 1 (lower left triangle) and 3 (upper right triangle) for Alagramam, India.
| λ | α | k0 | kLin | κ | r | σ | Ψ1 | Ψ2 | μ | γ | g | c | HLin | Ic | Sc | τ | δ | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| λ | -0.008 | 0.002 | 0.012 | 0.005 | <0.001 | 0.012 | 0.006 | -0.018 | -0.016 | |||||||||
| α | -0.005 | -0.003 | 0.005 | -0.016 | 0.013 | |||||||||||||
| k0 | -0.001 | 0.002 | 0.007 | -0.008 | -0.007 | 0.007 | 0.011 | -0.001 | -0.018 | <0.001 | 0.014 | -0.004 | 0.013 | -0.005 | -0.018 | |||
| kLin | 0.006 | 0.003 | 0.013 | -0.001 | ||||||||||||||
| κ | -0.005 | -0.004 | 0.011 | 0.001 | -0.001 | -0.012 | 0.007 | 0.004 | -0.012 | -0.011 | 0.002 | -0.009 | -0.003 | 0.008 | ||||
| r | 0.008 | -0.013 | 0.008 | 0.017 | -0.012 | 0.005 | -0.003 | 0.005 | ||||||||||
| σ | 0.012 | -0.002 | 0.011 | -0.019 | -0.010 | 0.001 | 0.003 | 0.017 | -0.012 | 0.012 | ||||||||
| Ψ1 | -0.012 | -0.011 | 0.011 | 0.010 | -0.009 | -0.013 | -0.011 | -0.004 | -0.015 | |||||||||
| Ψ2 | 0.012 | -0.009 | 0.016 | |||||||||||||||
| μ | 0.008 | 0.006 | 0.009 | 0.009 | -0.001 | 0.012 | 0.001 | -0.005 | 0.019 | 0.016 | ||||||||
| γ | -0.004 | 0.005 | -0.014 | -0.015 | -0.010 | 0.003 | 0.002 | 0.010 | ||||||||||
| g | -0.010 | -0.015 | 0.014 | 0.006 | 0.007 | 0.010 | 0.006 | -0.013 | ||||||||||
| c | -0.001 | <0.001 | -0.002 | 0.012 | 0.004 | 0.006 | -0.007 | 0.004 | 0.003 | -0.003 | -0.010 | 0.007 | ||||||
| HLin | 0.009 | 0.009 | 0.011 | 0.015 | 0.009 | 0.008 | -0.008 | 0.003 | -0.004 | 0.002 | 0.011 | 0.015 | -0.008 | 0.006 | 0.007 | |||
| Ic | 0.010 | <0.001 | -0.010 | -0.014 | -0.015 | 0.010 | 0.002 | -0.014 | 0.003 | -0.005 | ||||||||
| Sc | -0.013 | -0.002 | 0.007 | -0.005 | -0.008 | 0.004 | 0.003 | -0.005 | 0.001 | 0.005 | -0.005 | 0.003 | -0.008 | -0.015 | -0.011 | |||
| τ | -0.003 | -0.002 | -0.004 | -0.001 | 0.001 | -0.005 | 0.001 | 0.009 | -0.005 | 0.009 | -0.004 | -0.009 | 0.003 | |||||
| δ | -0.014 | -0.009 | 0.003 | 0.001 | 0.003 | 0.007 | 0.010 | -0.010 | 0.011 | 0.005 | 0.001 |
Cell formatting reflects significant correlations (bold text), correlation coefficient sign changes between the two scenarios (bold bordered cells), and more than two fold magnitude changes in the correlation coefficients between the two scenarios (blue cells indicate the correlation was stronger in scenario 1 and red cells indicate the correlation was stronger in scenario 3).
Fig 6Weighted variance and entropy values of EPIFIL predictions of LF elimination timelines using optimal MDA coverage in each study site.
For all sites, either scenario 3 or 4 had the lowest entropies, and scenario 4 was not significantly different from scenario 3 for Kirare and Alagramam. These results were not statistically different from the results given 65% coverage (see Table 2), suggesting that the data stream associated with the lowest entropy is robust to changes in the interventions simulated. Scenarios where the weighted variance or entropy were not significantly different from the lowest entropy scenario are noted with the abbreviation NS. Significance was determined using the Benjamini-Hochberg procedure for controlling the false discovery rate (q = 0.05).
Predictions of timelines to achieve 1% mf in Villupuram district, India, considering extended post-MDA data.
| 15419 | 10 (7–23)1,2,3,4,5,6 | 19.621,2,3,4,5,6 | 3.691,2,3,4,5,6 | - | |
| 16352 | 7 (3–17)0,2,3,5,6 | 12.420,2,3,4,5,6 | 3.470,2,3,4,5,6 | 5.96 | |
| 11581 | 8 (6–18)0,1,4 | 10.750,1,3,5,6 | 3.320,1,3,4,5,6 | 10.03 | |
| 11381 | 8 (6–16)0,1,4 | 8.920,1,2,4 | 3.230,1,2,4 | 12.47 | |
| 16152 | 7 (3–16)0,2,3,5,6 | 10.820,1,3,5,6 | 3.400,1,2,3,5,6 | 7.86 | |
| 11381 | 8 (6–16)0,1,4 | 8.920,1,2,4 | 3.230,1,2,4 | 12.47 | |
The lowest entropy scenario for each site is bolded and shaded grey. Additional scenarios shaded grey are not significantly different from the lowest entropy scenario.
#Scenario 0–4 are as previously defined; Scenario 5: Baseline + post-MDA 3 + post-MDA 5 + post-MDA 7 data; Scenario 6: Baseline + post-MDA 3 + post-MDA 5 + post-MDA 7 + post-MDA 9 data
*For each pair of scenarios, pairwise F-tests for equality of variance were performed to compare the weighted variance, differential Shannon entropy tests were performed to compare the entropy, and Kruskal-Wallis multiple comparison tests were performed to compare the medians. Pairwise significance is represented by reporting those scenarios which are statistically significantly different from each other by numbers (0–4) as superscripts. For example, the weighted variance for scenario 0 has the superscript numbers (1–6) to indicate that the weighted variance for scenario 0 is significantly different from the weighted variance for scenarios 1–6. Significance was determined using the Benjamini-Hochberg procedure for controlling the false discovery rate (q = 0.05) in all pairwise statistical tests.
+information gained by each data stream (scenario1-6) are presented in comparison to the information contained in the model-only simulation (scenario 0)
Annual mf prevalence survey and MDA data for Dokan Tofa, Nigeria.
| Dokan Tofa | ||||
|---|---|---|---|---|
| IVM+ALB (99/9) | ||||
| Anopheles | ||||
| 300–5000 | ||||
| Year | Mf Prev | Upper limit | Total population | |
| 2003 | 5% (21/419) | 7.1% | 74.9% | |
| 2004 | NA | NA | 76.7% | |
| 2005 | 3% (7/236) | 5.4% | 67.4% | |
| 2006 | 0% (0/132) | 2.2% | 77.6% | |
| 2008 | 0% (0/158) | 1.03% | 78.3% | |
| 2010 | 0% (0/119) | 0.73% | ||
Years shaded in grey indicate data used to constrain the model.
* Drug regimen efficacy given as % mf killed instantaneously/number of months of reduced worm fecundity
EPIFIL predictions of timelines to achieve 1% mf prevalence in Dokan Tofa, Nigeria.
| 0 | 3007 | 3 (2–10) | 2.411,2,3,4 | 2.551,2,3,4 | - | |
| 1 | 2046 | 3 (2–8) | 2.400,2,3 | 2.450,2,3 | 0.41 | |
| 2 | 2007 | 3 (2–7) | 2.070,1,4 | 2.350,1,4 | 0.85 | |
| 4 | 2046 | 3 (2–8) | 2.400,2,3 | 2.450,2,3 | 0.41 |
The lowest entropy scenario for each site is bolded and shaded grey. Additional scenarios shaded grey are not significantly different from the lowest entropy scenario.
#Scenario 0: model-only; Scenario 1: baseline data; Scenario 2: baseline + post-MDA 3 data; Scenario 3: baseline + post-MDA 3 + post-MDA 5 data; Scenario 4: baseline + post-MDA 5 data
*For each pair of scenarios, pairwise F-tests for equality of variance were performed to compare the weighted variance, differential Shannon entropy tests were performed to compare the entropy, and Kruskal-Wallis multiple comparison tests were performed to compare medians. Pairwise significance is represented by reporting those scenarios which are statistically significantly different from each other by numbers (0–4) as superscripts. For example, the weighted variance for scenario 0 for Kirare has the superscript numbers (1–4) to indicate that the weighted variance for scenario 0 is significantly different from the weighted variance for scenarios 1–4. Significance was determined using the Benjamini-Hochberg procedure for controlling the false discovery rate (q = 0.05) in all pairwise statistical tests.
+information gained by each data stream (scenario 1–4) are presented in comparison to the information contained in the model-only simulation (scenario 0)