| Literature DB >> 35027002 |
Julia Ledien1, Zulma M Cucunubá2,3, Gabriel Parra-Henao4,5, Eliana Rodríguez-Monguí6, Andrew P Dobson7, María-Gloria Basáñez2, Pierre Nouvellet8.
Abstract
Age-stratified serosurvey data are often used to understand spatiotemporal trends in disease incidence and exposure through estimating the Force-of-Infection (FoI). Typically, median or mean FoI estimates are used as the response variable in predictive models, often overlooking the uncertainty in estimated FoI values when fitting models and evaluating their predictive ability. To assess how this uncertainty impact predictions, we compared three approaches with three levels of uncertainty integration. We propose a performance indicator to assess how predictions reflect initial uncertainty.In Colombia, 76 serosurveys (1980-2014) conducted at municipality level provided age-stratified Chagas disease prevalence data. The yearly FoI was estimated at the serosurvey level using a time-varying catalytic model. Environmental, demographic and entomological predictors were used to fit and predict the FoI at municipality level from 1980 to 2010 across Colombia.A stratified bootstrap method was used to fit the models without temporal autocorrelation at the serosurvey level. The predictive ability of each model was evaluated to select the best-fit models within urban, rural and (Amerindian) indigenous settings. Model averaging, with the 10 best-fit models identified, was used to generate predictions.Our analysis shows a risk of overconfidence in model predictions when median estimates of FoI alone are used to fit and evaluate models, failing to account for uncertainty in FoI estimates. Our proposed methodology fully propagates uncertainty in the estimated FoI onto the generated predictions, providing realistic assessments of both central tendency and current uncertainty surrounding exposure to Chagas disease.Entities:
Keywords: Chagas disease; Force of infection; Infectious disease; Model averaging; Modelling
Mesh:
Year: 2022 PMID: 35027002 PMCID: PMC8759231 DOI: 10.1186/s12874-021-01477-6
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Comparison of the predictive ability of the best-fit models for the three approaches investigated. Approach 1: (A1) models fitted with median FoI estimates and selected based on predictive R2; Approach 2 (A2): models fitted with median FoI estimates and selected base on predictive R2 and overlap; Approach 3 (A3): models fitted with the full posterior distribution of FoI estimates and selected based on the predictive R2 and overlap. Note: The overlap obtained for A1 is presented for comparison purpose and has been calculated using the same methodology as A2 but is never taken into consideration for the model selection
Fig. 2Goodness-of-fit of the model averaging of the 3 modelling approaches for all serosurveys. The solid lines and envelopes show standardised distances between observations and predictions’ median (blue), and 95%CrI (upper bound in red and lower bound in purple). A perfect fit would translate in all colored solid lines overalpping with the correspondingly-colored dotted lines. A blue solid line overlapping the blue dotted line, together with a red and purple solid lines at 2 and − 2 respectively would reflect a good central prediction with CrI in predictions twice as large as the CrI in the ‘observed’ FoI. Approach 1: models fitted with median FoI estimates and selected based on predictive R2; Approach 2: models fitted with median FoI estimates and selected based on predictive R2 and overlap; Approach 3: models fitted with the full posterior distribution of FoI estimates and selected based on the predictive R2 and overlap
Fig. 3Force-of-Infection of Chagas disease in urban, rural and indigenous settings, Colombia, 1990. Main map, predictions per year and per susceptible individual; small map, Median Absolute Deviation (MAD) Coefficient of Variation (n = 1065 municipalities) . Rows correspond to the 3 modelling approaches. Maps show model-averaged estimates (across the 10 best setting-specific models). Approach 1: models fitted using the median FoI estimates and selected based on predictive R2; Approach 2: models fitted with median FoI estimates and selected based on predictive R2 and overlap; Approach 3: models fitted with the full posterior distribution of FoI estimates and selected based on the predictive R2 and overlap
Predicted FoI averaged across all Colombian municipalities in 1980, 1990 and 2010, the percentage of decrease between 1980 and 2010 (trend) for each setting and the spatial clustering effect given by the Moran’s I statistic for the test under randomisation in 1980, 1990, 2000 and 2010 (n = 1065 municipalities)
| Predicted FoI values | Moran’s I statistic | |||||||
|---|---|---|---|---|---|---|---|---|
| 1980 | 1990 | 2010 | trend | 1980 | 1990 | 2000 | 2010 | |
| mean (sd) | mean (sd) | mean (sd) | % | |||||
| Urban | 2.2 × 10−3 (1.1 × 10−3) | 2.1 × 10− 3 (1.1 × 10− 3) | 1.7 × 10− 3 (9.9 × 10− 4) | −23a | 0.82 | 0.82 | 0.79 | 0.78 |
| Rural | 1.7 × 10− 3 (1.0 × 10− 3) | 1.7 × 10− 3 (1.0 × 10− 3) | 1.7 × 10− 3 (1.0 × 10− 3) | −0.07 | 0.93 | 0.93 | 0.93 | 0.93 |
| Indigenous | 2.0 × 10− 2 (4.5 × 10− 3) | 2.0 × 10− 2 (4.5 × 10− 3) | 1.8 × 10− 2 (4.4 × 10− 3) | −7a | 0.91 | 0.91 | 0.90 | 0.90 |
aStatistically significant at a 5% significance level according to Student’s t test comparing FoI values between 1980 and 2010
Fig. 4Spatiotemporal trends in Chagas disease Force-of-Infection, Colombia, 1980–2010. Main maps, predictions per year using approach 3 and model averaging; small maps, MAD Coefficient of Variation (n = 1065 municipalities)
Fig. 5Predictors included in the model averaging of the FoI of Chagas disease in Colombia. Models fitted with the full posterior distribution of FoI estimates and selected based on predictive R2 and overlap. For the full set of predictors see Supp. Table 1