| Literature DB >> 23180505 |
Kate Zinszer1, Aman D Verma, Katia Charland, Timothy F Brewer, John S Brownstein, Zhuoyu Sun, David L Buckeridge.
Abstract
OBJECTIVES: There is a growing body of literature on malaria forecasting methods and the objective of our review is to identify and assess methods, including predictors, used to forecast malaria.Entities:
Year: 2012 PMID: 23180505 PMCID: PMC3533056 DOI: 10.1136/bmjopen-2012-001992
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Flow of literature searches and screening process.
Characteristics of malaria forecasting studies included in review (n=29)
| Authors (reference) | Population and setting | Model specifics | Malaria outcome | Number of data points used for training/testing | Evaluation measure |
|---|---|---|---|---|---|
| Adimi | Community health post data from 2004 to 2007 for 23 provinces in Afghanistan; clinical confirmation | 23 linear regressions (1 for each province); included autoregressive, seasonal and trend parameters | Monthly cases | 31/6 (varied between provinces but last 6 months used only for testing) | Root mean squared error and absolute difference |
| Chatterjee and Sarkar | Municipal data for 2002–2005 for Chennai (city), India; microscopic confirmation | Logistic regression; polynominal and autoregressive parameters | Monthly slide positivity rate | 36/1 | 95% CI (for predicted value and compared to observed) |
| Gomez-Elipe | Health service data from 1997 to 2003 for Karuzi Province, Burundi; clinical confirmation | Linear regression; adjusted for population, lagged weather covariates, autoregressive and seasonal parameters | Monthly incidence | 60/24; 1 month ahead forecasts | 95% CI, correlation, p value trend line of difference (between predicted and observed) |
| Haghdoost | District health centre data from 1994 to 2001 for Kahnooj District, Iran; microscopic confirmation | Separate Poisson regressions for | 10-day cases | 213/73 | Average percent error |
| Rahman | Hospital data from 1992 to 2001 for all divisions of Bangladesh; clinical confirmation | Four linear regressions (1 for each administrative division and one for all of Bangladesh); environmental covariate for weeks of highest correlation | Yearly cases | 10, 1 year was removed from series at a time | Root mean squared error and relative bias (observed-predicted) |
| Roy | Municipal data for Chennai (city) (2002–2004) and Mangalore (city) (2003–2007), India; microscopic confirmation | Two linear regressions (one for each city); adjusted for population, lagged weather covariates, autoregressive term, interaction terms, polynomial terms | Monthly SPR (Chennai), monthly cases (Mangalore) | 28/8 (Chennai), 48/12 (Mangalore); 1 month ahead | 95% CI |
| Teklehaimanot | Health facility data from 1990 to 2000 for all districts in Ethiopia; microscopic confirmation | 10 Poisson regressions (one for each district); lagged weather covariates, autoregressive term, time trend and indicator covariates for week of the year | Weekly cases | 572 (varied between districts, training and testing); 52 weeks (year) were removed from series at a time; 1–4 week ahead forecasts | Compared performance of alerts from predicted versus observed cases (using potentially prevented cases) |
| Xiao | Medical and health unit data from 1995 to 2007 for Hainan Province, China; microscopic confirmation | Poisson regression; lagged weather covariates, autoregressive term | Monthly incidence | 144/12 | T-test (predictive value significantly different than actual) |
| Yacob and Swaroop | Medical data from 1944 to 1996 for all health districts in Punjab; clinical confirmation | 19 linear regressions (1 for each district); include coefficients of correlation between rainfall and epidemic figures from 1914 to 1943 | Seasonal epidemic figure* | Coefficient of correlation (between actual and predicted epidemic figure) | |
| Yan | Municipal data from 1951 to 2001 for Chongquin (city), China | Linear regression; logarithm curve | Yearly cases | 50/1 | Visual inspection of predicted within range of actual values |
| Abeku | Health clinics data from 1986 to 1999 for 20 areas in Ethiopia; mixture of microscopic and clinical confirmed | 20 models (1 for each area) compared approaches: Overall average, seasonal average, seasonal adjustment, ARIMA | Monthly cases | 168/12 (varied between areas but last 12 months only used for testing); 1–12 month ahead forecasts | Average forecast error |
| Briët | Health facility data from 1972 to 2005 for all districts in Sri Lanka; microscopic confirmation | 25 models (1 for each district) compared approaches: Holt-Winters, ARIMA (seasonality assessed with fixed effects or harmonics) and SARIMA; lagged weather covariates | Monthly cases of malaria slide positives | 180/204 (varied between districts but approximately 50% of series reserved for testing); 1–4 month ahead forecasts | Mean absolute relative error |
| Liu | Data from 2004 to 2010 for China | SARIMA | Monthly incidence | 72/12 | Visual (plot of predicted vs observed) |
| Wangdi | Health centre data from 1994 to 2008 for seven districts in Bhutan; microscopic and antigen confirmation | Seven models (one for each district): SARIMA and ARIMAX; lagged weather covariates | Monthly cases | 144/24 | Mean average percent error |
| Wen | Data from 1991 to 2002 for Wanning County, China | SARIMA | Monthly incidence | 252/12 | 95% CI |
| Zhang | CDC data from 1959 to 1979 for Jinan (city) China; clinical confirmation | SARIMA; lagged weather covariates | Monthly cases | 84/120 (removed 1967 and 1968 from series) | Visual (plot of predicted vs observed) |
| Zhou | Data from 1996 to 2007 for Huaiyuan County, China; microscopic and clinical confirmation | SARIMA | Monthly incidence | 108/12 | Average error |
| Zhu | Data from 1998 to 2007 for Huaiyuan and Tongbai counties, China | SARIMA | Monthly incidence rates | 84/24; 1–12 month ahead forecasts | 95% CI and error |
| Gaudart | Data from cohort of children from 1996 to 2000 in Bancoumana (municipality), Mali from 1996 to 2006; microscopic confirmation | VSEIRS model | Monthly incidence rate | 60 (training and testing); 15 day, 1 month, 2 month, seasonal forecasts | Mean absolute percentage error and root mean squared error |
| Laneri | Health centre data (passive and active surveillance) for Kutch (1987–2007) and Balmer (1985–2005) Districts, India; microscopic confirmation | 2 models (one for each district); compared two types of VSEIRS model to linear and negative binominal regressions | Monthly incidence for parameter estimation; seasonal totals (Sept−Dec) for epidemic forecasting | 240 (training and testing); 1 to 4 months ahead forecasts | Weighted mean square error and prediction likelihood |
| Cunha | Ministry of Health data from 2003 to 2009 for Cornwall (City), Brazil; microscopic confirmation | Compared neural network to linear regression | Monthly cases | 72/12; 3, 6 and 12 months forecasts | Absolute error and mean square error |
| Gao | Data from 1994 to 1999 for Honghe State, China | Neural network | Monthly incidence | 48/12 | Percent error |
| Kiang | Hospital and clinic data from 1994 to 2001 for 19 provinces, Thailand; microscopic confirmation | 19 neural networks (1 for each province); various architectures used (varied by province) | Monthly incidence | 84/12 | Root mean square error |
| Fang | Data from 1956 to 1988 for Xuzhou (City), China | Grey and Grey Verhulst models (1,1) | Yearly incidence | 30/2 | Percent error |
| Gao | Data from 1998 to 2005 for Longgang District, China | Grey model (1,1) | Yearly incidence | 6/1 | Error and percent error |
| Guo | Data from 1988 to 2010 China | Grey model (1,1) | Yearly incidence | 21/2 | Visual (plot of predicted vs observed) |
| Gill | Medical data from 1925 to 1926 for health districts in Punjab; clinical confirmation | 29 forecasts consisting of visual inspection of rainfall, spleen rates and epidemic potential† | Seasonal epidemic (yes/no) | Qualitative comparison of prediction (presence of epidemic) to epidemic figure | |
| Medina | Community health centre data from 1996 to 2004 (14 centres) for Niono District, Mali; clinical confirmation | Multiplicative Holt-Winters model, age-specific rates (three age groups); compared to seasonal adjustment method | Monthly malaria consultation rates | 36/72; 2 and 3-month ahead forecasts; one step ahead forecasts | Mean absolute percentage error and 95% CI |
| Xu and Jin | Data from 2000 to 2005 for Jiangsu Province, China | Grey model | Yearly cases | 4/1 | Visual (plot of predicted vs observed number of cases) |
*Seasonal epidemic figure is the ratio of October incidence to mean spring incidence.
†Epidemic potential is the coefficient of variability of fevers during the month of October for the periods of 1868–1921.
ARIMA, auto-regressive integrated moving average; ARIMAX, auto-regressive integrated moving average with exogenous input; SARIMA, seasonal auto-regressive integrated moving average; SPR, slide positivity rate; VSEIRS, vector-susceptible-exposed-infected-recovered-susceptible model.
Summary of malaria forecasting methods (n=29)
| Forecasting method | Number of studies (reference) |
|---|---|
| GLM | 12 |
| ARIMA | 7 |
| Grey methods | 4 |
| Smoothing methods* | 3 |
| Neural networks | 3 |
| Mathematical models | 2 |
| Visual | 1 |
References in bold indicate multiple comparisons. ARIMA, auto-regressive integrated moving average; GLM, generalised linear model.
*Includes Holt - (Holt-Winters) Winters, seasonal average, seasonally adjusted average and simple average.
Time varying predictors considered in malaria forecasting models
| Predictor | Number of studies (reference) |
|---|---|
| Total rainfall | 11 |
| Average rainfall | 2 |
| Rainy day index* | 1 |
| Number of rainy days/month | 1 |
| Average relative humidity | 7 |
| Minimum humidity | 1 |
| Maximum humidity | 1 |
| Maximum air temperature | 8 |
| Minimum air temperature | 7 |
| Average air temperature | 4 |
| Average LST | 2 |
| Temperature condition index | 1 |
| Average NDVI | 2 |
| Maximum NDVI | 2 |
| Vegetation condition index | 1 |
| Average air pressure | 2 |
| Average air evaporation | 1 |
| Sunshine hours | 1 |
| Malaria in neighbouring districts | 1 |
| Population | 1 |
*Rainy day index: the number of days per month when rainfall was larger than zero divided by the number of days that a reading for rainfall was available.
LST, land surface temperature; NDVI, normalised difference vegetation index.
Parameters included in the mathematical forecasting models
| Predictor | References |
|---|---|
| Mean developmental delay | |
| Number of bites per night | |
| Probability of a susceptible becoming infected after one single bite from a contagious human | |
| Mortality per day | |
| Density | |
| Length of gonotrophic cycle | |
| Time lag of NDVI influence | |
| Lowest NDVI value to influence behaviour | |
| Probability of a susceptible human becoming infected after one single infected bite | |
| Probability of becoming susceptible after being resistant | |
| Probability of acquiring contagiousness | |
| Probability of losing contagiousness | |
| Average human life expectancy | |
| Infectivity of quiescent cases relative to full-blown infections | |
| Reporting fraction* |
*Reporting fraction is the fraction of malaria cases in the population that are reported to public health.
NDVI, normalised difference vegetation index.