| Literature DB >> 29351762 |
Nicholas Midzi1, Blessing Kavhu2, Portia Manangazira3, Isaac Phiri3, Susan L Mutambu4, Cremants Tshuma3, Moses J Chimbari5, Shungu Munyati6, Stanely M Midzi7, Lincon Charimari7, Anatoria Ncube8, Masceline J Mutsaka-Makuvaza4, White Soko4, Emmanuel Madzima4, Gibson Hlerema4, Joel Mbedzi4, Gibson Mhlanga3, Mhosisi Masocha2.
Abstract
BACKGROUND: Reliable mapping of soil-transmitted helminth (STH) parasites requires rigorous statistical and machine learning algorithms capable of integrating the combined influence of several determinants to predict distributions. This study tested whether combining edaphic predictors with relevant environmental predictors improves model performance when predicting the distribution of STH, Ascaris lumbricoides and hookworms at a national scale in Zimbabwe.Entities:
Keywords: Ascaris lumbricoides; Gradient boosted model; Hookworms; Maxent; Soil-transmitted helminths; Species distribution
Mesh:
Substances:
Year: 2018 PMID: 29351762 PMCID: PMC5775612 DOI: 10.1186/s13071-017-2586-6
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 3.876
Fig. 1Location of Zimbabwe. A. lumbricoides and hookworm data are overlaid as solid circles within administrative district boundaries
Characteristics of environmental variables considered important in predicting the distribution of STH in Zimbabwe
| Variable | Units | Spatial resolution (km) | Data source | Accessible at |
|---|---|---|---|---|
| Gridded monthly CHIRPS precipitation | mm per month | ~5.5 | Climate Hazards Group |
|
| MODIS monthly daytime land surface temperature (MOD11C3) | Kelvin | ~5.5 | NASA’s Land Processes Distributed Active Archive Center (LP DAAC) | |
| MODIS monthly night-time land surface temperature (MOD11C3) | Kelvin | ~5.5 | NASA’s Land Processes Distributed Active Archive Center (LP DAAC) | |
| MODIS normalized difference vegetation index (MOD13A3) | dimensionless (-1 to 1) | 1 | NASA’s Land Processes Distributed Active Archive Center (LP DAAC) | |
| Gridded human population density | number of persons/km2 | 1 | Socioeconomic and Data Application Centers |
|
| Distance from perennial rivers | m | 1 | Calculated in a GIS | |
| Long-term average soil moisture | % | 30 | Africa Soil Information Services |
|
| Soil pH | – | 5 | International Soil Reference Centre (ISRIC) |
|
| Soil organic carbon (C) content topsoil (0–30 cm) | % C | 5 | International Soil Reference Centre (ISRIC) |
|
Fig. 2Scatterplot of TSS and ROC illustrating the performance of ten modelling techniques used to predict the distribution of A. lumbricoides in Zimbabwe
Fig. 3Scatterplot of TSS and ROC illustrating the performance of ten modelling techniques used to predict hookworms distribution in Zimbabwe
Percentage change in model performance among ten modelling techniques used to predict A. lumbricoides distribution in Zimbabwe
| Model | TSS (2)a | TSS (1)a | % change | ROC (2)a | ROC (1)a | % change |
|---|---|---|---|---|---|---|
| ANN | 0.10 | 0.24 | 140 | 0.55 | 0.58 | 5 |
| CTA | 0.20 | 0.38 | 90 | 0.56 | 0.72 | 29 |
| FDA | 0.34 | 0.46 | 35 | 0.69 | 0.78 | 13 |
| GAM | 0.23 | 0.37 | 12 | 0.57 | 0.67 | 18 |
| GBM | 0.45 | 0.60 | 33 | 0.75 | 0.76 | 1 |
| GLM | 0.45 | 0.54 | 20 | 0.73 | 0.74 | 1 |
| MARS | 0.32 | 0.48 | 50 | 0.66 | 0.74 | 12 |
| MAXENT | 0.32 | 0.42 | 31 | 0.67 | 0.71 | 6 |
| RF | 0.33 | 0.42 | 27 | 0.68 | 0.74 | 9 |
| SRE | 0.20 | 0.52 | 160 | 0.49 | 0.75 | 53 |
aEvaluation scores for models with environmental variables only are denoted TSS (2) and ROC (2) and those derived from a set of environmental variables plus edaphic variables are denoted TSS (1) and ROC (1)
Percentage change in model performance among ten modelling techniques used to predict hookworms distribution in Zimbabwe
| Model | TSS(2)a | TSS (1)a | % change | ROC (2)a | ROC (1)a | % change |
|---|---|---|---|---|---|---|
| ANN | 0.00 | 0.02 | 1900 | 0.01 | 0.51 | 5000 |
| CTA | 0.32 | 0.49 | 53 | 0.71 | 0.73 | 3 |
| FDA | 0.42 | 0.47 | 12 | 0.76 | 0.78 | 3 |
| GAM | 0.38 | 0.51 | 34 | 0.69 | 0.77 | 12 |
| GBM | 0.42 | 0.51 | 21 | 0.74 | 0.77 | 4 |
| GLM | 0.42 | 0.61 | 45 | 0.76 | 0.84 | 11 |
| MARS | 0.39 | 0.50 | 28 | 0.66 | 0.79 | 20 |
| MAXENT | 0.34 | 0.53 | 6 | 0.68 | 0.76 | 12 |
| RF | 0.40 | 0.42 | 5 | 0.76 | 0.78 | 3 |
| SRE | 0.00 | 0.10 | 9900 | 0.48 | 0.55 | 15 |
aEvaluation scores for models with environmental variables only are denoted TSS (2) and ROC (2) and those derived from a set of environmental variables plus edaphic variables are denoted TSS (1) and ROC (1)
Fig. 4The predicted spatial distribution of A. lumbricoides and hookworms across Zimbabwe
Fig. 5The predicted presence of A. lumbricoides and hookworms within and across the administrative districts of Zimbabwe
Variables identified as important for modelling the geographical distribution of A. lumbricoides and hookworms in Zimbabwe
| Variable |
| Hookworms | ||||
|---|---|---|---|---|---|---|
| GLM | GBM | SRE | GLM | GBM | MAXENT | |
| DPW | 0.147* | 0.062 | 0.071 | 0.153* | 0.075 | 0.000 |
| Soil moisture | 0.078 | 0.069 | 0.183* | 0.062 | 0.025 | 0.001 |
| Soil pH | 0.551* | 0.144* | 0.076 | 0.062 | 0.004 | 0.556* |
| Soil organic content | 0.089 | 0.026 | 0.044 | 0.159* | 0.211* | 0.131* |
| HPD | 0.030 | 0.266* | 0.206* | 0.048 | 0.365* | 0.261* |
| AVP | 0.111* | 0.034 | 0.152* | 0.000 | 0.037 | 0.081 |
| NDVI | 0.379* | 0.132* | 0.190* | 0.460* | 0.233* | 0.000 |
| LST(day) | 0.193* | 0.079 | 0.161* | 0.295* | 0.088 | 0.148* |
Abbreviations: GLM generalised linear model, GBM gradient boosted model, SRE surface range envelope, Maxent maximum entropy, DPW distance from perennial water body, HPD human population density, AVP average annual precipitation, LST land surface temperature
*Important predictors