| Literature DB >> 32251421 |
Jun Zhang1, Ming Yue2, Yi Hu1, Robert Bergquist3, Chuan Su4, Fenghua Gao5, Zhi-Guo Cao5, Zhijie Zhang1.
Abstract
Elimination of the intermediate snail host of Schistosoma is the most effective way to control schistosomiasis and the most important first step is to accurately identify the snail habitats. Due to the substantial resources required for traditional, manual snail-searching in the field, and potential risk of miss-classification of potential snail habitats by remote sensing, more convenient and precise methods are urgently needed. Snail data (N = 15,000) from two types of snail habitats (lake/marshland and hilly areas) in Anhui Province, a typical endemic area for schistosomiasis, were collected together with 36 environmental variables covering the whole province. Twelve different models were built and evaluated with indices, such as area under the curve (AUC), Kappa, percent correctly classified (PCC), sensitivity and specificity. We found the presence-absence models performing better than those based on presence-only. However, those derived from machine-learning, especially the random forest (RF) approach were preferable with all indices above 0.90. Distance to nearest river was found to be the most important variable for the lake/marshlands, while the climatic variables were more important for the hilly endemic areas. The predicted high-risk areas for potential snail habitats of the lake/marshland type exist mainly along the Yangtze River, while those of the hilly type are dispersed in the areas south of the Yangtze River. We provide here the first comprehensive risk profile of potential snail habitats based on precise examinations revealing the true distribution and habitat type, thereby improving efficiency and accuracy of snail control including better allocation of limited health resources.Entities:
Mesh:
Year: 2020 PMID: 32251421 PMCID: PMC7162538 DOI: 10.1371/journal.pntd.0008178
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Fig 1The epidemic areas and sample points in Anhui Province about here.
(A) The figure is the epidemic areas of Anhui Province in China with the blue lines through the southern parts of the province representing the Yangtze River. The red and circular purple triangular points represent the sample points in the lake/marshland and hilly/mountainous types of snail habitat, respectively. The map was created using the ArcGIS 10.0 software (ESRI Inc., Redlands, CA, USA).
Summary of environment variables used in study before screening.
| Data description | Label | Variable type |
|---|---|---|
| Normalized Difference Vegetation Index | NDVI | Continuous |
| Land surface temperature | LST | Continuous |
| Elevation | DEM | Continuous |
| Aspect | Asp | Continuous |
| Slope | Slope | Continuous |
| Distance to nearest water body | Water | Continuous |
| WorldClim | Bio1~Bio19 | Continuous |
| Accumulated temperature beyond 0°C | Aat0 | Continuous |
| Accumulated temperature beyond 10°C | Aat10 | Continuous |
| Moisture index | Im | Continuous |
| Annual average precipitation | Pa | Continuous |
| Annual average temperature | Tadem | Continuous |
| Soil type | Soil | Categorical |
| Soil texture | Clay, Sand, Silt | Continuous |
| Geomorphic type | Geo | Categorical |
| Land use type | Lucc | Categorical |
| Ecosystem type | Eco | Categorical |
| Vegetation type | Veg | Categorical |
Evaluation of 12 models based on five different statistical indexes.
| AUC | Kappa | PCC | sensitivity | specificity | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | PO | lake/marshland | hilly | lake/marshland | hilly | lake/marshland | hilly | lake/marshland | hilly | lake/marshland | hilly |
| RF | PA | 0.993 | 0.985 | 0.928 | 0.883 | 0.964 | 0.942 | 0.991 | 0.974 | 0.937 | 0.908 |
| ANN | PA | 0.970 | 0.900 | 0.809 | 0.618 | 0.905 | 0.809 | 0.857 | 0.712 | 0.952 | 0.887 |
| SVM | PA | 0.962 | 0.925 | 0.762 | 0.638 | 0.881 | 0.819 | 0.826 | 0.731 | 0.936 | 0.907 |
| Maxent | PO | 0.969 | 0.922 | 0.656 | 0.638 | 0.828 | 0.820 | 0.680 | 0.724 | 0.974 | 0.914 |
| GBM | PA | 0.965 | 0.910 | 0.802 | 0.666 | 0.900 | 0.833 | 0.986 | 0.951 | 0.816 | 0.715 |
| MARS | PA | 0.964 | 0.895 | 0.810 | 0.634 | 0.905 | 0.817 | 0.983 | 0.948 | 0.828 | 0.686 |
| GLM | PA | 0.955 | 0.894 | 0.800 | 0.602 | 0.900 | 0.801 | 0.984 | 0.941 | 0.817 | 0.661 |
| FDA | PA | 0.955 | 0.891 | 0.799 | 0.623 | 0.900 | 0.812 | 0.944 | 0.897 | 0.856 | 0.727 |
| CTA | PA | 0.935 | 0.885 | 0.822 | 0.716 | 0.911 | 0.858 | 0.987 | 0.973 | 0.836 | 0.744 |
| Domain | PO | 0.916 | 0.813 | 0.832 | 0.626 | 0.916 | 0.813 | 0.997 | 0.995 | 0.835 | 0.631 |
| GARP | PO | 0.882 | 0.796 | 0.625 | 0.349 | 0.812 | 0.675 | 0.991 | 0.988 | 0.635 | 0.362 |
| Bioclim | PO | 0.860 | 0.768 | 0.719 | 0.536 | 0.860 | 0.768 | 0.920 | 0.924 | 0.800 | 0.613 |
*Presence only;
**Presence and absence
Fig 2ROC curves of predicted results of the 12 models for the two types of snail habitat.
(A) ROC curve for snail habitats in the lake/marshlands. (B) ROC curve for snail habitats in the hilly/mountainous areas.
The importance of variables (IV).
| Lake/marshland type | Hilly type | ||
|---|---|---|---|
| Variable name | Importance of variables (IV) | Variable name | Importance of variables (IV) |
| Water | 0.305 | Bio9 | 0.362 |
| Bio9 | 0.268 | Bio12 | 0.335 |
| Aat10 | 0.190 | DEM | 0.121 |
| Bio3 | 0.035 | Veg | 0.118 |
| DEM | 0.032 | Bio8 | 0.072 |
| Im | 0.015 | Water | 0.047 |
| Bio6 | 0.010 | Aat0 | 0.034 |
| LST | 0.008 | Bio15 | 0.017 |
| Geo | 0.007 | Soil | 0.013 |
| Bio8 | 0.004 | NDVI | 0.009 |
| Clay | 0.004 | LST | 0.007 |
| NDVI | 0.004 | Slope | 0.007 |
| Lucc | 0.003 | Clay | 0.006 |
| Veg | 0.002 | Geo | 0.005 |
| Asp | 0.001 | Sand | 0.002 |
| Eco | 0.001 | Asp | 0.001 |
| Silt | 0.001 | Eco | 0.001 |
| Slope | 0.001 | Lucc | 0.001 |
| Soil | 0.001 | Silt | 0.001 |
| Sand | 0.000 | ||
Fig 3Predicted risk map of potential snail habitats for Anhui Province according to the RF model.
(A) Risk map of potential snail habitats for the lake/marshland type. The shifting shades of the colour red from light to dark represent the risk of snail presence changing from low to high. (B) Risk map of potential snail habitats for the hilly/mountainous type. The shifting shades of the colour green from light to dark represent the risk of snail presence changing from low to high. (C) Combined risk map with pink areas representing the lake/marshland type only, yellow areas the hilly/mountainous type only, and the red areas the overlapping regions.