| Literature DB >> 28708077 |
Han Zhang1, Yali Si2,3, Xiaofeng Wang4, Peng Gong5,6.
Abstract
Bacillary dysentery has long been a considerable health problem in southwest China, however, the quantitative relationship between anthropogenic and physical environmental factors and the disease is not fully understand. It is also not clear where exactly the bacillary dysentery risk is potentially high. Based on the result of hotspot analysis, we generated training samples to build a spatial distribution model. Univariate analyses, autocorrelation and multi-collinearity examinations and stepwise selection were then applied to screen the potential causative factors. Multiple logistic regressions were finally applied to quantify the effects of key factors. A bootstrapping strategy was adopted while fitting models. The model was evaluated by area under the receiver operating characteristic curve (AUC), Kappa and independent validation samples. Hotspot counties were mainly mountainous lands in southwest China. Higher risk of bacillary dysentery was found associated with underdeveloped socio-economy, proximity to farmland or water bodies, higher environmental temperature, medium relative humidity and the distribution of the Tibeto-Burman ethnicity. A predictive risk map with high accuracy (88.19%) was generated. The high-risk areas are mainly located in the mountainous lands where the Tibeto-Burman people live, especially in the basins, river valleys or other flat places in the mountains with relatively lower elevation and a warmer climate. In the high-risk areas predicted by this study, improving the economic development, investment in health care and the construction of infrastructures for safe water supply, waste treatment and sewage disposal, and improving health related education could reduce the disease risk.Entities:
Keywords: anthropogenic environment; bacillary dysentery; logistic regression model; physical environment; prevention and intervention; risk mapping
Mesh:
Year: 2017 PMID: 28708077 PMCID: PMC5551220 DOI: 10.3390/ijerph14070782
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Summary of environmental datasets used in this study.
| Category | Description of Datasets | Format | Resolution |
|---|---|---|---|
| Anthropogenic environmental data | GRP (gross regional product) | Raster | 1 km |
| Ethnic group | Polygon | ||
| Number of hospital beds | Polygon | ||
| Physical environmental data | DEM (digital elevation model) | Raster | 90 m |
| Temperature | Raster | 1 km | |
| Precipitation | Point | ||
| Relative humidity | Point | ||
| Rivers | Polyline | ||
| Lakes | Polygon | ||
| Forest | Raster | 30 m | |
| Farmland | Raster | 30 m | |
| Terrain type | Polygon |
Summary of environmental variables used for analysis in this study.
| Category | Description of Variables | Type | Abbreviation | Unit |
|---|---|---|---|---|
| Anthropogenic environmental variables | GRP per 1 km2 | Continuous | GRP | 10 million yuan/km2 |
| Ethnic group (2: Tibeto-Burman; 1: Others) | Categorical | Ethnic | No unit | |
| Number of beds in hospitals (4: fewer than 204 beds; 3: 204–668 beds; 2: 669–1678 beds; 1: more than 1679 beds) | Categorical | BedNum | No unit | |
| Physical environmental variables | Elevation | Continuous | Elevation | 100 m |
| Elevation variation | Continuous | ElevationSD | 100 m | |
| Slope | Continuous | Slope | ° | |
| Temperature (Annual/Summer/Winter) | Continuous | T Ann/Sum/Win | °C | |
| Precipitation (Annual/Summer/Winter) | Continuous | PR Ann/Sum/Win | mm | |
| Relative humidity (Annual/Summer/Winter) | Continuous | RH Ann/Sum/Win | % | |
| Distance to water bodies | Continuous | DisWater | km | |
| Distance to farmlands | Continuous | DisFarm | km | |
| Forest coverage (2: Forest; 1: Non-forest) | Categorical | Forest | No unit | |
| Terrain type (2: Mountain land; 1: Others) | Categorical | Terrain | No unit | |
| Interaction variable | Ethnic * Terrain 1 (2: Tibeto-Burman in Mountain land; 1: Others) | Categorical | EthnicTerrain | No unit |
1 An interaction variable generated from Ethnic and Terrain. Regions belong to both Tibeto-Burman and mountain land are assigned with value 2, other regions are assigned with value 1 as references.
Figure 1Distribution of hotspot districts/counties in southwest China from 2005 to 2014. (The white counties are indicated because they were excluded from residential sampling due to their uncertain status according to the hot spot analysis algorithm).
Summary of the multiple logistic regression models on the risks for bacillary dysentery in southwest China.
| Variable | Coefficient | OR | OR (95% CI) | AUC ± SD | Kappa ± SD | ||
|---|---|---|---|---|---|---|---|
| Intercept | −24.557 | <0.001 | |||||
| DisFarm | −0.087 | 0.916 | 0.880 | 0.954 | <0.001 | ||
| DisWater | −0.062 | 0.940 | 0.914 | 0.967 | 0.001 | ||
| GRP | −0.296 | 0.744 | 0.664 | 0.834 | <0.001 | ||
| EthnicTerrain-2 | 1.112 | 3.059 1 | 2.242 | 4.175 | <0.001 | ||
| Forest-2 | 3.474 | 32.875 2 | 13.154 | 82.188 | <0.001 | ||
| Slope | −0.131 | 0.877 | 0.863 | 0.891 | <0.001 | ||
| ElevationSD | 0.542 | 1.721 | 1.535 | 1.931 | <0.001 | ||
| RH_Ann | 0.706 | 2.029 | 1.758 | 2.341 | <0.001 | ||
| (RH_Ann) 2 | −0.007 | 0.993 | 0.992 | 0.994 | <0.001 | ||
| AT_Ann | 0.169 | 1.184 | 1.135 | 1.236 | <0.001 | ||
| BedNum-2 | 0.877 | 2.441 3 | 1.178 | 5.062 | <0.001 | ||
| BedNum-3 | 2.346 | 10.710 | 5.155 | 22.260 | <0.001 | ||
| BedNum-4 | 2.981 | 20.234 | 9.187 | 44.586 | <0.001 | ||
| Model | 0.944 ± 0.004 | 0.75 ± 0.01 | |||||
1 the odds ratio of EthnicTerrain-2 (Tibeto-Burman in Mountain land) is estimated in comparison to EthnicTerrain-1 (other situations); 2 the odds ratio of Forest-2 (forest coverage) is estimated in comparison to Forest-1 (non-forest coverage); 3 the odds ratio of BedNum-2, BedNum-3, BedNum-4 is estimated in comparison to BedNum-1 respectively.
Figure 2ROC curves of the predictive power of the multiple logistic regression models on the risks of bacillary dysentery in southwest China. (Gray lines: ROC curves of the 1000 fitted models; Black line: mean ROC curve of the 1000 fitted models).
Accuracy assessment based on independent validation samples.
| Predicted | Actual | |
|---|---|---|
| P 1 | A 2 | |
| 113 | 16 | |
| 14 | 111 | |
| Overall accuracy: 88.19% | ||
1 P: Presence; 2 A: Absence.
Figure 3Predictive risk map of bacillary dysentery in southwest China.
Figure 4Predicted high-risk patches in non-hotspot regions. (A–D are four examples of predicted high-risk patches in non-hotspot region. Residents in site A, B and C live in river valleys, residents in site D live on rugged lands. The four images are Google Earth images in corresponding sites showing the poor living conditions).
Figure 5Predicted risk of bacillary dysentery in different hypothetical scenarios. (Scenarios (a): GRP per square kilometer of the under developed places are improved to the contemporaneous level of that in the urban area of Lhasa. Scenarios (b): the number of hospital beds rise by one level. Scenarios (c): both (a) and (b)).