| Literature DB >> 30278073 |
Won Seob Oh1, Sanghyun Yoon1, Juhwan Noh2, Jungwoo Sohn2, Changsoo Kim2, Joon Heo1.
Abstract
Geographical variations and influential factors of disease prevalence are crucial information enabling optimal allocation of limited medical resources and prioritization of appropriate treatments for each regional unit. The purpose of this study was to explore the geographical variations and influential factors of cardiometabolic disease prevalence with respect to 230 administrative districts in South Korea. Global Moran's I was calculated to determine whether the standardized prevalences of cardiometabolic diseases (hypertension, stroke, and diabetes mellitus) were spatially clustered. The CART algorithm was then applied to generate decision tree models that could extract the diseases' regional influential factors from among 101 demographic, economic, and public health data variables. Finally, the accuracies of the resulting model-hypertension (67.4%), stroke (62.2%), and diabetes mellitus (56.5%)-were assessed by ten-fold cross-validation. Marriage rate was the main determinant of geographic variation in hypertension and stroke prevalence, which has the possibility that married life could have positive effects in lowering disease risks. Additionally, stress-related variables were extracted as factors positively associated with hypertension and stroke. In the opposite way, the wealth status of a region was found to have an influence on the prevalences of stroke and diabetes mellitus. This study suggested a framework for provision of novel insights into the regional characteristics of diseases and the corresponding influential factors. The results of the study are anticipated to provide valuable information for public health practitioners' cost-effective disease management and to facilitate primary intervention and mitigation efforts in response to regional disease outbreaks.Entities:
Mesh:
Year: 2018 PMID: 30278073 PMCID: PMC6168158 DOI: 10.1371/journal.pone.0205005
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1230 administrative districts in South Korea; Source: Statistical Geographical Information Service (SGIS).
Fig 2Flow chart for generation of optimally pruned tree with maximum classification accuracy based on ten-fold cross-validation.
The optimal tree size is determined from the point where the average classification accuracy in the 10-fold cases is maximized.
Statistical test of Moran's I for each disease.
| Disease | Moran’ I | z-score |
|---|---|---|
| Hypertension | 0.30 | 5.69 |
| Stroke | 0.24 | 4.47 |
| Diabetes mellitus | 0.26 | 4.96 |
Fig 3Spatial distribution of three cardiometabolic diseases: (a) Hypertension; (b) Stroke; (c) Diabetes mellitus; Portions of this document/figure include intellectual property of Esri and its licensors and are used under license. Copyright [31, Aug., 2018.] Esri and its licensors. All rights reserved.
Fig 4Influential factors of hypertension extracted from decision tree model.
Fig 6Influential factors of diabetes mellitus extracted from decision tree model.
Spatial distribution with positive and negative influential factors for three cardiometabolic diseases.
| Disease | Spatial distribution | Positive influential factors | Negative influential factors |
|---|---|---|---|
| Hypertension | • | • | • |
| Stroke | • | • | • |
| Diabetes mellitus | • | • | • |
a indicates attributes selected as root node. The positive influential factors indicate variables of which the higher standardized value yields higher prevalence, while the negative influential factors indicate variables of which the lower standardized value yields higher prevalence.
Fig 5Influential factors of stroke extracted from decision tree model.