| Literature DB >> 35371281 |
Rongsheng Zhou1, Weihao Yin2, Wenjin Li1, Yingchun Wang1, Jing Lu1, Zhong Li1, Xinxin Hu1.
Abstract
Objective: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business employees have infectious disease health literacy.Entities:
Mesh:
Year: 2022 PMID: 35371281 PMCID: PMC8975663 DOI: 10.1155/2022/8498159
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1Flow chart of scale analysis.
Figure 2Schematic diagram of SMOTE algorithm.
Univariate analysis of health literacy and health behavior of residents and employees of enterprises.
| Items | Resident | Enterprise staff | ||
|---|---|---|---|---|
| Health literacy | Health literacy | Health literacy | Health literacy | |
| Gender | ||||
| Male | 36.33 ± 4.70 | 11.19 ± 1.65 | 36.42 ± 4.75 | 11.23 ± 1.66 |
| Female | 36.91 ± 4.49 | 11.33 ± 1.66 | 36.87 ± 4.41 | 11.27 ± 1.63 |
|
| 1.309 | 0.898 | 0.972 | 0.245 |
|
| 0.191 | 0.370 | 0.331 | 0.806 |
| Degree of education | ||||
| Junior high school and below | 33.00 ± 2.95 | 9.33 ± 0.95 | 33.29 ± 3.31 | 9.47 ± 1.10 |
| Senior high school/vocational high school/technical secondary school | 35.31 ± 3.11 | 11.33 ± 0.95 | 35.50 ± 3.39 | 11.34 ± 1.00 |
| Junior college | 38.31 ± 4.65 | 11.66 ± 0.25 | 38.27 ± 4.63 | 11.65 ± 1.24 |
| Undergraduate | 39.00 ± 4.35 | 13.00 ± 0.82 | 39.30 ± 4.27 | 12.88 ± 0.89 |
| Postgraduate | 43.33 ± 3.40 | 14.00 ± 0.84 | 43.53 ± 3.41 | 14.00 ± 0.82 |
|
| 67.406 | 217.531 | 65.143 | 196.757 |
|
| <0.001 | <0.001 | <0.001 | <0.001 |
| Age | ||||
| 16~20 years old | 33.10 ± 2.98 | 9.48 ± 1.06 | 33.35 ± 2.82 | 9.46 ± 0.90 |
| 20~30 years old | 35.59 ± 3.55 | 11.33 ± 1.22 | 34.87 ± 3.75 | 10.82 ± 1.52 |
| 30~40 years old | 38.54 ± 4.64 | 11.88 ± 1.31 | 38.51 ± 4.70 | 11.84 ± 1.28 |
| 40~50 years old | 39.82 ± 4.60 | 13.22 ± 0.96 | 39.25 ± 4.41 | 12.68 ± 1.28 |
| >50 years old | 34.51 ± 3.68 | 10.38 ± 1.43 | 35.85 ± 5.31 | 11.27 ± 2.10 |
|
| 47.862 | 107.497 | 28.792 | 48.625 |
|
| <0.001 | 0.037 | <0.001 | <0.001 |
Performance comparison of the 3 classification models applied to raw data.
| Model | Recall | Accuracy rates | f-scores | Precision | AUC |
|---|---|---|---|---|---|
| Logistic | 0.291 | 0.828 | 0.431 | 0.936 | 0.754 |
| Random forest | 0.349 | 0.612 | 0.448 | 0.932 | 0.817 |
| SVM | 0.419 | 0.654 | 0.497 | 0.938 | 0.759 |
Figure 3ROC curves of three classification models applied to the raw data.
Performance comparison of 3 classification models applied to data processed by SMOTE algorithm.
| Model | Recall | Accuracy rates |
| Precision | AUC |
|---|---|---|---|---|---|
| Logistic | 0.608 | 0.938 | 0.739 | 0.816 | 0.913 |
| Random forest | 0.732 | 0.911 | 0.817 | 0.855 | 0.925 |
| SVM | 0.741 | 0.894 | 0.806 | 0.851 | 0.910 |
Figure 4ROC curve of three classification models applied to data processed by the SMOTE algorithm.