| Literature DB >> 33256679 |
Jie Wang1, Chao Li1, Jing Li1, Sheng Qin1, Chunlei Liu2, Jiaojiao Wang1, Zhe Chen1, Jianhui Wu3,4, Guoli Wang1,5.
Abstract
BACKGROUND: The prevalence of metabolic syndrome continues to rise sharply worldwide, seriously threatening people's health. The optimal model can be used to identify people at high risk of metabolic syndrome as early as possible, to predict their risk, and to persuade them to change their adverse lifestyle so as to slow down and reduce the incidence of metabolic syndrome.Entities:
Keywords: Data mining; Metabolic syndrome; Oil workers; Risk prediction
Mesh:
Year: 2020 PMID: 33256679 PMCID: PMC7706262 DOI: 10.1186/s12889-020-09921-w
Source DB: PubMed Journal: BMC Public Health ISSN: 1471-2458 Impact factor: 3.295
Fig. 1Comparison of abnormal rates among components of metabolic syndrome
Comparison of the basic conditions of oil workers with and without metabolic syndrome
| Basic conditions | Category(Unit) | MetS n(%)/M(P25,P75) | |||
|---|---|---|---|---|---|
| No | Yes | ||||
| Age | Year | 43(38,47) | 44(40,49) | −5.79 | < 0.001 |
| Gender | Male | 601(69.00) | 504(84.42) | 45.26 | < 0.001 |
| Female | 270(31.00) | 93(15.58) | |||
| BMI | Kg/m2 | 23.9(21.90,25.90) | 26.80(24.90,28.80) | −16.35 | < 0.001 |
| Marital status | Unmarried | 56(6.43) | 15(2.51) | 11.82 | 0.003 |
| Married | 782(89.78) | 559(93.63) | |||
| Others | 33(3.79) | 23(3.85) | |||
| Education level | Junior high school and below | 133(15.27) | 104(17.42) | 9.07 | 0.011 |
| High school/technical secondary school | 374(42.94) | 290(48.58) | |||
| College and above | 364(41.79) | 203(34.00) | |||
| Per capita monthly household income(Yuan) | < 2000 | 619(71.07) | 454(76.05) | 8.05 | 0.018 |
| 2000~ | 212(24.34) | 109(18.26) | |||
| 3000~ | 40(4.59) | 34(5.70) | |||
| Family history of hypertension | No | 489(56.14) | 288(48.24) | 8.88 | 0.003 |
| Yes | 382(43.86) | 309(51.76) | |||
| Family history of hyperlipidemia | No | 801(91.96) | 538(90.12) | 1.51 | 0.22 |
| Yes | 70(8.04) | 59(9.88) | |||
| Family history of diabetes mellitus | No | 725(83.24) | 454(76.05) | 11.58 | 0.001 |
| Yes | 146(16.76) | 143(23.95) | |||
Comparison of diet and lifestyle of oil workers with and without metabolic syndrome
| Factors | Category | MetS n(%)/M(P25,P75) | |||
|---|---|---|---|---|---|
| No | Yes | ||||
| Salt | Light | 221(25.37) | 88(14.74) | 26.39 | < 0.001 |
| Moderate | 381(43.74) | 276(46.23) | |||
| Salty | 269(30.88) | 233(39.03) | |||
| Meat intake | Never | 23(2.64) | 13(2.18) | 9.38 | 0.025 |
| Occasionally | 198(22.73) | 101(16.92) | |||
| Regularly | 335(38.46) | 232(38.86) | |||
| Every day | 315(36.17) | 251(42.04) | |||
| Fruit intake | Never | 37(4.25) | 27(4.52) | 6.59 | 0.086 |
| Occasionally | 278(31.92) | 223(37.35) | |||
| Regularly | 258(29.62) | 146(24.46) | |||
| Every day | 298(34.21) | 201(33.67) | |||
| Dairy intake | Never | 127(14.58) | 103(17.25) | 119.81 | < 0.001 |
| Occasionally | 230(26.41) | 297(49.75) | |||
| Regularly | 199(22.85) | 111(18.59) | |||
| Every day | 315(36.17) | 86(14.41) | |||
| Carbonated beverage intake | Never | 370(42.48) | 270(45.23) | 10.52 | 0.015 |
| Occasionally | 384(44.09) | 258(43.22) | |||
| Regularly | 79(9.07) | 31(5.19) | |||
| Every day | 38(4.36) | 38(6.37) | |||
| Physical exercise | No | 307(35.25) | 259(43.38) | 9.90 | 0.002 |
| Yes | 564(64.75) | 338(56.62) | |||
| Smoking status | No smoking | 524(60.16) | 262(43.89) | 39.30 | < 0.001 |
| Quit smoking | 51(5.86) | 61(10.22) | |||
| Smoking | 296(33.98) | 274(45.90) | |||
| Drinking status | No drinking | 585(67.16) | 309(51.76) | 37.02 | < 0.001 |
| Alcohol withdrawal | 16(1.84) | 24(4.02) | |||
| Drinking | 270(31.00) | 264(44.22) | |||
Comparison of occupational exposure factors of oil workers with and without metabolic syndrome
| Factors | Category | MetS n(%)/M(P25,P75) | |||
|---|---|---|---|---|---|
| No | Yes | ||||
| Shift work situation | Never | 535(61.42) | 254(42.55) | 51.44 | < 0.001 |
| Once | 208(23.88) | 202(33.84) | |||
| Now | 128(14.70) | 141(23.62) | |||
| Labour intensity | Mild | 93(10.68) | 44(7.37) | 5.36 | 0.069 |
| Moderate | 434(49.83) | 295(49.41) | |||
| Severe | 344(39.49) | 258(43.22) | |||
| Occupational heat | No | 548(62.92) | 266(44.56) | 48.34 | < 0.001 |
| Yes | 323(37.08) | 331(55.44) | |||
| Noise | No | 429(49.25) | 206(34.51) | 31.39 | < 0.001 |
| Yes | 442(50.75) | 391(65.49) | |||
Comparison of laboratory tests in oil workers with and without metabolic syndrome
| Biochemical Indicators | MetS n(%)/M(P25,P75) | |||
|---|---|---|---|---|
| No | Yes | |||
| RBC(× 1012/L) | 5.01(4.65,5.33) | 5.29(4.99,5.54) | −6.94 | < 0.001 |
| MCV(fl) | 88.80(85.10,92.00) | 88.20(84.80,91.80) | −0.85 | 0.397 |
| BPC(×1012/L) | 256.00(219.50,290.75) | 251.00(211.00,284.00) | −0.55 | 0.59 |
| MPV(fl) | 8.20(7.70,8.80) | 8.20(7.70,8.80) | −0.83 | 0.405 |
| Hemoglobin(g/L) | 155(141,165) | 160(151,169) | −6.44 | < 0.001 |
| TBIL(mmol/L) | 13.50(10.50,17.70) | 13.45(10.30,17.10) | −0.81 | 0.421 |
| UA(mmol/L) | 307(242,373) | 367(304,426) | −11.13 | < 0.001 |
| ALT(U/L) | 20.00(14.00,24.00) | 35.00(21.00,45.00) | −17.07 | < 0.001 |
Multivariate nonconditional Logistic regression analysis of influencing factors in oil workers with metabolic syndrome
| Factors | ||||||
|---|---|---|---|---|---|---|
| Age | 0.088 | 0.012 | 55.251 | 0.000 | 1.092 | 1.067, 1.118 |
| Per capita monthly household income(2000~) | −0.77 | 0.22 | 12.244 | 0.000 | 0.463 | 0.301, 0.713 |
| Per capita monthly household income(3000~) | 0.166 | 0.388 | 0.184 | 0.668 | 1.181 | 0.552, 2.525 |
| BMI | 0.273 | 0.026 | 114.091 | 0.000 | 1.313 | 1.249, 1.381 |
| Family history of diabetes mellitus | 0.373 | 0.183 | 4.129 | 0.042 | 1.452 | 1.013, 2.080 |
| Salt(Moderate) | 0.86 | 0.206 | 17.429 | 0.000 | 2.362 | 1.578, 3.536 |
| Salt(Salty) | 0.555 | 0.214 | 6.759 | 0.009 | 1.742 | 1.146, 2.648 |
| Dairy intake(Occasionally) | 0.676 | 0.216 | 9.771 | 0.002 | 1.966 | 1.287, 3.003 |
| Dairy intake(Every day) | −1.149 | 0.261 | 19.317 | 0.000 | 0.317 | 0.190, 0.529 |
| Carbonated beverage intake(Every day) | 1.102 | 0.365 | 9.148 | 0.002 | 3.012 | 1.474, 6.153 |
| Physical exercise | −0.398 | 0.152 | 6.86 | 0.009 | 0.672 | 0.499, 0.905 |
| Smoking status(Smoking) | 0.431 | 0.181 | 5.675 | 0.017 | 1.539 | 1.079, 2.194 |
| Shift work situation(Once) | 0.974 | 0.172 | 32.184 | 0.000 | 2.648 | 1.892, 3.707 |
| Shift work situation(Now) | 1.509 | 0.237 | 40.489 | 0.000 | 4.522 | 2.841, 7.198 |
| Occupational heat | 0.656 | 0.224 | 8.548 | 0.003 | 1.926 | 1.241, 2.989 |
| UA | 0.004 | 0.001 | 27.244 | 0.000 | 1.004 | 1.003, 1.006 |
| ALT | 0.029 | 0.005 | 40.946 | 0.000 | 1.030 | 1.020, 1.039 |
Assignment of influencing factor variables
| Variable name | Variable meaning | Assignment method |
|---|---|---|
| Y | MetS | 0 = No,1 = Yes |
| X1 | Age | Continuous variable (year) |
| X2 | Per capita monthly household income | 1 = < 2000,2 = 2000–3000,3 = ≥3000 |
| X3 | BMI | Continuous variable(Kg/m2) |
| X4 | Family history of diabetes mellitus | 1 = No,2 = Yes |
| X5 | Salt | 1 = Light,2 = Moderate,3 = Salty |
| X6 | Dairy intake | 1 = Never,2 = Occasionally,3 = Regularly,4 = Every day |
| X7 | Carbonated beverage intake | 1 = Never,2 = Occasionally,3 = Regularly,4 = Every day |
| X8 | Physical exercise | 1 = No,2 = Yes |
| X9 | Smoking status | 1 = No smoking,2 = Quit smoking,3 = Smoking |
| X10 | Shift work situation | 1 = Never,2 = Once,3 = Now |
| X11 | Occupational heat | 1 = No,2 = Yes |
| X12 | UA | Continuous variable(mmol/L) |
| X13 | ALT | Continuous variable(U/L) |
Sample classification results of Logistic regression model, Random Forest model and Convolutional neural network model [N (%)]
| Model | predictive value | Actual value | ||
|---|---|---|---|---|
| Yes | No | Total | ||
| Logistic regression model | Yes | 766(87.94) | 152(25.46) | 918 |
| No | 105(12.06) | 445(74.54) | 550 | |
| Total | 871 | 597 | 1468 | |
| Random forest model | Yes | 832(95.52) | 20(3.35) | 852 |
| No | 39(4.48) | 577(96.65) | 616 | |
| Total | 871 | 597 | 1468 | |
| CNN | Yes | 789(90.59) | 35(5.86) | 824 |
| No | 82(9.41) | 562(94.14) | 644 | |
| Total | 871 | 597 | 1468 | |
Comparison of predictive performance of the three models
| Evaluation index | Logistic regression model | Random forest model | CNN |
|---|---|---|---|
| Accuracy rate(%) | 82.49 | 95.98 | 92.03 |
| Sensitivity(%) | 87.94 | 95.52 | 90.59 |
| Specificity(%) | 74.54 | 96.65 | 94.14 |
| F1 Score | 0.86 | 0.97 | 0.93 |
| AUC | 0.88 | 0.96 | 0.92 |
| Brier score | 0.15 | 0.08 | 0.12 |
| observed-expected ratio | 0.83 | 0.97 | 1.13 |
| calibration-in-the-large | 0.109 | 0.099 | 0.098 |
| Integrated Calibration Index | 0.075 | 0.073 | 0.074 |
Fig. 2ROC curves and calibration curves of three predictive models