| Literature DB >> 26290899 |
Apilak Worachartcheewan1, Watshara Shoombuatong2, Phannee Pidetcha3, Wuttichai Nopnithipat2, Virapong Prachayasittikul4, Chanin Nantasenamat2.
Abstract
AIMS: This study proposes a computational method for determining the prevalence of metabolic syndrome (MS) and to predict its occurrence using the National Cholesterol Education Program Adult Treatment Panel III (NCEP ATP III) criteria. The Random Forest (RF) method is also applied to identify significant health parameters.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26290899 PMCID: PMC4531182 DOI: 10.1155/2015/581501
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Individual components of metabolic syndrome in the study subjects.
Comparison of clinical and biochemical parameters between MS and non-MS groups.
| MS | Non-MS |
| |
|---|---|---|---|
| Case number | 1,338 (23.70) | 4,308 (76.30) | — |
| Male | 696 (34.32) | 1,332 (65.68) | — |
| Female | 642 (17.74) | 2,976 (82.26) | — |
| Age (year) | 46.99 ± 9.54 | 40.35 ± 10.41 | <0.001 |
| WC (cm) | 92.08 ± 9.06 | 78.52 ± 9.23 | <0.001 |
| BMI (kg/m2) | 27.56 ± 4.21 | 22.66 ± 3.49 | <0.001 |
| SBP (mmHg) | 133.28 ± 12.50 | 119.81 ± 12.62 | <0.001 |
| DBP (mmHg) | 85.15 ± 9.30 | 77.47 ± 9.06 | <0.001 |
| FPG (mg/dL) | 108.35 ± 33.83 | 90.42 ± 12.73 | <0.001 |
| CHOL (mg/dL) | 216.63 ± 39.32 | 205.01 ± 36.00 | <0.001 |
| TG (mg/dL) | 196.62 ± 90.71 | 100.36 ± 48.24 | <0.001 |
| LDL-C (mg/dL) | 131.21 ± 46.98 | 121.23 ± 32.68 | <0.001 |
| HDL-C (mg/dL) | 48.07 ± 11.19 | 63.78 ± 14.48 | <0.001 |
| WBC (×109/L) | 7.26 ± 1.77 | 6.46 ± 1.56 | <0.001 |
| Hb (g/dL) | 14.28 ± 1.55 | 13.60 ± 1.47 | <0.001 |
| Hct (%) | 41.97 ± 4.19 | 39.92 ± 3.98 | <0.001 |
| PLT (×109/L) | 274.38 ± 66.21 | 261.71 ± 60.70 | <0.001 |
| Smoking | 0.105 ± 0.307 | 0.057 ± 0.232 | <0.001 |
| Alcohol | 0.372 ± 0.484 | 0.296 ± 0.457 | <0.001 |
Data were expressed as the mean ± SD or as percentages. MS: metabolic syndrome, non-MS: nonmetabolic syndrome, WC: waist circumference, BMI: body mass index, SBP: systolic blood pressure, DBP: diastolic blood pressure, FPG: fasting plasma glucose, CHOL: total cholesterol, TG: triglyceride, LDL-C: low-density lipoprotein cholesterol, HDL-C: high-density lipoprotein cholesterol, WBC: white blood cells, Hb: hemoglobin, Hct: hematocrit, and PLT: platelet. Smoking and alcohol refer to individuals who smoke cigarettes and consume alcohol.
Figure 2Box plots of biochemical parameters of metabolic syndrome (Yes) and nonmetabolic syndrome (No) groups.
Figure 3Prevalence of metabolic syndrome components among the subjects using NCEP ATP III.
Figure 4Prevalence of metabolic syndrome in different age groups.
The number of subjects used as internal and external validation sets for predicting MS.
| Status | Initial | Internal validation set | External validation set |
|---|---|---|---|
| MS | 1337 | 1137 | 200 |
| Non-MS | 4306 | 3659 | 647 |
| Total | 5643 | 4796 | 847 |
Summary of statistical parameters for MS classification using Random Forest.
|
| Internal test set (10-fold CV) | External test set | ||||||
|---|---|---|---|---|---|---|---|---|
| Acc | Sens | Spec | MCC | Acc | Sens | Spec | MCC | |
| 10 | 97.10 | 94.28 | 97.98 | 0.92 | 97.99 | 95.00 | 98.92 | 0.94 |
| 20 | 97.94 | 95.07 | 98.82 | 0.94 | 98.11 | 94.00 | 99.38 | 0.95 |
| 30 | 98.02 | 94.72 | 99.04 | 0.94 | 97.64 | 92.00 | 99.38 | 0.93 |
| 40 | 98.02 | 94.81 | 99.02 | 0.94 | 97.76 | 92.50 | 99.38 | 0.94 |
| 50 | 98.02 | 94.64 | 99.07 | 0.94 | 97.76 | 92.50 | 99.38 | 0.94 |
10-fold CV: 10-fold cross-validation, Acc: accuracy, Sens: sensitivity, Spec: specificity, and MCC: Matthews correlation coefficient.
Figure 5Health parameters importance graph.
Figure 6Scatter plots of MS component classifications: MS (red) and non-MS (blue) groups.
Summary of prediction performance for MS classification using Random Forest from 20 independent runs.
| Prediction performance | Internal test set (10-fold CV) | External test set | ||||||
|---|---|---|---|---|---|---|---|---|
| Acc | Sens | Spec | MCC | Acc | Sens | Spec | MCC | |
| Mean | 97.88 | 94.54 | 98.91 | 0.94 | 98.12 | 94.80 | 99.15 | 0.95 |
| SD | 0.18 | 0.65 | 0.12 | 0.00 | 0.45 | 1.49 | 0.45 | 0.01 |
10-fold CV: 10-fold cross-validation, Acc: accuracy, Sens: sensitivity, Spec: specificity, and MCC: Matthews correlation coefficient.
Decision rules extracted from one of twenty trees from the predictive model trained with Random Forest.
| Frequency (%) | Error (%) | Condition | Prediction |
|---|---|---|---|
| 42 | 0 | WC ≤ 79.75 and TG ≤ 150.5 | Non-MS |
| 10.1 | 0 | FPG ≤ 99.5 and TG ≤ 149.5 and HDL-C > 49.15 and LDL-C ≤ 128.05 | Non-MS |
| 9.6 | 0 | WC > 87.75 and systolic > 127 and TG > 149.5 | MS |
| 3.3 | 0 | Sex = female and WC > 79.5 and systolic > 125 and FPG > 99.5 | MS |
| 1.7 | 0 | WC > 87.5 and systolic > 128 and FPG > 99.5 | MS |
| 1.7 | 0 | Sex = female and WC > 80 and TG > 150.5 and HDL-C ≤ 48.1 | MS |
| 1.7 | 0 | WC ≤ 87.75 and systolic ≤ 127 and Hct > 44.77 | Non-MS |
| 1.3 | 0 | Sex = female and WC > 79.5 and systolic > 129 and HDL-C ≤ 49.75 | MS |
| 8.7 | 0 | FPG ≤ 99.5 and TG ≤ 149.5 and HDL-C > 49.15 | Non-MS |
| 1.1 | 0 | FPG ≤ 99.5 and TG ≤ 149.5 and HDL-C > 38.7 and Hct > 44.1 | Non-MS |
| 1.1 | 0 | Systolic > 125 and FPG > 99.5 and TG > 149.5 | MS |
| 1.5 | 0 | WC ≤ 78.5 and HDL-C > 50.45 | Non-MS |
| 1 | 0 | WC > 87.75 and FPG > 99.5 and TG > 148.5 | MS |
| 1.5 | 1.1 | WC ≤ 87.5 and TG ≤ 150.5 and Hct > 42.305 | Non-MS |
| 1.8 | 1.9 | FPG ≤ 99.5 and TG ≤ 149.5 and HDL-C > 39.6 | Non-MS |
| 2.1 | 2.5 | Systolic ≤ 125 and diastolic ≤ 85 and TG ≤ 149.5 and HDL-C > 47.2 | Non-MS |
| 2.5 | 2.9 | Sex = male and WC ≤ 87.75 and HDL-C > 39.95 | Non-MS |
| 1 | 6.8 | Diastolic > 85 and TG > 149.5 | MS |
| 1.6 | 6.5 | WC ≤ 87.75 and systolic ≤ 127 and FPG ≤ 99.5 | Non-MS |
| 1.1 | 0 | TG > 150.5 and HDL-C ≤ 40.05 | MS |