| Literature DB >> 34707419 |
Hui Zhang1, Dandan Chen1, Jing Shao1, Ping Zou2, Nianqi Cui3, Leiwen Tang1, Xiyi Wang4, Dan Wang1, Jingjie Wu1, Zhihong Ye1.
Abstract
PURPOSE: Machine learning (ML) techniques have emerged as a promising tool to predict risk and make decisions in different medical domains. We aimed to compare the predictive performance of machine learning-based methods for 4-year risk of metabolic syndrome in adults with the previous model using logistic regression. PATIENTS AND METHODS: This was a retrospective cohort study that employed a temporal validation strategy. Three popular ML techniques were selected to build the prognostic models. These techniques were artificial neural networks, classification and regression tree, and support vector machine. The logistic regression algorithm and ML techniques used the same five predictors. Discrimination, calibration, Brier score, and decision curve analysis were compared for model performance.Entities:
Keywords: calibration; discrimination; machine learning; metabolic syndrome; prognosis model
Year: 2021 PMID: 34707419 PMCID: PMC8543031 DOI: 10.2147/RMHP.S328180
Source DB: PubMed Journal: Risk Manag Healthc Policy ISSN: 1179-1594
Characteristics of Participants
| Candidate Predictor | Derivation Cohort (N=6793) n (%)/Mean±SD | Missing Values n (%) | External Cohort (N=7681) n (%)/Mean±SD | Missing Values n (%) |
|---|---|---|---|---|
| Age | 41.488±10.411 | 0 | 41.988±11.246 | 0 |
| TC | 4.419±0.846 | 0.471 | 4.834±0.893 | 0.234 |
| UA | 303.305±86.971 | 0.501 | 331.758±83.226 | 0.234 |
| BMI | 22.802±2.726 | 30.59 | 22.697±2.681 | 31.194 |
| ALT | 21.453±20.271 | 0.118 | 20.984±15.619 | 0.169 |
| WC | 86.445±6.972 | 16.971 | 86.445±7.327 | 17.057 |
| TG | 1.969±1.383 | 0.114 | 1.83±1.165 | 0.045 |
| HDL-c | 1.151±0.29 | 0.114 | 1.183±0.3 | 0.045 |
| SBP | 127.648±14.457 | 4.914 | 125.101±14.657 | 3.375 |
| DBP | 78.301±10.582 | 4.914 | 75.841±10.361 | 3.375 |
| FPG | 5.306±0.993 | 0.114 | 5.324±0.762 | 0.045 |
Results of the Discrimination at Internal and External Validation
| Model | Internal Validation | External Validation |
|---|---|---|
| Logistic regression | 0.783(0.772–0.795) | 0.782(0.771–0.793) |
| ANN | 0.788 (0.777–0.800) | 0.780 (0.769–0.791) |
| CART | 0.663 (0.645–0.681) | 0.740 (0.728–0.752) |
| SVM | 0.740 (0.726–0.754) | 0.742(0.729–0.755) |
Figure 1Calibration plot in internal validation.
Figure 2Calibration plot in external validation.
Results of the Calibration Intercept, Calibration Slope, and the Brier Score at Internal and External Validation
| Model | Internal Validation | External Validation |
|---|---|---|
| Calibration Intercept | ||
| Logistic regression | −0.008(−0.088–0.073) | −0.045(−0.113–0.022) |
| ANN | 0.009 (−0.068–0.086) | 0.102 (0.032–0.172) |
| CART | 0 (−0.077–0.078) | −0.072 (−0.139–0.004) |
| SVM | 0.310 (0.201–0.420) | 0.341(0.250–0.433) |
| Calibration Slope | ||
| Logistic regression | 0.995(0.934–1.058) | 1.006(−0.011–1.063) |
| ANN | 0.996 (0.934–1.059) | 0.974 (0.918–1.031) |
| CART | 1.000 (0.939–1.062) | 0.812 (0.760–0.864) |
| SVM | 0.559(0.518–0.600) | 0.574(0.539–0.611) |
| Brier Score | ||
| Logistic regression | 0.156 | 0.164 |
| ANN | 0.153 | 0.165 |
| CART | 0.218 | 0.176 |
| SVM | 0.185 | 0.192 |
Figure 3Decision-curve analysis in external validation dataset.