| Literature DB >> 35660759 |
Peter C Austin1,2,3, Frank E Harrell4, Douglas S Lee5,6,7, Ewout W Steyerberg8.
Abstract
Machine learning is increasingly being used to predict clinical outcomes. Most comparisons of different methods have been based on empirical analyses in specific datasets. We used Monte Carlo simulations to determine when machine learning methods perform better than statistical learning methods in a specific setting. We evaluated six learning methods: stochastic gradient boosting machines using trees as the base learners, random forests, artificial neural networks, the lasso, ridge regression, and linear regression estimated using ordinary least squares (OLS). Our simulations were informed by empirical analyses in patients with acute myocardial infarction (AMI) and congestive heart failure (CHF) and used six data-generating processes, each based on one of the six learning methods, to simulate continuous outcomes in the derivation and validation samples. The outcome was systolic blood pressure at hospital discharge, a continuous outcome. We applied the six learning methods in each of the simulated derivation samples and evaluated performance in the simulated validation samples. The primary observation was that neural networks tended to result in estimates with worse predictive accuracy than the other five methods in both disease samples and across all six data-generating processes. Boosted trees and OLS regression tended to perform well across a range of scenarios.Entities:
Mesh:
Year: 2022 PMID: 35660759 PMCID: PMC9166797 DOI: 10.1038/s41598-022-13015-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Baseline characteristics of patients in the AMI derivation and validation samples.
| Variable | Derivation sample (N = 8145) | Validation sample (N = 4444) | P-value |
|---|---|---|---|
| Discharge systolic blood pressure | 120.40 ± 19.69 | 122.48 ± 20.60 | < 0.001 |
| Age | 66.51 ± 13.58 | 69.13 ± 14.32 | < 0.001 |
| Female | 2792 (34.3%) | 1709 (38.5%) | < 0.001 |
| Systolic blood pressure | 148.87 ± 31.15 | 144.64 ± 31.24 | < 0.001 |
| Diastolic blood pressure | 83.86 ± 18.46 | 80.39 ± 18.42 | < 0.001 |
| Heart rate | 83.61 ± 23.77 | 85.72 ± 23.74 | < 0.001 |
| Respiratory rate | 20.86 ± 5.45 | 20.41 ± 5.32 | < 0.001 |
| Cardiogenic shock | 56 (0.7%) | < = 5 | *** |
| Acute congestive heart failure/pulmonary edema | 389 (4.8%) | 293 (6.6%) | < 0.001 |
| Diabetes | 2072 (25.4%) | 1268 (28.5%) | < 0.001 |
| Hypertension | 3731 (45.8%) | 2658 (59.8%) | < 0.001 |
| Current smoker | 2753 (33.8%) | 1208 (27.2%) | < 0.001 |
| Dyslipidemia | 2597 (31.9%) | 2004 (45.1%) | < 0.001 |
| Family history of coronary artery disease | 2603 (32.0%) | 1377 (31.0%) | 0.262 |
| Cerebrovascular accident/transient ischemic attack | 772 (9.5%) | 583 (13.1%) | < 0.001 |
| Angina | 2685 (33.0%) | 1361 (30.6%) | 0.007 |
| Cancer | 225 (2.8%) | 80 (1.8%) | < 0.001 |
| Dementia | 250 (3.1%) | 267 (6.0%) | < 0.001 |
| Peptic ulcer disease | 452 (5.5%) | 226 (5.1%) | 0.27 |
| Previous AMI | 1824 (22.4%) | 1139 (25.6%) | < 0.001 |
| Asthma | 448 (5.5%) | 282 (6.3%) | 0.052 |
| Depression | 566 (6.9%) | 483 (10.9%) | < 0.001 |
| Peripheral vascular disease | 590 (7.2%) | 398 (9.0%) | < 0.001 |
| Previous revascularization | 749 (9.2%) | 604 (13.6%) | < 0.001 |
| Congestive heart failure | 331 (4.1%) | 283 (6.4%) | < 0.001 |
| Hyperthyroidism | 102 (1.3%) | 15 (0.3%) | < 0.001 |
| Aortic stenosis | 119 (1.5%) | 86 (1.9%) | 0.045 |
| Hemoglobin | 138.70 ± 18.67 | 135.66 ± 20.66 | < 0.001 |
| White blood count | 10.23 ± 4.83 | 10.43 ± 4.27 | 0.025 |
| Sodium | 139.03 ± 3.75 | 138.62 ± 3.93 | < .001 |
| Potassium | 4.09 ± 0.55 | 4.11 ± 0.58 | 0.064 |
| Glucose | 9.37 ± 5.21 | 9.01 ± 4.53 | < 0.001 |
| Urea | 7.38 ± 4.53 | 8.13 ± 5.40 | < 0.001 |
| Creatinine | 103.60 ± 58.77 | 111.64 ± 72.95 | < 0.001 |
Continuous variables are reported as mean ± standard deviation, while binary variables are reported as N (%).
***Suppressed due to small sample size.
Baseline characteristics of patients in the CHF derivation and validation samples.
| Variable | Derivation sample (N = 7156) | Validation sample (N = 6818) | P-value |
|---|---|---|---|
| Discharge systolic blood pressure | 124.87 ± 22.27 | 125.77 ± 21.94 | 0.017 |
| Age | 75.20 ± 11.54 | 76.23 ± 11.58 | < 0.001 |
| Female | 3578 (50.0%) | 3460 (50.7%) | 0.377 |
| Systolic blood pressure | 150.41 ± 33.22 | 148.42 ± 32.27 | < 0.001 |
| Heart rate | 94.46 ± 25.30 | 92.36 ± 25.73 | < 0.001 |
| Respiratory rate | 25.96 ± 7.25 | 24.45 ± 6.91 | < 0.001 |
| Neck vein distension | 3946 (55.1%) | 4148 (60.8%) | < 0.001 |
| S3 | 707 (9.9%) | 430 (6.3%) | < 0.001 |
| S4 | 275 (3.8%) | 189 (2.8%) | < 0.001 |
| Rales > 50% of lung field | 739 (10.3%) | 845 (12.4%) | < 0.001 |
| Pulmonary edema | 3691 (51.6%) | 4130 (60.6%) | < 0.001 |
| Cardiomegaly | 2552 (35.7%) | 3014 (44.2%) | < 0.001 |
| Diabetes | 2498 (34.9%) | 2582 (37.9%) | < 0.001 |
| Cerebrovascular disease/transient ischemic attack | 1144 (16.0%) | 1223 (17.9%) | 0.002 |
| Previous AMI | 2637 (36.9%) | 2508 (36.8%) | 0.936 |
| Atrial fibrillation | 2070 (28.9%) | 2401 (35.2%) | < 0.001 |
| Peripheral vascular disease | 897 (12.5%) | 917 (13.4%) | 0.108 |
| Chronic obstructive pulmonary disease | 1171 (16.4%) | 1521 (22.3%) | < 0.001 |
| Dementia | 472 (6.6%) | 626 (9.2%) | < 0.001 |
| Cirrhosis | 51 (0.7%) | 52 (0.8%) | 0.73 |
| Cancer | 802 (11.2%) | 759 (11.1%) | 0.888 |
| Left bundle branch block | 1056 (14.8%) | 915 (13.4%) | 0.023 |
| Hemoglobin | 124.17 ± 20.65 | 123.23 ± 20.53 | 0.007 |
| WBC (white blood cell) count | 9.89 ± 5.23 | 9.65 ± 4.24 | 0.003 |
| Sodium | 138.37 ± 4.74 | 138.43 ± 4.86 | 0.451 |
| Potassium | 4.28 ± 0.66 | 4.26 ± 0.66 | 0.123 |
| Glucose | 9.03 ± 4.69 | 8.61 ± 4.08 | < 0.001 |
| Urea level | 10.00 ± 6.32 | 9.92 ± 6.04 | 0.458 |
| Creatinine | 129.63 ± 94.43 | 126.42 ± 81.08 | 0.031 |
Continuous variables are reported as mean ± standard deviation, while binary variables are reported as N (%).
Figure 1Performance in validation sample (Case study).
Figure 2Performance in AMI sample (External validation).
Figure 3Performance in CHF sample (External validation).