| Literature DB >> 31601916 |
Paul D Myers1, Wei Huang2,3, Fred Anderson2,3, Collin M Stultz4,5,6.
Abstract
Most risk stratification methods use expert opinion to identify a fixed number of clinical variables that have prognostic significance. In this study our goal was to develop improved metrics that utilize a variable number of input parameters. We first used Bootstrap Lasso Regression (BLR) - a Machine Learning method for selecting important variables - to identify a prognostic set of features that identify patients at high risk of death 6-months after presenting with an Acute Coronary Syndrome. Using data derived from the Global Registry of Acute Coronary Events (GRACE) we trained a logistic regression model using these features and evaluated its performance on a development set (N = 43,063) containing patients who have values for all features, and a separate dataset (N = 6,363) that contains patients who have missing feature values. The final model, Ridge Logistic Regression with Variable Inputs (RLRVI), uses imputation to estimate values for missing features. BLR identified 19 features, 8 of which appear in the GRACE score. RLRVI had modest, yet statistically significant, improvement over the standard GRACE score on both datasets. Moreover, for patients who are relatively low-risk (GRACE≤87), RLRVI had an AUC and Hazard Ratio of 0.754 and 6.27, respectively, vs. 0.688 and 2.46 for GRACE, (p < 0.007). RLRVI has improved discriminatory performance on patients who have values for the 8 GRACE features plus any subset of the 11 non-GRACE features. Our results demonstrate that BLR and data imputation can be used to obtain improved risk stratification metrics, particularly for patients who are classified as low risk using traditional methods.Entities:
Mesh:
Year: 2019 PMID: 31601916 PMCID: PMC6787006 DOI: 10.1038/s41598-019-50933-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Features selected by the bootstrap lasso.
| Demographics | Appears in GRACE Score? |
|---|---|
| Age | Yes |
| Admission weight | No |
|
| |
| Congestive heart failure | No |
| Peripheral artery disease | No |
| Renal insufficiency | No |
|
| |
| Systolic blood pressure | Yes |
| Pulse | Yes |
| Killip class | Yes |
| Cardiac arrest | Yes |
| ST segment deviation | Yes |
|
| |
| Warfarin | No |
| Medications, pre-hospital or within 1st 24 hours | |
| Statin | No |
| Diuretic | No |
| Insulin | No |
| IV inotropic agent | No |
| Oral beta blocker | No |
| IV beta blocker | No |
|
| |
| Initial creatinine | Yes |
| Initial positive enzymes | Yes |
GRACE = Global Registry of Acute Coronary Events; IV = intravenous.
Population characteristics in the development and validation sets.
| Development Set | Validation Set | |
|---|---|---|
| Population size | 43,063 | 6,363 |
| Low-risk (GRACE score ≤ 87) | 13,205 | 1,665 |
| Mortalities | 3,078 (7.15%) | 719 (11.3%) |
| Low-risk (GRACE score ≤ 87) | 316 (1.16%) | 29 (1.74%) |
|
| ||
| Age (years) | 66.1 (55.7–75.8) | 68.2 (57.1–77.6) |
| Female | 32.6% | 33.9% |
| Height (cm) | 170 (162–175) | 169 (161–175) |
| Admission weight (kg) | 77.0 (67.0–88.0) | 77.0 (67.2–87.2) |
| Medical History (%) | ||
| Congestive heart failure | 10.5 | 11.1 |
| Peripheral artery disease | 9.7 | 9.2 |
| Angina | 51.9 | 45.5 |
| Coronary Artery Bypass Graft (CABG) | 12.6 | 11.9 |
| Myocardial Infarction (MI) | 30.3 | 31.0 |
| Hypertension | 62.1 | 61.6 |
| Hyperlipidemia | 48.3 | 48.1 |
| Diabetes | 25.1 | 26.3 |
| Percutaneous Coronary Intervention (PCI) | 17.7 | 17.7 |
| Smoking | 57.7 | 53.0 |
| TIA/Stroke | 8.3 | 9.1 |
| Renal insufficiency | 7.8 | 8.0 |
|
| ||
| Systolic blood pressure (mmHg) | 140 (120–160) | 140 (120–160) |
| Pulse (bpm) | 77 (65–90) | 77 (65–90) |
| Killip class I | 83.3% | 81.6% |
| Killip class II | 12.0% | 12.6% |
| Killip class III | 3.9% | 4.6% |
| Killip class IV | 0.8% | 1.3% |
| Cardiac arrest | 1.7% | 2.3% |
| ST segment deviation | 54.8% | 53.1% |
|
| ||
| Oral beta blocker, pre-hospital acute or within 1st 24 hours in hospital | 69.8 | 67.1 |
| Warfarin, chronic use | 4.5 | 5.2 |
| Statin, pre-hospital acute or within 1st 24 hours in hospital | 51.0 | 56.6 |
| Diuretic, pre-hospital acute or within 1st 24 hours in hospital | 25.3 | 28.0 |
| Insulin, pre-hospital acute or within 1st 24 hours in hospital | 14.3 | 16.0 |
| IV inotropic agent, pre-hospital acute or within 1st 24 hours in hospital | 4.5 | 6.3 |
| IV beta blocker, pre-hospital acute or within 1st 24 hours in hospital | 12.9 | 11.5 |
| Aspirin, within 1st 24 hours in hospital | 90.3 | 86.6 |
| ACE Inhibitors, pre-hospital acute or within 1st 24 hours in hospital | 47.6 | 47.4 |
|
| ||
| Initial creatinine (mg/dl) | 1.0 (0.9–1.3) | 1.0 (0.9–1.3) |
| Initial positive enzymes | 46.8% | 50.7% |
Numbers for continuous variables are presented as the median with the interquartile range in parentheses. GRACE = Global Registry of Acute Coronary Events; CABG = coronary artery bypass grafting; MI = myocardial infarction; PCI = percutaneous coronary intervention; TIA = transient ischemic attack; IV = intravenous; ACE = angiotensin converting enzyme.
Figure 1RLR Performance on the Development Set. AUCs and six-month hazard ratios in the overall (a,b) and low-risk (GRACE < 87) subset (c,d) of the development set. Error bars show one standard error of the mean. * indicates p < 0.001. Numbers above the bars indicate mean values. AUC = area under the curve; GRACE = Global Registry of Acute Coronary Events; RLRVI = ridge logistic regression with variable inputs; STEMI = ST elevation myocardial infarction; NSTEMI = non-ST elevation myocardial infarction; UA = unstable angina.
Figure 2Evaluating the Relative Importance of non-GRACE Score Features. AUCs from adding one of the 11 non-GRACE score features at a time and imputing the remaining features. AUCs are averaged over 100 bootstrapped test sets. Error bars show one standard error of the mean. All models show improved performance over the GRACE score with p < 0.001. AUC = area under the curve; GRACE = Global Registry of Acute Coronary Events; RLRVI = ridge logistic regression with variable inputs; Hx = History; Peri Vasc Dis = peripheral vascular disease; IV = intravenous; CHF = congestive heart failure.
Figure 3RLRVI Discriminatory Ability Using a Subset of Clinical Features. AUCs averaged over 10 bootstrap splits of the development set for all possible combinations of the 11 non-GRACE score features selected by BLR. The red line and numbers indicate the number of features that were known and therefore not imputed. For example, the red 6 indicates that all points in that range were generated by models that had six of the non-GRACE score features available; all possible combinations of 11 choose 6 are represented in this range. The performance of the GRACE score on the same 10 bootstrap splits is shown by the dashed line at the bottom of the plot. All feature combinations show improvement over the GRACE score with p < 0.003. AUC = area under the curve; GRACE = Global Registry of Acute Coronary Events.
Figure 4RLRVI Performance on the Validation Set. AUCs and six-month hazard ratios in the overall (a,b) and low-risk (GRACE < 87) subset (c,d) of the validation set. Error bars show one standard error of the mean. * indicates p < 0.007. Numbers above the bars indicate mean values. AUC = area under the curve; GRACE = Global Registry of Acute Coronary Events; RLRVI = ridge logistic regression with variable inputs; STEMI = ST elevation myocardial infarction; NSTEMI = non-ST elevation myocardial infarction; UA = unstable angina.