| Literature DB >> 31290991 |
Bobak J Mortazavi1,2,3,4, Emily M Bucholz5,6, Nihar R Desai3,4, Chenxi Huang3,4, Jeptha P Curtis3,4, Frederick A Masoudi7, Richard E Shaw8, Sahand N Negahban9, Harlan M Krumholz3,4,10.
Abstract
Importance: Better prediction of major bleeding after percutaneous coronary intervention (PCI) may improve clinical decisions aimed to reduce bleeding risk. Machine learning techniques, bolstered by better selection of variables, hold promise for enhancing prediction. Objective: To determine whether machine learning techniques better predict post-PCI major bleeding compared with the existing National Cardiovascular Data Registry (NCDR) models. Design, Setting, and Participants: This comparative effectiveness study used the NCDR CathPCI Registry data version 4.4 (July 1, 2009, to April 1, 2015), machine learning techniques were used (logistic regression with lasso regularization and gradient descent boosting [XGBoost, version 0.71.2]), and output was then compared with the existing simplified risk score and full NCDR models. The existing models were recreated, and then performance was evaluated through additional techniques and variables in a 5-fold cross-validation in analysis conducted from October 1, 2015, to October 27, 2017. The setting was retrospective modeling of a nationwide clinical registry of PCI. Participants were all patients undergoing PCI. Percutaneous coronary intervention procedures were excluded if they were not the index PCI of admission, if the hospital site had missing outcomes measures, or if the patient underwent subsequent coronary artery bypass grafting. Exposures: Clinical variables available at admission and diagnostic coronary angiography data were used to determine the severity and complexity of presentation. Main Outcomes and Measures: The main outcome was in-hospital major bleeding within 72 hours after PCI. Results were evaluated by comparing C statistics, calibration, and decision threshold-based metrics, including the F score (harmonic mean of positive predictive value and sensitivity) and the false discovery rate.Entities:
Mesh:
Year: 2019 PMID: 31290991 PMCID: PMC6624806 DOI: 10.1001/jamanetworkopen.2019.6835
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Patient Characteristics
| Characteristic | Full Sample (N = 3 316 465) | Bleeding Cases (n = 149 724) | Nonbleeding Cases (n = 3 166 741) |
|---|---|---|---|
| Post-PCI major bleeding rate, % | 4.5 | 100 | 0 |
| Demographic characteristics | |||
| Age, median (IQR), y | 65 (56-73) | 68 (59-77) | 65 (56-73) |
| Male sex, % | 68.1 | 50.6 | 68.9 |
| BMI, median (IQR) | 29.1 (25.7-33.3) | 27.7 (24.3-32.1) | 29.2 (25.8-33.4) |
| Medical conditions | |||
| Diabetes, % | 37.0 | 37.5 | 36.9 |
| Hypertension, % | 82.1 | 80.2 | 82.2 |
| Peripheral vascular disease, % | 12.2 | 15.2 | 12.0 |
| Chronic kidney disease, % | 3.7 | 9.9 | 3.5 |
| Previous PCI, % | 41.2 | 30.0 | 41.7 |
| Previous CABG, % | 18.1 | 14.6 | 18.3 |
| Preprocedural hemoglobin, median (IQR), g/dL | 13.7 (12.5-14.8) | 13.6 (11.6-14.9) | 13.7 (12.5-14.8) |
| Procedural status, % | |||
| Elective | 41.5 | 19.1 | 42.6 |
| Urgent | 39.9 | 33.2 | 40.3 |
| Emergent | 18.3 | 45.9 | 17.0 |
| Salvage | 0.3 | 1.8 | 0.2 |
| STEMI | 16.7 | 42.5 | 15.5 |
| Shock | 2.5 | 14.3 | 1.9 |
| Cardiac arrest within 24 h of PCI | 1.9 | 10.0 | 1.5 |
Abbreviations: BMI, body mass index (calculated as weight in kilograms divided by height in meters squared); CABG, coronary artery bypass grafting; IQR, interquartile range; PCI, percutaneous coronary intervention; STEMI, ST-segment elevation myocardial infarction.
SI conversation factor: To convert hemoglobin level to grams per liter, multiply by 10.0.
Bleeding cases are the subset of the full sample deemed to have had major bleeding as an outcome in the 72-hour period after PCI. The nonbleeding cases are those who did not have this adverse outcome.
Names of Variables, Descriptions, and Use in Prior Models
| Variable | Description [NCDR CathPCI Registry Field No.] | Use in Prior Model |
|---|---|---|
| Age | Patient age [2050] | Existing simplified risk score |
| Age >70 y | Is patient age >70 y? (yes or no) [2050] | Existing full model |
| Age ≤70 y | Is patient age ≤70 y? (yes or no) [2050] | Existing full model |
| BMI | BMI [4055, 4060] | Existing simplified risk score |
| BMI ≤30 | Is BMI ≤30? (yes or no) [4055, 4060] | Existing full model |
| Chronic lung disease | Chronic lung disease (yes or no) [4080] | Existing full model |
| Chronic kidney disease | Chronic kidney disease, composite categorical: 1, 2, 3, 4 [4065] | Existing simplified risk score |
| No chronic kidney disease: 1 | Patients with GFR ≥60 mL/min/1.73 m2 (yes or no) | NA |
| Mild chronic kidney disease: 2 | Patients with GFR ≥45 and <60 mL/min/1.73 m2 (yes or no) | Existing full model |
| Moderate chronic kidney disease: 3 | Patients with GFR ≥30 and <45 mL/min/1.73 m2 (yes or no) | Existing full model |
| Severe chronic kidney disease: 4 | Patients with GFR <30 mL/min/1.73 m2 or receiving current dialysis (yes or no) [4065] | Existing full model |
| GFR | GFR (continuous) [7315, 2050, 2060, 2071] | NA |
| Female sex | Is patient female? (yes or no) [2060] | Existing simplified risk score, existing full model |
| Diabetes | Diabetes (yes or no) [4085] | NA |
| Diabetes composite | Diabetes and therapy composite: 1, 2 [4090] | NA |
| Diabetes with noninsulin treatment: 1 | Diabetes therapy (yes or no) [4090] | NA |
| Insulin-requiring diabetes: 2 | Diabetes therapy (yes or no) [4090] | Existing full model |
| Currently receiving dialysis | Dialysis (yes or no) [4065] | NA |
| NYHA composite | 0 (No), 1 (class I), 2 (class II), 3 (class III), 4 (class IV) [5025, 5045] | NA |
| NYHA 1, 2, 3 | Class IV vs <IV (yes or no) [5045] | Existing full model |
| NYHA 4 | Class IV (yes or no) [5045] | Existing full model |
| History of cerebrovascular disease | Cerebrovascular disease (yes or no) [4070] | Existing full model |
| History of peripheral arterial disease | Peripheral arterial disease (yes or no) [4075] | Existing full model |
| Previous PCI | Previous PCI in prior visit (yes or no) [4035] | Existing simplified risk score, existing full model |
| Preprocedural TIMI flow grade | Preprocedural TIMI flow grade: 0, 1, 2, or 3 [7140] | NA |
| Preprocedural TIMI flow grade: 0 | Preprocedural TIMI flow grade is 0 (yes or no) [7140] | Existing full model |
| PCI lesion composite | 1: Proximal right, mid-LAD, or proximal circumflex | NA |
| 2: Proximal LAD | ||
| 3: Left main | ||
| 0: Other [7105] | ||
| Proximal LAD PCI | PCI is for proximal LAD lesion (yes or no) [7105] | Existing full model |
| Left main PCI | PCI is for left main lesion (yes or no) [7105] | Existing full model |
| Vessel disease composite | Vessel disease: 0, 1, 2, or 3 [6100, 6110, 6120, 6130, 6140, 6150, 6160] | NA |
| 2-Vessel or 3-vessel disease | 2-Vessel or 3-vessel disease vs none or 1-vessel disease (yes or no) [6100, 6110, 6120, 6130, 6140, 6150, 6160] | Existing full model |
| Lesion complexity | High complexity or nonlesion (yes or no) [7185] | NA |
| SCAI lesion class | I, II, III, or IV [7185, 7115] | NA |
| SCAI lesion class: II or III | SCAI lesion is class II or III (yes or no) [7185, 7115] | Existing full model |
| SCAI lesion class: IV | SCAI lesion is class IV (yes or no) [7185, 7115] | Existing full model |
| CAD presentation | CAD presentation [5000] | NA |
| STEMI | If CAD presentation is STEMI (yes or no) [5000] | Existing simplified risk score, existing full model |
| Stenosis % before treatment | % Stenosis immediately before treatment [7115] | NA |
| Subacute stent thrombosis | In-stent thrombosis [7145, 7150, 7165] | Existing full model |
| PCI status and shock composite | 1: Salvage and shock (within 24 h and at start of PCI) | NA |
| 2: Salvage or shock (within 24 h and at start of PCI) | ||
| 3: Shock within 24 h or at start of PCI | ||
| 4: Emergent procedure | ||
| 5: Urgent procedure | ||
| 6: Elective procedure [5060, 7020, 7030] | ||
| PCI status and shock composite 1 | Salvage and shock (within 24 h and at start of PCI) (yes or no) [5060, 7020, 7030] | NA |
| PCI status and shock composite 2 | Salvage or shock (within 24 h and at start of PCI) (yes or no) [5060, 7020, 7030] | Existing full model |
| PCI status and shock composite 3 | Shock within 24 h or at start of PCI (yes or no) [5060, 7020, 7030] | Existing full model |
| PCI status and shock composite 4 | Emergent procedure (yes or no) [5060, 7020, 7030] | Existing full model |
| PCI status and shock composite 5 | Urgent procedure (yes or no) [5060, 7020, 7030] | Existing full model |
| PCI status and shock composite 6 | Elective procedure (yes or no) [5060, 7020, 7030] | NA |
| Shock within 24 h and at start of PCI | Shock within 24 h and at start of PCI (yes or no) [5060, 7030] | Existing full model |
| Lytics before PCI for STEMI | Lytics before PCI for STEMI (yes or no) [5000, 5010] | Existing full model |
| Cardiac arrest within 24 h | Cardiac arrest within 24 h (yes or no) [5065] | Existing simplified risk score, existing full model |
| Cardiogenic shock within 24 h | Cardiogenic shock within 24 h (yes or no) [5060] | NA |
| Cardiogenic shock within 24 h or at start of PCI | Cardiogenic shock within 24 h or at start of PCI (yes or no) [5060, 7030] | Existing simplified risk score |
| Cardiogenic shock composite | 1: Cardiogenic shock within 24 h and at start of PCI | NA |
| 2: Cardiogenic shock within 24 h | ||
| 3: Cardiogenic shock at start of PCI | ||
| 4: No cardiogenic shock [5060, 7030] | ||
| Cardiogenic shock at start of PCI | Cardiogenic shock at start of PCI [7030] | NA |
| PCI status composite | 1: Elective | Existing simplified risk score |
| 2: Urgent | ||
| 3: Emergency | ||
| 4: Salvage [7020] | ||
| Preprocedural hemoglobin | Preprocedural hemoglobin (continuous) [7320] | Existing full model |
| Preprocedural hemoglobin ≤13 g/dL | Preprocedural hemoglobin ≤13 g/dL (yes or no) [7320] | Existing full model |
| Preprocedural hemoglobin >13 g/dL | Preprocedural hemoglobin >13 g/dL (yes or no) [7320] | NA |
| Preprocedural LVEF % | Preprocedural LVEF % (continuous) [7025] | Existing simplified risk score |
| Preprocedural creatinine | Preprocedural creatinine (continuous) [7315] | NA |
Abbreviations: BMI, body mass index (calculated as weight in kilograms divided by height in meters squared); CAD, coronary artery disease; GFR, glomerular filtration rate; LAD, left anterior descending; LVEF, left ventricular ejection fraction; NA, not applicable (additions to the blended model data set); NCDR, National Cardiovascular Data Registry; NYHA, New York Heart Association; PCI, percutaneous coronary intervention; SCAI, Society for Cardiovascular Angiography and Interventions; STEMI, ST-segment elevation myocardial infarction; TIMI, thrombolysis in myocardial infarction.
SI conversion factor: To convert hemoglobin level to grams per liter, multiply by 10.0.
C Statistics of 5-Fold Cross-validation Results for the Existing Simplified Risk Score and the Blended Model
| Timing | Variable Set | Mean (95% CI) C Statistic |
|---|---|---|
| Existing simplified risk score | Existing simplified risk score | 0.77 (0.77-0.77) |
| Existing simplified risk score with lasso regularization | 0.77 (0.77-0.77) | |
| Existing simplified risk score with gradient descent boosting | 0.81 (0.80-0.81) | |
| Blended model | Existing full model | 0.78 (0.78-0.78) |
| Existing full model with lasso regularization | 0.78 (0.78-0.78) | |
| Existing full model with gradient descent boosting | 0.78 (0.78-0.78) | |
| Blended model with lasso regularization | 0.78 (0.78-0.78) | |
| Blended model with gradient descent boosting | 0.82 (0.82-0.82) |
Prospective Predictions and Changes in 5-Fold Cross-validation for the Existing Simplified Risk Score and the Blended Model
| Timing | Variable Set | Prospective Prediction | Mean (SD) Decision Threshold | |||
|---|---|---|---|---|---|---|
| True-Positives | True-Negatives | False-Positives | False-Negatives | |||
| Existing simplified risk score | Existing simplified risk score with high-risk threshold | 105 316 | 2 208 569 | 958 172 | 44 408 | 65.0 (0.0) Points (6.5%) |
| Existing simplified risk score with data-driven threshold | 47 445 | 2 990 509 | 176 232 | 102 279 | 96.4 (0.1) Points (range, 14.9%-17.0%) | |
| Existing simplified risk score with lasso regularization | 52 420 | 2 954 631 | 212 110 | 97 304 | 10.0% (0.3%) | |
| Existing simplified risk score with gradient descent boosting | 52 768 | 3 019 006 | 147 735 | 96 956 | 15.1% (0.8%) | |
| Blended model | Existing full model | 49 967 | 2 982 389 | 184 352 | 99 757 | 11.9% (0.5%) |
| Blended model with lasso regularization | 51 840 | 2 977 168 | 189 573 | 97 884 | 11.6% (0.4%) | |
| Blended model with gradient descent boosting | 55 527 | 3 013 868 | 152 873 | 94 197 | 15.6% (0.9%) | |
The “NA” fields in Table 2 are the baseline models and have no previous method against which to compare change.
Figure. Plots for the Existing Full Model and the Blended Model
The blended model demonstrated a closer calibration than the existing full model. A, Decile-based calibration plots are calculated from the 5-fold cross-validation showing stable model calibration. B, Continuous calibration plots with 95% CIs (shaded areas) are shown for the 2 models.