| Literature DB >> 34658259 |
Arman Kilic1,2, Robert H Habib3, James K Miller4, David M Shahian5, Joseph A Dearani6, Artur W Dubrawski4.
Abstract
Background This study evaluated the role of supplementing Society of Thoracic Surgeons (STS) risk models for surgical aortic valve replacement with machine learning (ML). Methods and Results Adults undergoing isolated surgical aortic valve replacement in the STS National Database between 2007 and 2017 were included. ML models for operative mortality and major morbidity were previously developed using extreme gradient boosting. Concordance and discordance in predicted risk between ML and STS models were defined using equal-size tertile-based thresholds of risk. Calibration metrics and discriminatory capability were compared between concordant and discordant patients. A total of 243 142 patients were included. Nearly all calibration metrics were improved in cases of concordance. Similarly, concordance indices improved substantially in cases of concordance for all models with the exception of deep sternal wound infection. The greatest improvements in concordant versus discordant cases were in renal failure: ML model (concordance index, 0.660 [95% CI, 0.632-0.687] discordant versus 0.808 [95% CI, 0.794-0.822] concordant) and STS model (concordance index, 0.573 [95% CI, 0.549-0.576] discordant versus 0.797 [95% CI, 0.782-0.811] concordant) (each P<0.001). Excluding deep sternal wound infection, the concordance indices ranged from 0.549 to 0.660 for discordant cases and 0.674 to 0.808 for concordant cases. Conclusions Supplementing ML models with existing STS models for surgical aortic valve replacement may have an important role in risk prediction and should be explored further. In particular, for the roughly 25% to 50% of patients demonstrating discordance in estimated risk between ML and STS, there appears to be a substantial decline in predictive performance suggesting vulnerability of the existing models in these patient subsets.Entities:
Keywords: aortic valve replacement; complications; machine learning; mortality; risk prediction
Mesh:
Year: 2021 PMID: 34658259 PMCID: PMC8751954 DOI: 10.1161/JAHA.120.019697
Source DB: PubMed Journal: J Am Heart Assoc ISSN: 2047-9980 Impact factor: 5.501
Significant Predictors of Concordance and Discordance in Predicted Risk Between the ML and STS Models
| Variable | Concordance | Discordance | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Mortality | Comp | Renal Failure | Prolonged Ventilation | Reoperation | Stroke | DSWI | Mortality | Comp | Renal Failure | Prolonged Ventilation | Reoperation | Stroke | DSWI | |
| Age, y (increasing) | x | x | x | x | x | |||||||||
| Female | x | x | x | x | x | x | x | |||||||
| White | x | x | x | x | x | |||||||||
| BMI (increasing) | x | x | ||||||||||||
| BSA (increasing) | x | x | x | x | x | |||||||||
| Hyperlipidemia | ||||||||||||||
| Diabetes mellitus | x | x | x | x | x | x | ||||||||
| Hypertension | x | x | x | x | x | |||||||||
| Chronic lung disease—mild | x | x | x | x | x | x | ||||||||
| Chronic lung disease—moderate | x | x | x | x | x | |||||||||
| Chronic lung disease—severe | x | x | x | x | x | x | x | |||||||
| Dialysis | x | x | x | x | ||||||||||
| Creatinine (increasing) | x | x | x | x | x | |||||||||
| Immunosuppressed | x | x | x | |||||||||||
| Infective endocarditis | x | x | x | |||||||||||
| PAD | x | x | x | x | x | |||||||||
| CVD | x | x | x | x | ||||||||||
| FHCAD | x | x | x | |||||||||||
| Redo | x | x | x | x | x | x | ||||||||
| Prior MI | x | x | ||||||||||||
| Shock | x | x | x | x | x | x | ||||||||
| IABP | x | x | x | |||||||||||
| AV insufficiency | x | x | x | x | x | |||||||||
| EF (increasing) | x | x | x | x | x | |||||||||
| Urgent status | x | x | x | x | ||||||||||
| Emergent status | x | x | x | x | x | x | ||||||||
| Intraoperative | ||||||||||||||
| CPB time (increasing) | ||||||||||||||
| Aortic XC time (increasing) | ||||||||||||||
| Blood transfusion | x | x | x | x | x | |||||||||
| Mechanical valve | x | x | x | |||||||||||
Intraoperative variables were added subsequently after identifying significant preoperative predictors. AV indicates aortic valve; BMI, body mass index; BSA, body surface area; Comp, composite of mortality or morbidity; CPB, cardiopulmonary bypass; CVD, cerebrovascular disease; DSWI, deep sternal wound infection; EF, ejection fraction; FHCAD, family history of coronary artery disease; IABP, intra‐aortic balloon pump; MI, myocardial infarction; ML, machine learning; PAD, peripheral arterial disease; STS, Society of Thoracic Surgeons; and XC, cross‐clamp.
Intraoperative variables were entered into the multivariable model only after fully executing the multivariable models using only preoperative variables.
Figure 1Improvement in calibration for Society of Thoracic Surgeons (STS) risk models for operative mortality in concordant cases.
Improvement in Calibration Metrics of the STS Models in Cases of Concordance
| STS Model | ||
|---|---|---|
| Operative Mortality | Discordant (n=12 615; 26.4%) | Concordant (n=35 191; 73.6%) |
| Observed‐to‐expected ratio | 0.770 | 0.866 |
| Calibration‐in‐the‐large | −0.267 | −0.157 |
| Slope of calibration curve | 0.837 | 0.964 |
STS indicates Society of Thoracic Surgeons.
Figure 2Improvement in calibration for machine learning risk models for operative mortality in concordant cases.
Improvement in Calibration Metrics of the ML Models in Cases of Concordance
| ML Model | ||
|---|---|---|
| Operative Mortality | Discordant (n=12 615; 26.4%) | Concordant (n=35 191; 73.6%) |
| Observed‐to‐expected ratio | 0.860 | 1.017 |
| Calibration‐in‐the‐large | −0.154 | 0.016 |
| Slope of calibration curve | 0.806 | 0.987 |
ML indicates machine learning.
Improvement in Calibration Metrics of Models Averaging ML and STS Risk in Cases of Concordance
| Average Model | ||
|---|---|---|
| Operative Mortality | Discordant (n=12 615; 26.4%) | Concordant (n=35 191; 73.6%) |
| Observed‐to‐expected ratio | 0.812 | 0.935 |
| Calibration‐in‐the‐large | −0.212 | −0.072 |
| Slope of calibration curve | 1.325 | 1.030 |
ML indicates machine learning; and STS, Society of Thoracic Surgeons.
Figure 3Improvement in area under the receiver operating characteristic (ROC) curve for the Society of Thoracic Surgeons models for operative mortality in (A) concordant vs (B) discordant cases, and for machine learning models in (C) concordant vs (D) discordant cases.
Improvement in Discriminatory Ability as Measured by Area Under the Receiver Operating Characteristic Curve in Cases of Concordance
| Model | Concordance Index (95% CI) | Concordance Index (95% CI) |
|
|---|---|---|---|
| Operative Mortality | Discordant (n=12 615; 26.4%) | Concordant (n=35 191; 73.6%) | |
| ML | 0.634 (0.595–0.673) | 0.774 (0.759–0.788) | <0.001 |
| STS | 0.614 (0.576–0.653) | 0.769 (0.754–0.784) | <0.001 |
| Average of both models | 0.650 (0.614–0.686) | 0.775 (0.760–0.789) | <0.001 |
ML indicates machine learning; and STS, Society of Thoracic Surgeons.