| Literature DB >> 35455753 |
Iuliia Lenivtceva1, Dmitri Panfilov2, Georgy Kopanitsa1,3, Boris Kozlov2.
Abstract
The complications of thoracic aortic disease include aortic dissection and aneurysm. The risks are frequently compounded by many cardiovascular comorbidities, which makes the process of clinical decision making complicated. The purpose of this study is to develop risk predictive models for patients after thoracic aneurysm surgeries, using integrated data from different medical institutions. Seven risk features were formulated for prediction. The CatBoost classifier performed best and provided an ROC AUC of 0.94-0.98 and an F-score of 0.95-0.98. The obtained results are widely in line with the current literature. The obtained findings provide additional support for clinical decision making, guiding a patient care team prior to surgical treatment, and promoting a safe postoperative period.Entities:
Keywords: aortic aneurysm; feature extraction; integrated data; machine learning; postoperative risks; predictive modeling
Year: 2022 PMID: 35455753 PMCID: PMC9024528 DOI: 10.3390/jpm12040637
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Recent studies for cardiovascular predictive modelling.
| Scheme | Algorithm | AUC-ROC | Data | Target |
|---|---|---|---|---|
| Lee, 2018 [ | XGBoost | 0.78 | Open heart and TAA surgery | Acute kidney injury |
| Zhong, 2021 [ | XGBoost | 0.93 | Coronary artery bypass surgery, aortic valve replacement and other heart surgeries | 30-day mortality, septic shock, liver dysfunction, and thrombocytopenia |
| Allyn, 2017 [ | Model ensemble | 0.78 | Elective heart surgery | Postoperative mortality |
| Fernandes, 2021 [ | XGBoost | 0.88 | Intraoperative open heart surgery data | Postoperative mortality |
| Coulson, 2020 [ | Logistic regression | 0.78–0.85 | Open heart surgery | Acute kidney injury |
Figure 1The pipeline for medical risk model development.
Models and parameters.
| Model | Parameters |
|---|---|
| LR* (imp. feat.) | ‘C’: 2.83, ‘solver’: ‘newton-cg’ |
| LR + SMOTE (imp. feat.) | ‘C’: 0.5, ‘solver’: ‘newton-cg’ |
| LR + SMOTE (all feat.) | ‘C’: 4.0, ‘solver’: ‘liblinear’ |
| RF (imp. feat.) | ‘criterion’: ‘gini’, ‘max_features’: ‘auto’ |
| RF + SMOTE (imp. feat.) | ‘criterion’: ‘gini’, ‘max_features’: ‘auto’ |
| RF + SMOTE (all feat.) | ‘criterion’: ‘gini’, ‘max_features’: ‘log2’ |
| CC * (all. feat.) | ‘depth’: 4, ‘l2_leaf_reg’: 3, ‘learning_rate’: 0.6 |
| CC + SMOTE (imp. feat.) | ‘depth’: 5, ‘l2_leaf_reg’: 2, ‘learning_rate’: 0.9 |
| CC + SMOTE (all feat.) | ‘depth’: 4, ‘l2_leaf_reg’: 1, ‘learning_rate’: 0.2 |
* LR–logistic regression, RF—random forest, CC—CatBoost classifier; imp. feat.—the model is composed using only important features, all feat.—the model is composed using all available features.
Performance of the classifiers for each target.
| Target | Best Classifier | ROC AUC | F-Score | Recall | Precision |
|---|---|---|---|---|---|
| In-hospital mortality | CC * + SMOTE (all feat.) | 0.965 | 0.966 | 0.992 | 0.942 |
| Temporary neurological deficit (TND) | CC + SMOTE (all feat.) | 0.960 | 0.959 | 0.936 | 0.983 |
| Permanent neurological deficit (PND) | CC + SMOTE (all feat.) | 0.946 | 0.947 | 0.969 | 0.926 |
| Prolonged lung ventilation (>7 days) | CC + SMOTE (all feat.) | 0.957 | 0.958 | 0.984 | 0.934 |
| Renal replacement therapy (RRT) | CC + SMOTE (all feat.) | 0.985 | 0.984 | 0.992 | 0.978 |
| Myocardial infarction (MI) | CC + SMOTE (imp. feat.) | 0.986 | 0.984 | 0.993 | 0.979 |
| Multiple organ failure (MOF) | CC + SMOTE (all feat.) | 0.952 | 0.950 | 0.964 | 0.958 |
* CC—CatBoost classifier; imp. feat.—the model is composed using only important features, all feat.—the model is composed using all available features.
Figure 2Feature importance diagrams for target variables: (a) in-hospital mortality; (b) TND; (c) PND; (d) prolonged lung ventilation; (e) RRT; (f) MOF; (g) MI.
Figure 3The example of a single patient’s prediction.