| Literature DB >> 33061545 |
Fatima Alshakhs1, Hana Alharthi1, Nida Aslam2, Irfan Ullah Khan2, Mohamed Elasheri3.
Abstract
PURPOSE: Predictive analytics (PA) is a new trending approach in the field of healthcare that uses machine learning to build a prediction model using supervised learning algorithms. Isolated coronary artery bypass grafting (iCABG), an open-heart surgery, is commonly performed in the treatment of coronary heart disease. AIM: The aim of this study was to develop and evaluate a model to predict postoperative length of stay (PLoS) for iCABG patients using supervised machine learning techniques, and to identify the features with the highest contribution to the model.Entities:
Keywords: CABG; LoS; classifiers; predictive analytics
Year: 2020 PMID: 33061545 PMCID: PMC7537993 DOI: 10.2147/IJGM.S250334
Source DB: PubMed Journal: Int J Gen Med ISSN: 1178-7074
Figure 1Model development cycle for prediction of postoperative length of stay for isolated coronary artery bypass grafting (iCABG). Notes: Reproduced from Free machine learning diagram - free powerpoint templates; 2017. Available from: .12
Figure 2Histogram of postoperative length of stay including outliers.
Figure 3Boxplot of postoperative length of stay with the outliers.
iCABG Attributes Extracted to Build the Models
| Categories | # | Attributes |
|---|---|---|
| Demographic data | 1 | Age |
| 2 | Gender | |
| 3 | Ethnicity | |
| 4 | Marital status | |
| 5 | BMI | |
| 6 | Height | |
| 7 | Smoking history | |
| Comorbidities | 8 | Hypertension |
| 9 | Hypercholesterolemia | |
| 10 | Renal disease | |
| 11 | Renal failure | |
| 12 | Diabetic | |
| 13 | Diabetes treatment | |
| 14 | Cerebrovascular disease | |
| 15 | Chronic lung disease | |
| Patient history | 16 | Premedication |
| 17 | Family history of ischemic heart disease | |
| 18 | Heart failure | |
| 19 | Previous cardiac, vascular, or thoracic surgery | |
| 20 | Number of previous heart operations | |
| Pre-Op measures | 21 | Angina |
| 22 | Pulmonary artery systolic | |
| 23 | Dyspnoea | |
| 24 | Poor mobility | |
| 25 | Poor mobility due to any non-cardiac reason | |
| 26 | Operative urgency | |
| 27 | EuroScore II | |
| 28 | DAO | |
| 29 | Pre-op intra-aortic balloon pump used | |
| Intra-Op measures | 30 | CABG procedure |
| 31 | Number of arterial grafts | |
| 32 | Intra-aortic balloon pump used | |
| 33 | Complications during operation | |
| Post-Op measures | 34 | Infective complication |
| 35 | Blood loss at 24 hours |
Figure 4Result of Boruta feature selection method.
Results of Resampling Methods to Balance the Target Class
| Method | Results | Total Records |
|---|---|---|
| Oversampling method | Records of target class AA randomly increased | 810 |
| Undersampling method | Records of target class AB randomly decreased to match the size of the AA target class | 432 |
| Both method | Records of AA increased, records of AB decreased, and the result of this method was 209 AB records and 232 AA records | 441 |
| ROSE | Records of AA was increased using synthetic records. The result of this method was 338 AB and 362 AA class. | 700 |
Summary Results of Naïve Bayes and Random Forest Classifiers with All Resampling Methods Using 70–30% Split
| Resampling | Naïve Bayes | Random Forest | ||||||
|---|---|---|---|---|---|---|---|---|
| Over | Under | Both | ROSE | Over | Under | Both | ROSE | |
| AUC | 0.64 | 0.69 | 0.71 | 0.66 | 0.89* | 0.68 | 0.80* | 0.70 |
| Accuracy | 0.64 | 0.63 | 0.65 | 0.61 | 0.78 | 0.60 | 0.75 | 0.60 |
| Recall | 0.53 | 0.57 | 0.59 | 0.54 | 0.82 | 0.58 | 0.74 | 0.63 |
| Precision | 0.66 | 0.66 | 0.73 | 0.60 | 0.75 | 0.63 | 0.78 | 0.58 |
| F1 Score | 0.59 | 0.61 | 0.65 | 0.57 | 0.78 | 0.60 | 0.77 | 0.60 |
Note: *AUC values closest to 1 (80% and above).
Summary Results of Random Forest Classifier with All Resampling Methods Using 10-K Cross- Validation
| Random Forest | ||||
|---|---|---|---|---|
| Resampling | Over | Under | Both | ROSE |
| AUC | 0.80 | 0.81 | 0.81 | 0.80 |
| Accuracy | 0.80 | 0.82 | 0.81 | 0.77 |
| Recall | 0.80 | 0.82 | 0.82 | 0.80 |
| Precision | 0.80 | 0.82 | 0.82 | 0.79 |
| F1 Score | 0.79 | 0.80 | 0.82 | 0.78 |
Figure 5Receiver operating characteristics (ROC) curve for random forest model using both resampling method and 70–30% split.
Figure 6Receiver operating characteristics (ROC) curve for random forest model using oversampling method and 70–30% split.
Figure 7Receiver operating characteristics (ROC) curve for random forest model using both resampling method and 10-K cross-validation.
Figure 8Receiver operating characteristics (ROC) curve for random forest model using undersampling method and 10-K cross-validation.
Random Forest Models with Highest AUC, F1 and Recall
| 70–30% Split | 10-K Cross-Validation | ||||||
|---|---|---|---|---|---|---|---|
| Both | Over | ROSE | Under | Both | Over | ROSE | |
| AUC | 0.80 | 0.89 | 0.70 | 0.81 | 0.81 | 0.80 | 0.80 |
| F1 | 0.77 | 0.79 | 0.60 | 0.80 | 0.82 | 0.79 | 0.78 |
| Recall | 0.74 | 0.82 | 0.63 | 0.82 | 0.82 | 0.80 | 0.80 |
Matrix of Studies with Relative Medical Conditions to Compare with The Study Results
| Article | Authors | Year | Country | Setting | Medical Condition | Data Balance | Model Evaluation | Classifier | AUC |
|---|---|---|---|---|---|---|---|---|---|
| Predictors of in-hospital length of stay among cardiac patients: A machine learning approach | Daghistani et al | 2019 | Saudi Arabia | King Abdulaziz Medical City Complex in Riyadh | Predict LoS for cardiac patients | Smote | Cross-validation | Random Forest | 0.94 |
| Neural Network Prediction of ICU Length of | LaFaro et al | 2015 | USA | New York Medical College | Predict ICU LoS after cardiac surgery | – | Cross-validation | Ensemble of Neural Network | 0.90 |
| Using machine learning for predicting severe postoperative complications after cardiac surgery | Lapp et al | 2018 | UK | Golden Jubilee National Hospital | Predict complications after cardiac surgery | – | – | Random Forest | 0.71 |
| Prediction of In-Hospital Mortality And Length of Stay in Acute Coronary Syndrome Patients Using Machine-Learning Methods | Yakovlev et al | 2018 | Russia | – | Predict mortality and LoS for acute coronary syndrome patients | – | Cross-validation | Naïve Bayes | 0.90 |
| This study | 2019 | Saudi Arabia | Saud Albabtain Cardiac Center | Predict LoS for iCABG patients | Both method | Cross-validation | Random Forest | 0.81 | |