| Literature DB >> 36158815 |
Dabei Cai1,2, Tingting Xiao1, Ailin Zou1, Lipeng Mao1,2, Boyu Chi1,2, Yu Wang1, Qingjie Wang1,2, Yuan Ji1, Ling Sun1,2.
Abstract
Background: Predictive models based on machine learning have been widely used in clinical practice. Patients with acute myocardial infarction (AMI) are prone to the risk of acute kidney injury (AKI), which results in a poor prognosis for the patient. The aim of this study was to develop a machine learning predictive model for the identification of AKI in AMI patients.Entities:
Keywords: acute kidney injury; acute myocardia infarction; area under the receiver operating characteristic curve; machine learning; random forest
Year: 2022 PMID: 36158815 PMCID: PMC9489917 DOI: 10.3389/fcvm.2022.964894
Source DB: PubMed Journal: Front Cardiovasc Med ISSN: 2297-055X
FIGURE 1Flow diagram of the selection process of patients.
Baseline characteristics.
| MIMIC III | MIMIC IV | |||||
|
|
| |||||
| Non-AKI | AKI | Non-AKI | AKI | |||
| Demographic | ||||||
| Male ( | 639 (68.5%) | 220 (67.7%) | 0.79 | 1,164 (62.9%) | 518 (67.0%) | 0.045 |
| Age (year) | 65.1 [55.3,75.4] | 69.2 [58.7,78.4] | <0.001 | 67.0 [57.0,75.0] | 71.0 [61.0,78.0] | <0.001 |
| Vital signs | ||||||
| Heart rate (min–1) | 83.0 [72.0,95.0] | 87.0 [73.0,100.0] | <0.05 | 83.0 [73.0,95.0] | 86.0 [74.0,100] | <0.001 |
| Temperature (°C) | 36.5 [36.0,36.8] | 36.5 [35.9,37.1] | 0.548 | 36.6 [36.4,36.8] | 36.6 [36.3,36.9] | 0.789 |
| Respiratory rate (min–1) | 17.0 [15.0,21.0] | 19.0 [15.0,22.0] | <0.05 | 18.0 [15.0,22.0] | 19.0 [16.0,24.0] | <0.001 |
| ASP (mmHg) | 119.8 [106.0,134.0] | 116.0 [101.0,133.0] | 0.055 | 118.0 [105.0,134.0] | 113 [101.0,131.0] | <0.001 |
| ADP (mmHg) | 61.0 [52.0,70.0] | 59.0 [50.0,68.0] | <0.05 | 66.0 [56.0,76.0] | 61.0 [53.0,72.0] | <0.001 |
| MAP (mmHg) | 78.0 [68.0,88.0] | 75.0 [66.0,84.0] | <0.05 | 80.0 [71.0,91.0] | 77.0 [69.0,87.0] | <0.001 |
| Laboratory results | ||||||
| RBC (m/uL) | 4.2 [3.8,4.7] | 3.9 [3.4,4.5] | <0.001 | 4.0 [3.4,4.5] | 3.8 [3.2,4.3] | <0.001 |
| WBC (k/uL) | 11.0 [8.7,14.7] | 12.2 [8.4,15.2] | 0.088 | 10.5 [8.0,13.7] | 11.1 [8.0,15.8] | <0.05 |
| Platelet (k/uL) | 239.0 [194.0,297.5] | 222.0 [175.0,292.5] | <0.05 | 211.0 [165.0,265.0] | 197.0 [153.5,250.0] | <0.001 |
| Hemoglobin (g/dL) | 13.0 [11.4,14.4] | 12.1 [10.4,13.6] | <0.001 | 12.0 [10.1,13.7] | 11.2 [9.7,13.0] | <0.001 |
| Hematocrit (%) | 37.8 [34.0,41.6] | 36.0 [31.3,40.0] | <0.001 | 36.5 [31.1,40.6] | 34.9 [30.2,39.5] | <0.001 |
| Glucose (mg/dL) | 140.0 [114.0,187.0] | 155.0[118.0,230.5] | <0.001 | 133.0 [109.0,178.0] | 148.0 [113.0,203.0] | <0.001 |
| BUN (mg/dL) | 18.0 [14.0,26.0] | 25.0 [17.0,37.0] | <0.001 | 19.0 [14.0,32.0] | 25.0 [18.5,39.0] | <0.001 |
| Potassium (mEq/L) | 4.1 [3.8,4.5] | 4.3 [3.9,4.7] | <0.001 | 4.2 [3.9,4.5] | 4.3 [3.9,4.6] | <0.05 |
| Sodium (mEq/L) | 139.0 [136.0,140.0] | 138.0 [135.0,140.0] | 0.083 | 138.0 [136.0,140.0] | 138.0 [135.0,141.0] | 0.858 |
| Chloride (mEq/L) | 103.0 [100.0,106.0] | 102.0 [101.0,107.0] | 0.353 | 103.0 [99.0,105.0] | 103.0 [99.0,105.0] | 0.613 |
| Calcium (mg/dL) | 8.6 [8.2,9.1] | 8.4 [7.9,8.9] | <0.001 | 8.7 [8.2,9.1] | 8.5 [8.0,9.0] | <0.001 |
| Magnesium (mg/dL) | 1.9 [1.7,2.1] | 1.9 [1.7,2.1] | 0.468 | 2.0 [1.8,2.1] | 2.0 [1.8,2.2] | 0.900 |
| Phosphate (mg/dL) | 3.4 [2.9,4.0] | 3.6[3.0,4.5] | <0.001 | 3.6 [3.0,4.2] | 3.8 [3.2,4.6] | <0.001 |
| Bicarbonate (mEq/L) | 23.0 [21.0,26.0] | 23.0 [20.0,25.0] | <0.05 | 23.0 [21.0,25.0] | 22.0 [19.0,25.0] | <0.001 |
| APTT (s) | 31.6 [26.1,55.4] | 35.2 [27.6,59.2] | <0.05 | 35.6 [28.8,55.5] | 39.4 [29.4,65.2] | <0.05 |
| INR | 1.2 [1.1,1.3] | 1.2 [1.1,1.4] | <0.05 | 1.2 [1.1,1.3] | 1.2 [1.1,1.4] | <0.001 |
| PT (s) | 13.4 [12.5,14.6] | 13.5 [12.8,15.0] | <0.05 | 12.7 [11.7,14.6] | 13.3 [12.1,15.8] | <0.001 |
| CK-MB (ng/mL) | 32.0 [8.0,94.5] | 26.0 [8.0,97.0] | 0.633 | 20.0 [6.0,71.0] | 18.0 [5.0,69.1] | 0.431 |
| TNT (ng/mL) | 1.0 [0.8,4.0] | 1.5[0.2,5.5] | <0.05 | 0.5 [0.1,2.3] | 0.5 [0.1,2.3] | 0.978 |
| Creatinine (mg/dL) | 1.0[0.8,1.3] | 1.3 [0.9,1.6] | <0.001 | 1.0 [0.8,1.5] | 1.4 [1.0,1.9] | <0.001 |
| GFR [mL/(min⋅1.73 m2)] | 74.8 [55.8,95.8] | 56.3 [38.1,82.7] | <0.001 | 72.0 [44.7,99.3] | 52.1 [33.5,75.9] | <0.001 |
| Comorbidities ( | ||||||
| HF ( | 327 (35.0%) | 154 (47.4%) | <0.001 | 218 (11.8%) | 100 (12.9%) | 0.407 |
| Cardiogenic shock ( | 132 (14.1%) | 85 (26.2%) | <0.001 | 85 (4.6%) | 82 (10.4%) | <0.001 |
| Atrial fibrillation ( | 189 (19.9%) | 110 (33.8%) | <0.001 | 501 (27.1%) | 325 (42.0%) | <0.001 |
| Hypertension ( | 467 (50.1%) | 144 (44.3%) | 0.074 | 339 (18.3%) | 96 (12.4%) | <0.001 |
| Hyperlipidemia ( | 273 (29.3%) | 77(23.7%) | 0.054 | 1,003 (54.2%) | 339 (51.6%) | 0.229 |
| Hypercholesterolemia ( | 154 (16.5%) | 34 (10.5%) | <0.05 | 90 (4.9%) | 39 (5.0%) | 0.843 |
| Respiratory failure ( | 107 (11.5%) | 72 (22.2%) | <0.001 | 70 (3.8%) | 39 (5.0%) | 0.162 |
| DM ( | 188 (20.2%) | 86 (26.5%) | 0.018 | 171 (9.2%) | 77 (10.0%) | 0.559 |
| Ventricular tachycardia ( | 119 (12.8%) | 47 (14.5%) | 0.434 | 90 (4.9%) | 73 (9.4) | <0.001 |
Continuous variables are presented as the median and interquartile range (IQR). Counting data are presented as numbers and percentages. ASP, arterial systolic pressure; ADP, diastolic arterial pressure; MAP, mean arterial pressure; RBC, red blood cell; WBC, white blood cell; BUN, blood urea nitrogen; APTT, activated partial prothrombin time; INR, International Normalized Ratio; PT, prothrombin time; CK-MB, Creatine Kinase Isozyme-MB; TNT, Troponin-T; GFR, Glomerular Filtration Rate; HF, Heart Failure; DM, Diabetes Mellitus.
FIGURE 2Single-sample feature impact map (A); heat map of feature distribution under sample clustering (B); histogram of feature importance (C); scatter plot of feature density (D).
FIGURE 3Logistic Regression model with different variables ROC curves; training cohort (A); test cohort (B); validation cohort (C). Logistic Regression model calibration curve; training cohort (D); test cohort (E); validation cohort (F). Logistic Regression model Nomogram (G).
FIGURE 4The ROC curves for machine learning models and the performances of all models in test cohort. The X-axis in 4G-4L represents the AUC values of each model. Top 5 variables (A,G); top 10 variables (B,H); top 15 variables (C,I); top 20 variables (D,J); top 25 variables (E,K); models for all variables (F,L).
FIGURE 5The ROC curves for machine learning models and the performances of all models in validation cohort. The X-axis in 5G-5L represents the AUC values of each model. Top 5 variables (A,G); top 10 variables (B,H); top 15 variables (C,I); top 20 variables (D,J); top 25 variables (E,K); models for all variables (F,L).
FIGURE 6Dynamic plot of the area under the ROC curve for all machine learning models; training cohort (A); test cohort (B); validation cohort (C). The x-axis represents the number of variables included in the model, the y-axis represents different kinds of models, and the z-axis represents the AUC values of each model.
The performance of six models containing the top 20 importance variables.
| Model | AUC (CI 95%) | Accuracy | Sensitivity | Specificity |
| Support Vector Machine | 0.720 (0.687∼0.753) | 0.684 | 0.697 | 0.646 |
| Decision Trees | 0.637 (0.602∼0.672) | 0.670 | 0.716 | 0.538 |
| Random Forests | 0.781 (0.750∼0.811) | 0.735 | 0.748 | 0.698 |
| eXtreme Gradient Boosting | 0.741 (0.708∼0.773) | 0.682 | 0.678 | 0.695 |
| Naive Bayes | 0.716 (0.684∼0.749 | 0.628 | 0.582 | 0.763 |
| Logistic regression | 0.686 (0.653∼0.720) | 0.694 | 0.743 | 0.550 |
FIGURE 7An example of the application software for predicting AKI risk in AMI patients.