| Literature DB >> 30509300 |
Konrad Pieszko1,2, Jarosław Hiczkiewicz3,4, Paweł Budzianowski5, Janusz Rzeźniczak6, Jan Budzianowski3,4, Jerzy Błaszczyński7, Roman Słowiński7, Paweł Burchardt8,6.
Abstract
BACKGROUND: Increased systemic and local inflammation play a vital role in the pathophysiology of acute coronary syndrome. This study aimed to assess the usefulness of selected machine learning methods and hematological markers of inflammation in predicting short-term outcomes of acute coronary syndrome (ACS).Entities:
Keywords: Acute coronary syndrome; Biomarkers; Inflammation; Machine learning; Outcomes research; Risk assessment
Mesh:
Substances:
Year: 2018 PMID: 30509300 PMCID: PMC6276170 DOI: 10.1186/s12967-018-1702-5
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Features used by XGboost and DRSA-BRE classifiers
| XGboost | DRSA-BRA (simplified set of features) | DRSA-BRE (full set of features) |
|---|---|---|
| 1. Diastolic blood pressure | 1. All features from simplified set AND | |
Basic characteristics of continuous numerical variables grouped by outcomes
| Feature | Unit | Significant lesion | No significant lesion | p-value | In-hospital death | No in—hospital death | p-value | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Support | Median | IQR | Support | Median | IQR | Support | Median | IQR | Support | Median | IQR | |||||
| 1 | Age | Years | 4825 | 65.8 | 14.3 | 1778 | 67 | 14.8 | < | 94 | 75.8 | 18.7 | 6509 | 66 | 14.3 | < 0.001 |
| 2 | Height | cm | 4653 | 170 | 11 | 1743 | 170 | 11 |
| 70 | 169.5 | 10.3 | 6326 | 170 | 11 | 0.303 |
| 3 | Weight | kg | 4697 | 80 | 20 | 1748 | 80 | 20 |
| 75 | 78 | 21 | 6370 | 80 | 20 | 0.266 |
| 4 | BMI | kg/m2 | 4643 | 27.7 | 6.5 | 1739 | 28.1 | 6.4 |
| 70 | 27 | 6.3 | 6312 | 27.8 | 6.5 | 0.173 |
| 5 | Lymphocyte count | 10 e−3/ul | 4718 | 1.9 | 1 | 1719 | 1.8 | 0.9 | < | 84 | 1.7 | 1.5 | 6353 | 1.9 | 0.9 | 0.284 |
| 6 | Monocyte count | 10 e−3/ul | 4718 | 0.6 | 0.3 | 1719 | 0.6 | 0.2 |
| 84 | 0.7 | 0.5 | 6353 | 0.6 | 0.3 | < 0.001 |
| 7 | Eosinophil count | 10 e−3/ul | 4718 | 0.1 | 0.1 | 1719 | 0.1 | 0.1 |
| 84 | 0 | 0.1 | 6353 | 0.1 | 0.1 | < 0.001 |
| 8 | Neutrophil count | 10 e−3/ul | 4703 | 5.2 | 2.9 | 1707 | 4.9 | 2.4 | < | 84 | 9.1 | 4.7 | 6326 | 5.1 | 2.8 | < 0.001 |
| 9 | Basophile count | 10 e−3/ul | 4718 | 0.1 | 0 | 1719 | 0.1 | 0 |
| 84 | 0.1 | 0 | 6353 | 0.1 | 0 | 0.732 |
| 10 | Haemoglobin level | g/dl | 4701 | 14.4 | 2.1 | 1715 | 14.3 | 2.1 |
| 83 | 12.8 | 3.1 | 6333 | 14.4 | 2.1 | < 0.001 |
| 11 | RDW | % | 4634 | 12.2 | 1.3 | 1692 | 12.4 | 1.3 | < | 84 | 12.5 | 1.9 | 6242 | 12.2 | 1.4 | 0.004 |
| 12 | Haematocrit | % | 4707 | 42.4 | 5.9 | 1715 | 42.4 | 5.9 |
| 84 | 38 | 8.1 | 6338 | 42.5 | 5.9 | < 0.001 |
| 13 | MCV | fL | 4718 | 90.9 | 6.3 | 1719 | 91.5 | 6.2 | < | 84 | 92.5 | 8.6 | 6353 | 91 | 6.2 | 0.129 |
| 14 | Platelet count | 10 e−3/ul | 4718 | 223 | 79 | 1719 | 214 | 77 | < | 84 | 236 | 112 | 6353 | 221 | 78 | 0.392 |
| 15 | MPV | fL | 4682 | 8.5 | 2.1 | 1688 | 8.6 | 2.3 | < | 84 | 8.4 | 2.3 | 6286 | 8.5 | 2.2 | 0.238 |
| 16 | NLR | Ratio | 4703 | 2.7 | 2 | 1707 | 2.6 | 1.8 |
| 84 | 5.1 | 5 | 6326 | 2.6 | 1.9 | < 0.001 |
| 17 | PLR | Ratio | 4718 | 115.3 | 65.6 | 1719 | 115.7 | 63.3 |
| 84 | 131.1 | 125.1 | 6353 | 115.2 | 64.4 | 0.097 |
| 18 | Fibrinogen | mg/dl | 4612 | 403 | 128 | 1715 | 398 | 124 |
| 83 | 439 | 173.5 | 6244 | 401 | 126 | 0.027 |
| 19 | LDL | mg/dl | 3561 | 109 | 66 | 1465 | 95 | 57 | < | 59 | 99 | 49.5 | 4967 | 104 | 64 | 0.236 |
| 20 | HDL | mg/dl | 3588 | 48 | 18 | 1480 | 51 | 21 | < | 59 | 45 | 16 | 5009 | 49 | 20 | 0.001 |
| 21 | Total Cholesterol | mg/dl | 3580 | 177 | 71 | 1474 | 164 | 65 | < | 59 | 163 | 61 | 4995 | 173 | 71 | 0.026 |
| 22 | Triglycerides | mg/dl | 3560 | 122 | 89 | 1469 | 117 | 78 |
| 59 | 109 | 78.5 | 4970 | 121 | 85 | 0.352 |
| 23 | CRP | mg/dl | 1067 | 0.7 | 3 | 443 | 0.4 | 1.4 | < | 36 | 5.5 | 9.4 | 1474 | 0.6 | 2.3 | < 0.001 |
| 24 | TSH | μU/ml | 3975 | 1.3 | 1.2 | 1588 | 1.4 | 1.3 | < | 69 | 1.5 | 1.8 | 5494 | 1.3 | 1.3 | 0.127 |
| 25 | Urea | mg/dl | 3227 | 37 | 17 | 843 | 38 | 18 |
| 49 | 50 | 33 | 4021 | 37 | 17 | < 0.001 |
| 26 | Creatinine | mg/dl | 4712 | 1 | 0.4 | 1727 | 1 | 0.4 |
| 84 | 1.3 | 0.6 | 6355 | 1 | 0.4 | <0.001 |
| 27 | GFR | ml/min | 4468 | 76 | 29 | 1684 | 73 | 29 | < | 79 | 52 | 30.5 | 6073 | 75 | 29 | < 0.001 |
| 28 | Glycated Haemoglobin | % | 349 | 6.4 | 1.9 | 69 | 6.4 | 1.6 |
| 7 | 7.9 | 1.8 | 411 | 6.3 | 1.8 | 0.009 |
| 29 | Sodium | mmol/l | 4766 | 141 | 4 | 1745 | 141 | 4 | < | 85 | 138 | 6 | 6426 | 141 | 4 | < 0.001 |
| 30 | Potassium | mmol/l | 4765 | 4.4 | 0.6 | 1745 | 4.4 | 0.5 | < | 84 | 4.3 | 0.8 | 6426 | 4.4 | 0.6 | 0.8 |
| 31 | Prothrombin time | s | 4721 | 13.3 | 1.3 | 1748 | 13.4 | 1.5 |
| 87 | 15.2 | 3.2 | 6382 | 13.3 | 1.4 | < 0.001 |
| 32 | Thrombin time | s | 4322 | 16 | 1.5 | 1680 | 16.1 | 1.4 | < | 73 | 16.3 | 2.5 | 5929 | 16 | 1.4 | 0.166 |
| 33 | Heart rate at admission | 1/min | 4943 | 72 | 18 | 1826 | 72 | 18 |
| 97 | 79 | 30 | 6672 | 72 | 18 | 0.012 |
| 34 | Systolic blood pressure | mmHg | 4943 | 120 | 20 | 1826 | 120 | 20 |
| 97 | 100 | 40 | 6672 | 120 | 20 | < 0.001 |
| 35 | Diastolic blood pressure | mmHg | 4943 | 80 | 17 | 1826 | 80 | 12 |
| 97 | 70 | 18 | 6672 | 80 | 12 | < 0.001 |
| 36 | Troponin I level | ng/l | 2446 | 0 | 0.8 | 551 | 0 | 0.1 | < 0.001 | 40 | 6.3 | 35.4 | 2957 | 0 | 0.6 | < 0.001 |
| 37 | Troponin T level | ng/l | 2344 | 0 | 0.1 | 1210 | 0 | 0 | < 0.001 | 42 | 0.5 | 1.7 | 3512 | 0 | 0.1 | < 0.001 |
| 38 | Alanine transaminase | U/l | 2289 | 24 | 18 | 1077 | 23 | 16 | 0.012 | 47 | 29 | 47 | 3319 | 24 | 17 | 0.006 |
| 39 | Aspartate transaminase | U/l | 2321 | 24 | 16 | 1100 | 24 | 11 | 0.003 | 48 | 50 | 101.2 | 3373 | 24 | 14 | < 0.001 |
The p-values apply to the univariate Mann–Whitney-U test
IQR inter-quartile range, BMI body mass index, RDW red cell distribution width, MCV mean cell volume, MPV mean platelet volume, NLR neutrophil to lymphocyte ratio, PLR platelet to lymphocyte ratio, LDL low density lipoprotein, HDL high-density lipoprotein, CRP C-reactive protein, TSH thyroid stimulating hormone, GFR glomerular filtration rate
Basic characteristic of nominal features divided by target groups
| Feature | Values | Count where significant lesion; n = 4943 (100%) | Count where no significant lesion; n = 1826 (100%) | Count, where patient died in hospital; n = 97 (100%) | Count where no in-hospital death; n = 6672 (100%) | |
|---|---|---|---|---|---|---|
| 1 | CABG during hospitalisation or planned after discharge | Not qualified | 4174 (84%) | 1826 (100%) | 89 (91.8%) | 5991 (90%) |
| Qualified | 769 (16%) | 0 | 8 (8.2%) | 761 (11%) | ||
| 2 | Dysglycemia | No | 3489 (71%) | 1319 (72%) | 63 (64.9%) | 4745 (71%) |
| Yes | 1454 (29%) | 507 (28%) | 34 (35.1%) | 1927 (29%) | ||
| 3 | Cardiac arrest | False | 4879 (99%) | 1818 (100%) | 77 (79.4%) | 6620 (99%) |
| True | 64 (1%) | 8 (%) | 20 (20.6%) | 52 (1%) | ||
| 4 | Hypertension | True | 4584 (93%) | 1701 (93%) | 82 (84.5%) | 6203 (93%) |
| False | 359 (7%) | 125 (7%) | 15 (15.5%) | 469 (7%) | ||
| 5 | PCI during hospitalization | True | 4247 (86%) | 0 | 76 (78.4%) | 4171 (63%) |
| False | 696 (14%) | 1826 (100%) | 21 (21.6%) | 2501 (37%) | ||
| 6 | Smoking | Former Smoker | 2565 (52%) | 1020 (56%) | 44 (45.4%) | 3541 (53%) |
| Non-Smoker | 1272 (26%) | 528 (29%) | 33 (34. %) | 1767 (26%) | ||
| Active Smoker | 1106 (22%) | 278 (15%) | 20 (20.6%) | 1364 (20%) | ||
| 7 | History of CABG | False | 4546 (92%) | 1542 (84%) | 89 (91.8%) | 5999 (90%) |
| True | 397 (8%) | 284 (16%) | 8 (8.2%) | 673 (10%) | ||
| 8 | History of PCI | False | 3366 (68%) | 1065 (58%) | 80 (82.5%) | 4351 (65%) |
| True | 1577 (32%) | 761 (42%) | 17 (17.5%) | 2321 (35%) | ||
| 9 | History of myocardial infarction | False | 3886 (79%) | 1358 (74%) | 79 (81.4%) | 5165 (77%) |
| True | 1057 (21%) | 468 (26%) | 18 (18.6%) | 1507 (23%) | ||
| 10 | Sex | Male | 3342 (68%) | 1138 (62%) | 56 (57.7%) | 4424 (66%) |
| Female | 1488 (30%) | 641 (35%) | 38 (39.2%) | 2091 (31%) | ||
| 11 | Affected artery | Not specified | 619 (13%) | 1793 (98%) | 18 (18.6%) | 2394 (36%) |
| RCA | 1525 (31%) | 8 (< 1%) | 23 (23.7%) | 1510 (23%) | ||
| LAD | 1531 (31%) | 2 (< 1%) | 37 (38.1%) | 1496 (22%) | ||
| Cx | 770 (16%) | 0 (< 1%) | 11 (11.3%) | 759 (11%) | ||
| OM | 191 (4%) | 1 (< 1%) | 1 (1%) | 191 (3%) | ||
| D | 109 (2%) | 1 (< 1%) | 0 | 110 (2%) | ||
| LM | 108 (2%) | 2 (< 1%) | 5 (5.2%) | 105 (2%) | ||
| Graft | 90 (2%) | 3 (< 1%) | 0 | 93 (1%) | ||
| 12 | History of heart failure | False | 4197 (85%) | 1511 (83%) | 66 (68%) | 5642 (85%) |
| True | 746 (15%) | 315 (17%) | 31 (32%) | 1030 (15%) | ||
| 13 | History of renal failure | False | 4633 (94%) | 1679 (92%) | 84 (86.6%) | 6228 (93%) |
| True | 310 (6%) | 147 (8%) | 13 (13.4%) | 844 (13%) | ||
| 14 | History of peripheral atherosclerosis | False | 4604 (93%) | 1674 (92%) | 89 (91.8%) | 6189 (93%) |
| True | 339 (7%) | 152 (8%) | 8 (8.2%) | 483 (7%) | ||
| 15 | History of stroke | False | 4734 (96%) | 1727 (95%) | 92 (94.8%) | 6369 (95%) |
| True | 209 (4%) | 99 (5%) | 5 (5.2%) | 303 (5%) | ||
| 16 | Death during hospitalisation | False | 4860 (98%) | 1812 (99%) | 0 | 6672 (100%) |
| True | 83 (2%) | 14 (1%) | 97 (100%) | 0 |
Best predictive performance results in fivefold cross-validation of classifiers trained on the simplified set and the full set of features
| Sensitivity [%] (recall) | Specificity [%] | Accuracy [%] | G-mean [%] | AUC | ||
|---|---|---|---|---|---|---|
| Logistic regression | 78 ± 25 | 30 ± 31 | 65 ± 10 | 48.4a | 54 ± 3 | Significant lesion |
| Xgboost | 58 ± 20 | 57 ± 8 | 57.0a | 57 ± 2 | ||
| DRSA-BRE (full set of features) | 56.9 ± 0.2 | 66.9 ± 0.2 | 59.6 ± 0.2 | 61.7 ± 0.02 | 61.9a | |
| Logistic regression | 47 ± 34 | 90 ± 11 | 89 ± 10 | 65.0a | 68 ± 11 | In-hospital death |
| Xgboost | 80 ± 9 | 79 ± 4 | 80 ± 4 | 79.5a | 78 ± 3 | |
| DRSA-BRE | 79.3 ± 1.7 | 80.6 ± 0.5 | 81.0 ± 0.5 | 79.9 ± 1 | 80.8a | |
| DRSA-BRE (full set of features) | 81.0 ± 2.4 | 81.1 ± 0.5 | 81.0 ± 0.5 | 81.0 ± 1 | 81.0a |
aIndicates that value was not directly estimated during experiments
Fig. 1Receiver operating curves presenting the performance of XGboost and logistic regression in the detection of significant coronary lesion and in-hospital death. The green dot represents the approximate performance of the DRSA-BRE classifier
Fig. 2Averaged feature importance scores for the prediction of in-hospital death provided by the XGboost classifier
Fig. 3Confirmation measures for the detection of in-hospital death provided by the DRSA-BRE classifier (full set of features)
Fig. 4Confirmation measures for the detection of in-hospital death provided by the DRSA-BRE classifier (simplified set of features)