| Literature DB >> 36028534 |
Masahiko Takeda1, Takehiko Oami1, Yosuke Hayashi1, Tadanaga Shimada1, Noriyuki Hattori1, Kazuya Tateishi2, Rie E Miura1,3, Yasuo Yamao1,3, Ryuzo Abe1, Yoshio Kobayashi2, Taka-Aki Nakada4,5.
Abstract
Rapid and precise prehospital recognition of acute coronary syndrome (ACS) is key to improving clinical outcomes. The aim of this study was to investigate a predictive power for predicting ACS using the machine learning-based prehospital algorithm. We conducted a multicenter observational prospective study that included 10 participating facilities in an urban area of Japan. The data from consecutive adult patients, identified by emergency medical service personnel with suspected ACS, were analyzed. In this study, we used nested cross-validation to evaluate the predictive performance of the model. The primary outcomes were binary classification models for ACS prediction based on the nine machine learning algorithms. The voting classifier model for ACS using 43 features had the highest area under the receiver operating curve (AUC) (0.861 [95% CI 0.775-0.832]) in the test score. After validating the accuracy of the model using the external cohort, we repeated the analysis with a limited number of selected features. The performance of the algorithms using 17 features remained high AUC (voting classifier, 0.864 [95% CI 0.830-0.898], support vector machine (radial basis function), 0.864 [95% CI 0.829-0.887]) in the test score. We found that the machine learning-based prehospital algorithms showed a high predictive power for predicting ACS.Entities:
Mesh:
Year: 2022 PMID: 36028534 PMCID: PMC9418242 DOI: 10.1038/s41598-022-18650-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Baseline characteristics and clinical outcomes in the internal cohort.
| ACS (n = 192) | Non-ACS (n = 363) | P value | |
|---|---|---|---|
| Age, years | 68 (58.5–77) | 73 (60–82) | 0.005 |
| Male sex, n (%) | 152 (79.2) | 214 (59.0) | < 0.001 |
| Diabetes mellitus, n (%) | 37 (19.3) | 63 (17.4) | 0.577 |
| Hypertension, n (%) | 72 (37.5) | 146 (40.2) | 0.532 |
| Dyslipidemia, n (%) | 14 (7.3) | 17 (4.7) | 0.203 |
| Stable angina, n (%) | 15 (7.8) | 60 (16.5) | 0.004 |
| Old myocardial infarction, n (%) | 24 (12.5) | 66 (18.2) | 0.084 |
| Prior PCI, n (%) | 19 (9.9) | 45 (12.4) | 0.380 |
| Prior CABG, n (%) | 2 (1.0) | 6 (1.7) | 0.566 |
| Intracranial hemorrhage, n (%) | 2 (1.0) | 3 (0.8) | 0.799 |
| Cerebral infarction, n (%) | 11 (5.7) | 20 (5.5) | 0.915 |
| Prior antiplatelet or anticoagulant therapy, n (%) | 12 (6.3) | 40 (11.0) | 0.067 |
| Heart rate (beats/min) | 74 (60–90) | 88 (72–110) | < 0.001 |
| Systolic blood pressure (mmHg) | 143 (120–169) | 147.5 (122–176) | 0.237 |
| Diastolic blood pressure (mmHg) | 87 (70–102) | 87.5 (72–102) | 0.926 |
| Body temperature (°C) | 36.0 (35.8–36.2) | 36.2 (36.0–36.8) | < 0.001 |
| Blood oxygen saturation (%) | 98 (96–99) | 97 (93–99) | < 0.001 |
| Respiratory rate (times/min) | 20 (18–24) | 20 (18–24) | 0.007 |
| Japan Coma Scale = 0, n (%) | 167 (87.0) | 293 (80.7) | 0.062 |
| Oxygen therapy, n (%) | 69 (35.9) | 143 (39.4) | 0.425 |
| ST elevation, n (%) | 94 (49.0) | 28 (7.7) | < 0.001 |
| ST depression, n (%) | 62 (32.3) | 115 (31.7) | 0.883 |
| ST change, n (%) | 156 (81.3) | 143 (39.4) | < 0.001 |
| Arrhythmia, n (%) | 44 (22.9) | 84 (23.1) | 0.953 |
| 1. Cold hands, n (%) | 76 (39.6) | 89 (24.5) | < 0.001 |
| 2. Hand moistening, n (%) | 66 (34.4) | 79 (21.7) | 0.001 |
| 3. Dyspnea, n (%) | 47 (24.5) | 121 (33.3) | 0.031 |
| 4. Palpitations, n (%) | 30 (15.6) | 97 (26.7) | 0.003 |
| 5. Throbbing pain, n (%) | 40 (20.8) | 72 (19.8) | 0.780 |
| 6. Sharp/stabbing pain, n (%) | 17 (8.9) | 25 (6.9) | 0.405 |
| 7. Positional chest pain, n (%) | 25 (13.0) | 33 (9.1) | 0.150 |
| 8. Reproduction of chest pain by palpation, n (%) | 6 (3.1) | 11 (3.0) | 0.951 |
| 9. Chest pain with breathing or cough, n (%) | 7 (3.7) | 20 (5.5) | 0.332 |
| 10. Pressing pain, n (%) | 149 (77.6) | 226 (62.3) | < 0.001 |
| 11. Nausea or vomiting, n (%) | 54 (28.1) | 55 (15.2) | < 0.001 |
| 12. Cold sweat, n (%) | 111 (57.8) | 117 (32.2) | < 0.001 |
| 13. Pain radiating to jaw or shoulder, n (%) | 31 (16.2) | 21 (5.8) | < 0.001 |
| 14. Similarity to previous ischemic episode, n (%) | 29 (15.1) | 68 (18.7) | 0.284 |
| 15. Chest pain aggravated by walk, n (%) | 22 (11.5) | 53 (14.6) | 0.303 |
| 16. Worsening pain, n (%) | 52 (27.1) | 99 (27.3) | 0.962 |
| 17. Pain at rest, n (%) | 151 (78.7) | 262 (72.2) | 0.097 |
| 18. Persistent pain, n (%) | 179 (93.2) | 291 (80.2) | < 0.001 |
| 19. Recurrent pain within 24 h, n (%) | 41 (21.4) | 64 (17.6) | 0.287 |
| 20. Chronic pain, n (%) | 16 (8.3) | 58 (16.0) | 0.012 |
| 21. Pain severity (10-point scale) | 6 (0–8) | 4 (0–7) | 0.005 |
Data are presented as median and interquartile range for continuous features.
P-values were calculated using Pearson’s chi-square test or Mann–Whitney U test.
CABG (coronary artery bypass grafting), ECG (electrocardiogram), PCI (percutaneous coronary intervention).
Prehospital diagnostic algorithms for acute coronary syndrome using 43 features.
| Models | AUC | Sensitivity | Accuracy | Specificity | F1-score | PPV | NPV |
|---|---|---|---|---|---|---|---|
| XGBoost | 0.887 | 0.819 | 0.887 | 0.826 | 0.755 | 0.712 | 0.890 |
| Logistic regression | 0.893 | 0.818 | 0.893 | 0.807 | 0.762 | 0.698 | 0.906 |
| Random forest | 0.922 | 0.860 | 0.922 | 0.874 | 0.805 | 0.781 | 0.909 |
| SVM (Linear) | 0.894 | 0.822 | 0.894 | 0.823 | 0.762 | 0.713 | 0.898 |
| SVM (radial basis function) | 0.902 | 0.842 | 0.902 | 0.836 | 0.790 | 0.737 | 0.916 |
| MLP | 0.893 | 0.829 | 0.893 | 0.826 | 0.772 | 0.722 | 0.905 |
| LDA | 0.890 | 0.826 | 0.890 | 0.834 | 0.763 | 0.723 | 0.894 |
| LGBM | 0.894 | 0.823 | 0.894 | 0.819 | 0.764 | 0.711 | 0.902 |
| Voting | 0.927 | 0.852 | 0.927 | 0.839 | 0.804 | 0.744 | 0.928 |
| XGBoost | 0.849 | 0.756 | 0.792 | 0.811 | 0.715 | 0.684 | 0.864 |
| Random forest | 0.850 | 0.755 | 0.798 | 0.821 | 0.725 | 0.711 | 0.865 |
| Logistic regression | 0.843 | 0.740 | 0.780 | 0.801 | 0.703 | 0.693 | 0.857 |
| SVM (Linear) | 0.847 | 0.745 | 0.789 | 0.813 | 0.709 | 0.690 | 0.861 |
| SVM (radial basis function) | 0.834 | 0.735 | 0.791 | 0.821 | 0.708 | 0.687 | 0.855 |
| MLP | 0.834 | 0.709 | 0.786 | 0.826 | 0.695 | 0.695 | 0.846 |
| LDA | 0.860 | 0.761 | 0.802 | 0.823 | 0.727 | 0.706 | 0.870 |
| LGBM | 0.841 | 0.756 | 0.778 | 0.791 | 0.705 | 0.671 | 0.860 |
| Voting | 0.861 | 0.772 | 0.803 | 0.821 | 0.733 | 0.711 | 0.873 |
| XGBoost | 0.840 | 0.897 | 0.790 | 0.697 | 0.800 | 0.722 | 0.885 |
| Random forest | 0.803 | 0.690 | 0.726 | 0.758 | 0.702 | 0.714 | 0.735 |
| Logistic regression | 0.831 | 0.793 | 0.758 | 0.727 | 0.754 | 0.719 | 0.800 |
| SVM (Linear) | 0.838 | 0.793 | 0.758 | 0.727 | 0.754 | 0.719 | 0.800 |
| SVM (radial basis function) | 0.808 | 0.828 | 0.742 | 0.667 | 0.750 | 0.686 | 0.815 |
| MLP | 0.818 | 0.793 | 0.758 | 0.727 | 0.754 | 0.719 | 0.800 |
| LDA | 0.832 | 0.862 | 0.774 | 0.697 | 0.781 | 0.714 | 0.852 |
| LGBM | 0.789 | 0.552 | 0.742 | 0.909 | 0.667 | 0.842 | 0.698 |
| Voting | 0.828 | 0.862 | 0.790 | 0.727 | 0.794 | 0.735 | 0.857 |
AUC (area under the receiver operating characteristic curve), LDA (linear discriminant analysis), LGBM (light gradient boosting machine), MLP (multilayer perceptron), NPV (negative predictive values), PPV (posititive predictive values), SVM (support vector machine), XGBoost (eXtreme Gradient Boosting).
Figure 1Relationship between the number of features and the area under the receiver operating characteristic curve for the prediction algorithm. The line plot depicts sequential changes in the AUC with the number of features for the prediction algorithm in (a) the training score (blue) and (b) the test score (yellow). The dotted vertical line indicates the highest predictive value in the test score. (n = 17, AUC of the training score = 0.881, AUC of the test score = 0.859). The error bars indicate 95% confidence intervals. AUC (area under the receiver operating characteristic curve).
Figure 2Receiver operating characteristic curve of prehospital diagnostic algorithms for acute coronary syndrome with 17 features. ROC curves of the top six machine learning algorithms for the prehospital prediction of ACS using 17 features are shown. The ROC curves are depicted at 1-specificity on the x-axis and sensitivity on the y-axis using (a) the training score, (b) the test score, and (c) external cohort score. AUC is presented with 95% confidence interval. ACS (acute coronary syndrome), AUC (area under the receiver operating characteristic curve), CI (confidence interval), LDA (linear discriminant analysis), LR (logistic regression), MLPC (multilayer perceptron classifier), ROC (receiver operating characteristic), SVM (R) (support vector machine radial basis function), VC (voting classifier), XGB (eXtreme Gradient Boosting).
Figure 3SHAP values of the prehospital diagnostic algorithm for acute coronary syndrome using 17 features. The impact of the features on the model output was expressed as the SHAP value calculated with the support vector machine (radial basis function). The features are placed in descending order according to their importance. The association between the feature value and SHAP value indicates a positive or negative impact of the predictors. The extent of the value is depicted as red (high) or blue (low) plots. SHAP (SHapley Additive exPlanation).