| Literature DB >> 34938570 |
Saarang Panchavati1, Carson Lam1, Nicole S Zelin1, Emily Pellegrini1, Gina Barnes1, Jana Hoffman1, Anurag Garikipati1, Jacob Calvert1, Qingqing Mao1, Ritankar Das1.
Abstract
Diagnosis and appropriate intervention for myocardial infarction (MI) are time-sensitive but rely on clinical measures that can be progressive and initially inconclusive, underscoring the need for an accurate and early predictor of MI to support diagnostic and clinical management decisions. The objective of this study was to develop a machine learning algorithm (MLA) to predict MI diagnosis based on electronic health record data (EHR) readily available during Emergency Department assessment. An MLA was developed using retrospective patient data. The MLA used patient data as they became available in the first 3 h of care to predict MI diagnosis (defined by International Classification of Diseases, 10th revision code) at any time during the encounter. The MLA obtained an area under the receiver operating characteristic curve of 0.87, sensitivity of 87% and specificity of 70%, outperforming the comparator scoring systems TIMI and GRACE on all metrics. An MLA can synthesize complex EHR data to serve as a clinically relevant risk stratification tool for MI.Entities:
Year: 2021 PMID: 34938570 PMCID: PMC8667565 DOI: 10.1049/htl2.12017
Source DB: PubMed Journal: Healthc Technol Lett ISSN: 2053-3713
Structured data extracted from the electronic health record if available in the patent record, used by the machine learning algorithm to predict myocardial infarction diagnosis
| Demographics | |
|---|---|
| Age | Sex |
|
| |
| Chest pain | |
|
| |
| Prior myocardial infarction | Diabetes mellitus |
| Hypertension | Hyperlipidemia |
| Tobacco use | |
|
| |
| Systolic blood pressure | Diastolic blood pressure |
| Heart rate | Respiratory rate |
| Peripheral oxygen saturation (SpO2) | Temperature |
|
| |
| Sodium | Troponin I |
| Potassium | Lactate |
| Blood urea nitrogen | Hematocrit |
| Creatinine | Platelet count |
| Bicarbonate | White blood cell count |
| Glucose | International normalised ratio (INR) |
| Aspartate transaminase | Blood pH |
| Alanine transaminase | Urine output |
| Total bilirubin | |
FIGURE 1Patient encounters used to train and test a machine learning algorithm to predict myocardial infarction based on electronic health data available within the first 3 h.
Demographic information for the hold out test dataset used to test the machine learning algorithm
| Patients with MI ( | Patients without MI ( |
| |
|---|---|---|---|
|
| |||
| <30 | 1 (0.4%) | 41 (2.6%) | 0.04 |
| 30–49 | 25 (9.9%) | 160 (10.0%) | 1.00 |
| 50–59 | 34 (13.4%) | 223 (13.9%) | 0.92 |
| 60–69 | 53 (20.9%) | 362 (22.6%) | 0.63 |
| 70–79 | 55 (21.7%) | 341 (21.3%) | 0.87 |
| <80 | 85 (33.6%) | 473 (29.6%) | 0.21 |
|
| |||
| Male | 169 (66.8%) | 831 (51.9%) | 0.01 |
| Female | 84 (33.2%) | 769 (48.1%) | 0.01 |
| Unknown | 0 (0.0%) | 0 (0.0%) | 1.0 |
|
| |||
| American Indian or Alaska Native | 0 (0.0%) | 0 (0.0%) | 1.0 |
| Asian | 58 (22.9%) | 379 (23.7%) | 0.87 |
| Black or African American | 23 (9.1%) | 254 (15.9%) | 0.004 |
| Native Hawaiian or Other Pacific Islander | 9 (3.6%) | 33 (2.1%) | 0.17 |
| White or Caucasian | 116 (45.8%) | 703 (43.9%) | 0.59 |
| Other | 43 (17.0%) | 211 (13.2%) | 0.11 |
| Unknown/declined | 4 (1.6%) | 20 (1.2%) | 0.56 |
|
| |||
| Hispanic or Latino | 16 (6.3%) | 140 (8.8%) | 0.61 |
|
| |||
| Obesity | 13 (5%) | 143 (9%) | 0.05 |
| Diabetes mellitus | 108 (43%) | 516 (32%) | 0.001 |
| Dyslipidemia | 130 (51%) | 560 (35%) | < 0.001 |
| Hypertension | 201 (79%) | 1126 (70%) | 0.003 |
| Peripheral vascular disease | 28 (11%) | 69 (4%) | < 0.001 |
| Angina | 52 (21%) | 96 (6%) | < 0.001 |
| Heart failure | 116 (46%) | 473 (30%) | < 0.001 |
| CKD | 100 (40%) | 422 (26%) | < 0.001 |
| HIV infection and AIDS | 7 (3%) | 51 (3%) | 0.85 |
| Dementia | 25 (10%) | 173 (11%) | 0.74 |
| COPD | 35 (14%) | 314 (20%) | 0.03 |
| Depression | 24 (9%) | 214 (13%) | 0.10 |
| Current tobacco use | 25 (10%) | 174 (11%) | 0.74 |
| Prior MI | 61 (24%) | 148 (9%) | < 0.001 |
| Prior ischemic stroke or TIA | 2 (1%) | 20 (1%) | 0.76 |
Abbreviations: Acquired immunodeficiency syndrome (AIDS); chronic kidney disease (CKD); chronic obstructive pulmonary disease (COPD); human immunodeficiency virus (HIV); myocardial infarction (MI); transient ischemic attack (TIA).
FIGURE 2Area under receiving operating characteristic curves and clinical operating points for (A) machine learning and GRACE clinical prediction model of myocardial infarction diagnosis and (B) machine learning and TIMI clinical prediction model of myocardial infarction diagnosis.
Performance metrics of machine learning algorithm and comparator models for myocardial infarction prediction
| MLA | GRACE | TIMI | |
|---|---|---|---|
|
| 0.87 | 0.61 | 0.78 |
|
| 0.87 | 0.78 | 0.84 |
|
| 0.70 | 0.33 | 0.57 |
|
| 3.0 | 1.2 | 1.9 |
|
| 0.18 | 0.67 | 0.28 |
|
| 16.5 | 1.8 | 7.0 |
|
| 0.32 | 0.16 | 0.24 |
|
| 0.97 | 0.91 | 0.96 |
Abbreviations: Area under the receiver operating characteristic (AUROC); likelihood ratio (LR); machine learning algorithm (MLA).; diagnostic odds ratio (DOR); positive predictive value (PPV); negative predictive value (NPV).
FIGURE 3Top unique feature correlations and distribution of feature importance for each patient encounter for machine learning models. Input variables are ranked in descending order of feature importance. Red indicates a high feature value and blue indicates a low feature value. Points to the right and left sides of the line of neutral contribution resulted in higher and lower prediction scores, respectively. Abbreviations: alanine aminotransferase (ALT); blood pressure (BP); heart rate (HR); international normalised ratio (INR); myocardial infarction (MI); peripheral oxygen saturation (SpO2).