| Literature DB >> 35005676 |
Arun R Sridhar1, Zih-Hua Chen Amber2, Jacob J Mayfield1, Alison E Fohner3, Panagiotis Arvanitis4, Sarah Atkinson1, Frieder Braunschweig5,6, Neal A Chatterjee1, Alessio Falasca Zamponi4, Gregory Johnson7, Sanika A Joshi2, Mats C H Lassen8, Jeanne E Poole1, Christopher Rumer1, Kristoffer G Skaarup8, Tor Biering-Sørensen8, Carina Blomstrom-Lundqvist4, Cecilia M Linde5,6, Mary M Maleckar9, Patrick M Boyle2,10,11.
Abstract
BACKGROUND: Adverse events in COVID-19 are difficult to predict. Risk stratification is encumbered by the need to protect healthcare workers. We hypothesize that artificial intelligence (AI) can help identify subtle signs of myocardial involvement in the 12-lead electrocardiogram (ECG), which could help predict complications.Entities:
Keywords: 12-lead ECG; Arrhythmia; Artificial intelligence; COVID-19; Deep learning; Heart failure prognosis; Mortality; Risk factors
Year: 2021 PMID: 35005676 PMCID: PMC8719367 DOI: 10.1016/j.cvdhj.2021.12.003
Source DB: PubMed Journal: Cardiovasc Digit Health J ISSN: 2666-6936
Demographics, comorbidities, and outcomes, overall and by enrolling hospital system
| N | Overall | UW | KI | UU | UC | Missing (%) | |
|---|---|---|---|---|---|---|---|
| 1386 | 420 | 481 | 308 | 177 | |||
| Age (years), mean (SD) | 63.43 (16.90) | 63.26 (16.41) | 59.91 (17.04) | 65.88 (17.96) | 69.15 (13.24) | <.001 | 0.0 |
| Sex at birth = female, n (%) | 547 (39.5) | 169 (40.2) | 161 (33.5) | 134 (43.5) | 83 (46.9) | .004 | 0.0 |
| BMI (kg/m2), mean (SD) | 28.38 (6.58) | 29.70 (7.81) | 28.63 (5.96) | 27.85 (6.90) | 27.06 (5.79) | .001 | 20.8 |
| Ethnicity, n (%) | - | 13.1 | |||||
| Hispanic/Latinx | 85 (7.1) | 84 (20.2) | 0 (0.0) | 1 (0.3) | - | ||
| Non-Hispanic/Latinx | 609 (50.5) | 327 (78.6) | 13 (2.7) | 269 (87.3) | - | ||
| Unknown / unavailable | 511 (42.4) | 5 (1.2) | 468 (97.3) | 38 (12.3) | - | ||
| Race, n (%) | |||||||
| FN/AK Native | 9 (0.6) | 9 (2.1) | 0 (0.0) | 0 (0.0) | - | <.001 | 0.0 |
| Asian | 118 (8.5) | 71 (16.9) | 7 (1.5) | 40 (13.0) | - | <.001 | 0.0 |
| Black/AA | 74 (5.3) | 57 (13.6) | 11 (2.3) | 6 (1.9) | - | <.001 | 0.0 |
| HI FN/Pac Isl | 7 (0.5) | 7 (1.7) | 0 (0.0) | 0 (0.0) | - | .001 | 0.0 |
| White | 694 (50.1) | 265 (63.1) | 228 (47.4) | 201 (65.3) | - | <.001 | 0.0 |
| Other | 57 (4.1) | 3 (0.7) | 50 (10.4) | 4 (1.3) | - | <.001 | 0.0 |
| Unknown / unavailable | 252 (18.2) | 8 (1.9) | 187 (38.9) | 57 (18.5) | - | <.001 | 0.0 |
| Hypertension (%) | 737 (54.0) | 214 (53.2) | 236 (49.3) | 184 (59.7) | 103 (58.2) | .021 | 1.4 |
| CAD (%) | 187 (13.8) | 59 (14.8) | 65 (13.7) | 48 (15.6) | 15 ( 8.5) | .144 | 2.1 |
| CIED (%) | - | 13.3 | |||||
| Pacemaker | 22 (1.8) | 9 (2.2) | 8 (1.7) | 5 (1.6) | - | ||
| ICD | 5 (0.4) | 4 (1.0) | 1 (0.2) | 0 (0.0) | - | ||
| Arrhythmic event (%) | 125 (9.0) | 28 (6.7) | 48 (10.0) | 36 (11.7) | 13 (7.3) | .084 | 0.0 |
| TE (%) | 132 (9.5) | 36 (8.6) | 52 (10.8) | 29 (9.4) | 15 (8.5) | .660 | 0.0 |
| HF (%) | 109 (7.9) | 19 (4.5) | 58 (12.1) | 23 (7.5) | 9 (5.1) | <.001 | 0.0 |
| Mortality (%) | 245 (17.7) | 88 (21.0) | 73 (15.2) | 69 (22.4) | 15 (8.5) | <.001 | 0.0 |
P values are for tests of differences between centers (continuous variables: ANOVA; categorical variables: χ2).
AA = African American; BMI = body mass index; CAD = coronary artery disease; CIED = cardiac implanted electronic device; FN/AK Native = First Nations or Alaskan Native; HF = heart failure; HI FN/Pac Isl = Hawaiian First Nations / Pacific Islander; ICD = implanted cardioverter-defibrillator; KI = Karolinska Institutet; TE = thromboembolic event; UC = University of Copenhagen; UU = Uppsala University; UW = University of Washington.
Figure 1Flowchart showing inclusion of patients into databases for both classification problems to be addressed via artificial intelligence–based predictive modeling. The left branch of the tree concerns the first convolutional neural network with long short-term memory (CNN-LSTM1), concerned with differentiating between electrocardiograms (ECGs) from patients who survived vs died. The right branch describes the database used for CNN-LSTM2, which independently predicts the likelihood that each ECG belongs to a patient from 4 groups (no event vs major adverse cardiovascular events [MACE], as shown in legend). AE = arrhythmic event; HF = heart failure; TE = thromboembolic event.
Figure 2Schematics illustrating machine learning network architectures for convolutional neural network long short-term memories 1 and 2 (LSTM1/2). Each network consists of 3 sections: (I) convolution layers, shown here as “feature maps” and “pooling” subsections; (II) recurrent neural network layers (labeled “Long-Short Term Memory”); and (III) fully connected layers that produce outputs (ie, predicted probabilities for each class). AE = arrhythmic event; HF = heart failure; ReLu = rectified linear unit; TE = thromboembolic event.
Figure 3Data summarizing predictive power of artificial intelligence–based and conventional models. A: Receiver operator characteristic (ROC) curves for convolutional neural network long short-term memory 1 (CNN-LSTM1) and the corresponding conventional model, which attempted to differentiate between electrocardiograms of patients who survived vs died. B: False-positive and false-negative rates ( and , respectively) for CNN-LSTM1 as a function of model threshold; at a nominal threshold (dashed yellow line) associated with a 20% , the corresponding (∼70%) is shown by a dashed black arrow. C: ROC curves for CNN-LSTM2 (multilabel model independent prediction of different major adverse cardiovascular event types; single curve derived via macro-averaging) and the corresponding conventional model (binary prediction: any event vs no event). See Figure 4 and Supplemental Figure 2 for additional plots. AUC = area under the curve; MACE = major adverse cardiovascular event.
Figure 4Individual receiver operator characteristic (ROC) curves for the 4 independent classification tasks performed by convolutional neural network long short-term memory 2 (CNN-LSTM2). A: Prediction of no event during hospitalization from intake electrocardiogram. B: Prediction of arrhythmic event. C: Prediction of thromboembolic event. D: Prediction of heart failure event. AUC = area under the curve; MACE = major adverse cardiovascular event.
Figure 5Violin plots with box-and-whisker annotations showing raw network outputs. A: Results for classification task (survival vs death) in convolutional neural network long short-term memory 1 (CNN-LSTM1). B, C: Results for 2 classification tasks in CNN-LSTM2 (B: arrhythmic events; C: heart failure events). X-axis labels show ground truth labels; dashed lines in each panel show optimal classification thresholds, as explained in Methods. See Supplemental Figure 3 for additional plots. AE = arrhythmic event; HF = heart failure; TE = thromboembolic event.
Figure 6Summary data for holdout testing of both artificial intelligence–based models trained with electrocardiograms (ECGs) from before a cutoff date (June 9, 2020) and tested with ECGs from after that date. A: Receiver operator characteristic (ROC) curve for convolutional neural network long short-term memory 1 (CNN-LSTM1) trained and tested using the holdout protocol defined above. B: Same as panel A but for CNN-LSTM2. Both macro-averaged and individual event type ROCs are shown superimposed on this plot. AUC = area under the curve; MACE = major adverse cardiovascular event.