| Literature DB >> 35355847 |
Sulaiman S Somani1, Hossein Honarvar1, Sukrit Narula2, Isotta Landi1, Shawn Lee3, Yeraz Khachatoorian4, Arsalan Rehmani3, Andrew Kim4, Jessica K De Freitas1, Shelly Teng1, Suraj Jaladanki1, Arvind Kumar1, Adam Russak1, Shan P Zhao1, Robert Freeman5, Matthew A Levin6, Girish N Nadkarni1, Alexander C Kagen7, Edgar Argulian3, Benjamin S Glicksberg1.
Abstract
Aims: Clinical scoring systems for pulmonary embolism (PE) screening have low specificity and contribute to computed tomography pulmonary angiogram (CTPA) overuse. We assessed whether deep learning models using an existing and routinely collected data modality, electrocardiogram (ECG) waveforms, can increase specificity for PE detection. Methods and results: We create a retrospective cohort of 21 183 patients at moderate- to high suspicion of PE and associate 23 793 CTPAs (10.0% PE-positive) with 320 746 ECGs and encounter-level clinical data (demographics, comorbidities, vital signs, and labs). We develop three machine learning models to predict PE likelihood: an ECG model using only ECG waveform data, an EHR model using tabular clinical data, and a Fusion model integrating clinical data and an embedded representation of the ECG waveform. We find that a Fusion model [area under the receiver-operating characteristic curve (AUROC) 0.81 ± 0.01] outperforms both the ECG model (AUROC 0.59 ± 0.01) and EHR model (AUROC 0.65 ± 0.01). On a sample of 100 patients from the test set, the Fusion model also achieves greater specificity (0.18) and performance (AUROC 0.84 ± 0.01) than four commonly evaluated clinical scores: Wells' Criteria, Revised Geneva Score, Pulmonary Embolism Rule-Out Criteria, and 4-Level Pulmonary Embolism Clinical Probability Score (AUROC 0.50-0.58, specificity 0.00-0.05). The model is superior to these scores on feature sensitivity analyses (AUROC 0.66-0.84) and achieves comparable performance across sex (AUROC 0.81) and racial/ethnic (AUROC 0.77-0.84) subgroups.Entities:
Keywords: Deep learning; Electrocardiogram; Machine learning; Pulmonary embolism
Year: 2021 PMID: 35355847 PMCID: PMC8946569 DOI: 10.1093/ehjdh/ztab101
Source DB: PubMed Journal: Eur Heart J Digit Health ISSN: 2634-3916
Baseline characteristics in the PE-positive and PE-negative cohorts
| CTPA or PE-encounter | |||||
|---|---|---|---|---|---|
| Missing | Overall | Negative | Positive |
| |
|
| 23 793 | 21 358 | 2435 | ||
| Age (years), mean (SD) | 122 | 57.9 (17.7) | 57.6 (17.8) | 60.7 (17.0) | <0.001 |
| Sex, | |||||
| Female (code = 0) | 13 | 14 786 (62.2) | 13 441 (63.0) | 1345 (55.3) | <0.001 |
| Male (code = 1) | 8994 (37.8) | 7907 (37.0) | 1087 (44.7) | ||
| Race, | |||||
| Asian | 189 | 591 (2.5) | 531 (2.5) | 60 (2.5) | <0.001 |
| Black | 6376 (27.0) | 5674 (26.8) | 702 (29.0) | ||
| Hispanic | 3486 (14.8) | 3217 (15.2) | 269 (11.1) | ||
| White | 8083 (34.2) | 7170 (33.8) | 913 (37.8) | ||
| Other | 5068 (21.5) | 4595 (21.7) | 473 (19.6) | ||
| Arrhythmia, | 2642 | 2371 (11.2) | 2112 (11.1) | 259 (11.8) | 1.000 |
| Coronary artery disease, | 2642 | 3272 (15.5) | 2950 (15.6) | 322 (14.7) | 1.000 |
| Cancer, | 2642 | 5030 (23.8) | 4507 (23.8) | 523 (23.8) | 1.000 |
| Chronic kidney disease, | 2642 | 1811 (8.6) | 1612 (8.5) | 199 (9.1) | 1.000 |
| Coagulopathy, | 2642 | 378 (1.8) | 333 (1.8) | 45 (2.1) | 1.000 |
| Chronic obstructive pulmonary disease, | 2642 | 2235 (10.6) | 2062 (10.9) | 173 (7.9) | 0.001 |
| Diabetes mellitus, | 2642 | 3752 (17.7) | 3378 (17.8) | 374 (17.0) | 1.000 |
| History of DVT or PE, | 2642 | 1875 (8.9) | 1548 (8.2) | 327 (14.9) | <0.001 |
| Congestive heart failure, | 2642 | 2704 (12.8) | 2406 (12.7) | 298 (13.6) | 1.000 |
| Hypertension, | 2642 | 7439 (35.2) | 6634 (35.0) | 805 (36.7) | 1.000 |
| Pregnancy, | 2642 | 613 (2.9) | 568 (3.0) | 45 (2.1) | 0.466 |
| Pulmonary hypertension, | 2642 | 1364 (6.4) | 1173 (6.2) | 191 (8.7) | <0.001 |
| Rheumatological disease, | 2642 | 3904 (18.5) | 3500 (18.5) | 404 (18.4) | 1.000 |
| Heart rate (b.p.m.), mean (SD) | 5031 | 90.6 (22.6) | 90.3 (22.9) | 93.1 (20.3) | <0.001 |
| Systolic blood pressure (mmHg), mean (SD) | 5175 | 128.7 (26.8) | 128.8 (27.1) | 127.4 (23.9) | 0.538 |
| Diastolic blood pressure (mmHg), mean (SD) | 5127 | 71.2 (15.1) | 71.1 (15.3) | 72.0 (14.1) | 0.241 |
| Respiration rate (breaths per minute), mean (SD) | 5227 | 19.6 (4.4) | 19.6 (4.3) | 20.3 (5.6) | <0.001 |
| Oxygen saturation (%), mean (SD) | 7592 | 96.9 (14.8) | 96.9 (13.8) | 96.9 (21.4) | 1.000 |
| BNP (pg/mL), mean (SD) | 18 986 | 437.7 (1078.4) | 449.2 (1115.4) | 381.7 (873.6) | 1.000 |
| D-dimer (µg/mL FEU), mean (SD) | 16 248 | 2.7 (7.2) | 2.3 (7.1) | 6.5 (6.5) | <0.001 |
| Troponin (ng/mL), mean (SD) | 10 375 | 0.2 (4.3) | 0.2 (4.5) | 0.3 (1.7) | 1.000 |
| PR interval length (ms), mean (SD) | 7737 | 151.5 (29.5) | 151.5 (29.3) | 151.3 (31.0) | 1.000 |
| QRS duration (ms), mean (SD) | 6402 | 89.3 (19.9) | 89.2 (20.0) | 90.0 (19.7) | 1.000 |
| QTc (ms), mean (SD) | 6402 | 450.0 (37.4) | 449.3 (36.9) | 454.8 (40.3) | <0.001 |
| Cardiac axis (degrees), mean (SD) | 6402 | 26.1 (50.0) | 26.2 (49.4) | 24.8 (54.5) | 1.000 |
| PE location, | |||||
| No PE | 0 | 21 358 (89.8) | 21 358 (100.0) | <0.001 | |
| Truncal | 126 (0.5) | 126 (5.2) | |||
| Main | 515 (2.2) | 515 (21.1) | |||
| Lobar | 692 (2.9) | 692 (28.4) | |||
| Segmental | 1102 (4.6) | 1102 (45.3) | |||
Unit of analysis is unique patient encounter or, by extension, unique CTPAs.
DVT, deep vein thrombosis; FEU, fibrinogen equivalent units.
Fusion model performance benchmarked against clinical scores
| Threshold source | Model | Threshold | TP | TN | FP | FN | Sensitivity | Specificity | PPV | NPV | McNemar test |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Cross-validation fold | Fusion model | 0.037 | 22 | 14 | 62 | 0 | 1.00 | 0.18 | 0.26 | 1.00 | (Base) |
| Fusion model (no D-dimer) | 0.040 | 22 | 14 | 62 | 0 | 1.00 | 0.18 | 0.26 | 1.00 | 1.000 | |
| Wells | Wells’ criteria | 1 | 14 | 24 | 52 | 8 | 0.64 | 0.32 | 0.21 | 0.75 | 0.005 |
| Le Gal | Revised Geneva Score | 4 | 15 | 20 | 56 | 7 | 0.68 | 0.26 | 0.21 | 0.74 | 0.041 |
| Kline | PERC | 1 | 22 | 2 | 74 | 0 | 1.00 | 0.03 | 0.23 | 1.00 | 0.185 |
| Roy | 4PEPS | 0 | 19 | 19 | 57 | 3 | 0.86 | 0.25 | 0.25 | 0.86 | 0.002 |
| Holdout test set | Fusion model | 0.091 | 22 | 43 | 33 | 0 | 1.00 | 0.57 | 0.40 | 1.00 | (Base) |
| Fusion model (no D-dimer) | 0.072 | 22 | 21 | 55 | 0 | 1.00 | 0.28 | 0.29 | 1.00 | <0.001 | |
| Wells’ criteria | 0 | 22 | 0 | 76 | 0 | 1.00 | 0.00 | 0.22 | 0.00 | <0.001 | |
| Revised Geneva Score | 0 | 22 | 0 | 76 | 0 | 1.00 | 0.00 | 0.22 | 0.00 | <0.001 | |
| PERC | 1 | 22 | 2 | 74 | 0 | 1.00 | 0.03 | 0.23 | 1.00 | <0.001 | |
| 4PEPS | −1 | 22 | 4 | 72 | 0 | 1.00 | 0.05 | 0.23 | 1.00 | <0.001 |
Encounters are classified as PE-positive if the clinical score or model likelihood is greater than or equal to the threshold value.
4PEPS, 4-Level Pulmonary Embolism Clinical Probability Score; FN, false negatives; FP, false positives; NPV, negative predictive value; PERC, Pulmonary Embolism Rule-Out Criteria; PPV, positive predictive value; TN, true negatives; TP, true positives.