| Literature DB >> 31390040 |
Imon Banerjee1,2, Miji Sofela3, Jaden Yang4, Jonathan H Chen5, Nigam H Shah5, Robyn Ball4, Alvin I Mushlin6, Manisha Desai4, Joseph Bledsoe7, Timothy Amrhein8, Daniel L Rubin1,2, Roham Zamanian9, Matthew P Lungren2.
Abstract
Importance: Pulmonary embolism (PE) is a life-threatening clinical problem, and computed tomographic imaging is the standard for diagnosis. Clinical decision support rules based on PE risk-scoring models have been developed to compute pretest probability but are underused and tend to underperform in practice, leading to persistent overuse of CT imaging for PE. Objective: To develop a machine learning model to generate a patient-specific risk score for PE by analyzing longitudinal clinical data as clinical decision support for patients referred for CT imaging for PE. Design, Setting, and Participants: In this diagnostic study, the proposed workflow for the machine learning model, the Pulmonary Embolism Result Forecast Model (PERFORM), transforms raw electronic medical record (EMR) data into temporal feature vectors and develops a decision analytical model targeted toward adult patients referred for CT imaging for PE. The model was tested on holdout patient EMR data from 2 large, academic medical practices. A total of 3397 annotated CT imaging examinations for PE from 3214 unique patients seen at Stanford University hospitals and clinics were used for training and validation. The models were externally validated on 240 unique patients seen at Duke University Medical Center. The comparison with clinical scoring systems was done on randomly selected 100 outpatient samples from Stanford University hospitals and clinics and 101 outpatient samples from Duke University Medical Center. Main Outcomes and Measures: Prediction performance of diagnosing acute PE was evaluated using ElasticNet, artificial neural networks, and other machine learning approaches on holdout data sets from both institutions, and performance of models was measured by area under the receiver operating characteristic curve (AUROC).Entities:
Mesh:
Year: 2019 PMID: 31390040 PMCID: PMC6686780 DOI: 10.1001/jamanetworkopen.2019.8719
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Figure 1. Overview of the Pulmonary Embolism (PE) Prediction Pipeline
A, Pulmonary Embolism Result Forecast Model (PERFORM): the proposed workflow begins with all raw structured electronic medical record (EMR) data within 1 year prior to the encounter that is then arranged as a timeline into feature vectors. A machine learning model is then trained with the feature vectors labeled with ground truth PE outcome data to arrive at a model capable of predicting the probability of PE for an unseen patient from the holdout internal and external data set. This can be applied to provide a risk score in clinical decision support for patients referred for computed tomographic (CT) imaging for PE. B, Overview of the temporal feature engineering (each type of EMR data is color coded) and example encounters of 2 patients and how the features have been computed by preserving temporal sequence. CTA indicates CT angiography; ED, emergency department; ICD, International Classification of Diseases; and PERC, Pulmonary Embolism Rule-out Criteria.
Stratified Patient Characteristics of the Internal (SHC) and External (Duke) Data Set
| Variable | No. (%) | |||
|---|---|---|---|---|
| SHC Inpatients (n = 3214) | Duke Inpatients (n = 240) | SHC Outpatients (n = 100) | Duke Outpatients (n = 101) | |
| Age, mean (SD) | 60.53 (19.43) | 70.2 (14.2) | 57.74 (19.87) | 73.06 (15.3) |
| Sex | ||||
| Men | 1510 (47.0) | 108 (45.0) | 33 (33) | 42 (41.83) |
| Women | 1704 (53.0) | 132 (55.0) | 67 (67.0) | 59 (58.4) |
| Race/ethnicity | ||||
| White | 1952 (60.73) | 120 (50.0) | 59 (59.0) | 56 (55.4) |
| Black | 194 (6.03) | 97 (40.4) | 11 (11.0) | 42 (41.6) |
| Asian | 342 (10.65) | 0 | 6 (6.0) | 0 |
| Native American | 65 (0.2) | 10 (0.4) | 5 (5.0) | 0 |
| Other | 413 (12.83) | 10 (0.4) | 9 (9.0) | 1 (0.01) |
| Unknown | 248 (7.71) | 3 (1.3) | 10 (10.0) | 2 (1.1) |
| Total | 637 (19.8) | 31 (13.11) | 29 (29.0) | 32 (31.7) |
| Negative | 42 (6.6) | 30 (96.8) | 2 (2.0) | 4 (4.0) |
| Prior diseases of pulmonary circulation | 495 (15.4) | 35 (14.6) | 55 (55.0) | 13 (12.9) |
| Cancer | 1376 (42.8) | 61 (25.4) | 34 (34.0) | 21 (20.8) |
| Anticoagulant therapy | 1180 (36.7) | 53 (22.1) | 55 (55.0) | 27 (26.53) |
| Ground truth: acute PE-positive | 1967 (61.2) | 38 (15.8) | 29 (29.0) | 23 (22.77) |
Abbreviations: Duke, Duke University Medical Center; PE, pulmonary embolism; SHC, Stanford University hospital and clinics.
Quantitative Analysis of Model Performance
| Variable | SHC Data | Duke Data | ||
|---|---|---|---|---|
| AUROC | AUROC | |||
| ElasticNet model | 0.93 | .01 | 0.70 | .17 |
| PE neural model | 0.85 | 0.72 | ||
| Machine learning models | ||||
| ElasticNet model | 0.73 | .42 | 0.74 | .01 |
| PE neural model | 0.81 | 0.81 | ||
| Clinical scoring | ||||
| Wells | 0.48 | NA | 0.51 | NA |
| PERC | 0.51 | 0.60 | ||
| rGeneva | 0.53 | 0.47 | ||
Abbreviations: AUROC, area under the receiver operating characteristic curve; Duke, Duke University Medical Center; NA, not applicable; PE, pulmonary embolism; PERC, Pulmonary Embolism Rule-out Criteria; rGeneva, revised Geneva; SHC, Stanford University hospital and clinics.
Figure 2. Performance in Setting 1 and 2
A and B, Area under the receiver operating characteristic (AUROC) performance of ElasticNet model (A) and pulmonary embolism (PE) Neural model (B) on holdout internal Stanford hospital and clinics (SHC) and external Duke validation. C and D, Performance in setting 3: comparison with 3 clinical scoring systems on 100 outpatient samples (medical record manually reviewed by experts) seen at SHC (C) and 101 from Duke (D): PERC (Pulmonary Embolism Rule-out Criteria), Wells, rGeneva against our proposed ElasticNet and PE neural model.
Figure 3. Performance in Setting 1 and 2
Distribution of positive and negative cases in decile of predicted probability value on the primary y-axis and true number of negative computed tomographic (CT) examinations in each decile fitted as a curve on the secondary y-axis: proposed ElasticNet model for the holdout test set from Stanford (A) and Duke (B) and pulmonary embolism (PE) Neural model for the holdout test set from Stanford (C) and Duke (D).