| Literature DB >> 35894016 |
Xuan V Nguyen1, Engin Dikici1, Sema Candemir1, Robyn L Ball2, Luciano M Prevedello1.
Abstract
The emergence of the COVID-19 pandemic over a relatively brief interval illustrates the need for rapid data-driven approaches to facilitate clinical decision making. We examined a machine learning process to predict inpatient mortality among COVID-19 patients using clinical and chest radiographic data. Modeling was performed with a de-identified dataset of encounters prior to widespread vaccine availability. Non-imaging predictors included demographics, pre-admission clinical history, and past medical history variables. Imaging features were extracted from chest radiographs by applying a deep convolutional neural network with transfer learning. A multi-layer perceptron combining 64 deep learning features from chest radiographs with 98 patient clinical features was trained to predict mortality. The Local Interpretable Model-Agnostic Explanations (LIME) method was used to explain model predictions. Non-imaging data alone predicted mortality with an ROC-AUC of 0.87 ± 0.03 (mean ± SD), while the addition of imaging data improved prediction slightly (ROC-AUC: 0.91 ± 0.02). The application of LIME to the combined imaging and clinical model found HbA1c values to contribute the most to model prediction (17.1 ± 1.7%), while imaging contributed 8.8 ± 2.8%. Age, gender, and BMI contributed 8.7%, 8.2%, and 7.1%, respectively. Our findings demonstrate a viable explainable AI approach to quantify the contributions of imaging and clinical data to COVID mortality predictions.Entities:
Keywords: COVID; chest radiographs; computer-aided diagnosis/prognosis; explainable AI; machine learning; multi-modal analysis; transfer learning
Mesh:
Year: 2022 PMID: 35894016 PMCID: PMC9326627 DOI: 10.3390/tomography8040151
Source DB: PubMed Journal: Tomography ISSN: 2379-1381
Figure 1The overview of the mortality prediction framework. The imaging features are acquired using transfer-learned VGG-16 () and are subsequently processed in tandem with the clinical data features via an MLP to produce the patient’s mortality probability.
Figure 2Lung region detection component applied on example CXRs. First column (A-1 to A-3): example chest radiographs; middle column (B-1 to B-3): predicted lung probability maps; third column (C-1 to C-3): lung regions in bounding boxes suitable to process in a convolutional neural network.
Clinical data variables used for prediction.
| Field Type | Variable Description | |
|---|---|---|
| Age | Categorical: [18, 59], (59, 74], (74, 90] | Age intervals (in years) at admission; truncated for patients > 90 years of age. |
| Gender | Categorical: Male, Female, Unknown/Missing | Documented gender in the electronic health record; dropped in select cases for de-identification. |
| Kidney replacement therapy | Categorical: Yes, No, Unknown/Missing | Documented renal replacement therapy. |
| Kidney transplant | Categorical: Yes, No, Unknown/Missing | History of kidney transplant. |
| Hypertension | Categorical: Yes, No, Unknown/Missing | Documented ICD-10 code for hypertension, taking anti-hypertensive medications, and/or documented blood pressure > 140/90. |
| Diabetes mellitus | Categorical: Yes, No, Unknown/Missing | Documented ICD-10 code for diabetes type 1 or 2 or taking insulin or oral medications for diabetes. |
| Coronary artery disease | Categorical: Yes, No, Unknown/Missing | Documented ICD-10 code for coronary artery disease, history of stent placement, or existing catheter report documenting disease. |
| Heart failure | Categorical: HFpEF, HFrEF, No, Unknown/Missing | For HFrEF, documented ICD-10 code for HFrEF or echocardiogram documenting reduced ejection fraction (reduced EF is <40%, 40% or higher is preserved EF). For HFpEF, documented ICD-10 code for HFpEF or echocardiogram documenting diastolic dysfunction. |
| Chronic kidney disease | Categorical: Yes, No, Unknown/Missing | Documented ICD-10 code for chronic kidney disease or reduced GFR on lab work. |
| Malignancy | Categorical: Yes, No, Unknown/Missing | Documented ICD-10 code for malignancies or receiving treatment for active malignancy. |
| COPD | Categorical: Yes, No, Unknown/Missing | Documented ICD-10 code for chronic obstructive pulmonary disease or pulmonary function tests documenting obstructive defect along with positive smoking history. |
| Other lung disease | Categorical: Yes, No, Unknown/Missing | Documented ICD-10 code for other lung diseases including asthma, interstitial lung disease, pulmonary hypertension, chronic pulmonary embolism, or lung resection. |
| Smoking status | Categorical: Current, Former, Never, Unknown/Missing | Patient’s smoking status as either Current, Former, Never Smoker, or Unknown. This referred only to cigarettes and cigars. E-cigarettes and marijuana were not counted. |
| ACE inhibitor use | Categorical: Yes, No, Unknown/Missing | Admission medication reconciliation documenting use of an ACE inhibitor as a home medication. |
| Angiotensin receptor blocker use | Categorical: Yes, No, Unknown/Missing | Admission medication reconciliation documenting use of an angiotensin receptor blocker as a home medication. |
| Antibiotic use | Categorical: Yes, No, Unknown/Missing | On an antibiotic prior to presentation. |
| NSAID use | Categorical: Yes, No, Unknown/Missing | Admission medication reconciliation documenting use of a non-steroidal anti-inflammatory drug as a home medication. |
| Cough | Categorical: Yes, No, Unknown/Missing | Reported cough on admission. |
| Dyspnea on admission | Categorical: Yes, No, Unknown/Missing | Reported shortness of breath on admission. |
| Nausea | Categorical: Yes, No, Unknown/Missing | Reported nausea on admission. |
| Vomiting | Categorical: Yes, No, Unknown/Missing | Reported vomiting on admission. |
| Diarrhea | Categorical: Yes, No, Unknown/Missing | Reported diarrhea on admission. |
| Abdominal pain | Categorical: Yes, No, Unknown/Missing | Reported abdominal pain on admission. |
| Subjective fever | Categorical: Yes, No, Unknown/Missing | Subjective or objective fever at home. Fever in ED was not counted. |
| Days symptomatic | Integer value or Unknown/Missing | The number of days prior to presentation that symptoms began. |
| BMI | Categorical: <30, [30, 35], >35, Unknown/Missing | Body mass index (kg/m2) |
| HbA1c | Categorical: <6.5, [6.5, 7.9], >7.9, Unknown/Missing | Hemoglobin A1c (%) |
| Temperature over 38 C | Categorical: Yes, No, Unknown/Missing | Temperature at time of admission over 38 degrees centigrade. |
Abbreviations: HFpEF: heart failure with preserved ejection fraction; HFrEF: heart failure with reduced ejection fraction.
Characteristics of patients included in the analysis.
| In-Hospital Mortality | ||||||||
|---|---|---|---|---|---|---|---|---|
| All | Column % | Yes | Column % | No | Column % | |||
| Total patients | 841 | 180 | 661 | |||||
| Gender | <0.0001 | * | ||||||
| Male | 489 | 58% | 97 | 54% | 392 | 59% | ||
| Female | 321 | 38% | 54 | 30% | 267 | 40% | ||
| Not recorded | 31 | 4% | 29 | 16% | 2 | 0% | ||
| Age | <0.0001 | * | ||||||
| 18–59 | 380 | 45% | 29 | 16% | 351 | 53% | ||
| 60–74 | 252 | 30% | 65 | 36% | 187 | 28% | ||
| >75 | 209 | 25% | 86 | 48% | 123 | 19% | ||
| Comorbidities | ||||||||
| Hypertension | <0.0001 | * | ||||||
| Yes | 377 | 45% | 108 | 60% | 269 | 41% | ||
| No | 339 | 40% | 43 | 24% | 296 | 45% | ||
| Not recorded | 125 | 15% | 29 | 16% | 96 | 15% | ||
| Diabetes mellitus | 0.16 | NS | ||||||
| Yes | 215 | 26% | 55 | 31% | 160 | 24% | ||
| No | 503 | 60% | 97 | 54% | 406 | 61% | ||
| Not recorded | 123 | 15% | 28 | 16% | 95 | 14% | ||
| Coronary artery disease | <0.0001 | * | ||||||
| Yes | 131 | 16% | 49 | 27% | 82 | 12% | ||
| No | 582 | 69% | 99 | 55% | 483 | 73% | ||
| Not recorded | 128 | 15% | 32 | 18% | 96 | 15% | ||
| Heart failure | <0.0001 | * | ||||||
| Yes | 62 | 7% | 33 | 18% | 29 | 4% | ||
| No | 647 | 77% | 114 | 63% | 533 | 81% | ||
| Not recorded | 132 | 16% | 33 | 18% | 99 | 15% | ||
| Chronic kidney disease | 0.020 | NS | ||||||
| Yes | 69 | 8% | 23 | 13% | 46 | 7% | ||
| No | 645 | 77% | 126 | 70% | 519 | 79% | ||
| Not recorded | 127 | 15% | 31 | 17% | 96 | 15% | ||
| Malignancy | 0.034 | NS | ||||||
| Yes | 69 | 8% | 23 | 13% | 46 | 7% | ||
| No | 638 | 76% | 127 | 71% | 511 | 77% | ||
| Not recorded | 134 | 16% | 30 | 17% | 104 | 16% | ||
| Chronic obstructive pulmonary disease | 0.0053 | NS | ||||||
| Yes | 56 | 7% | 21 | 12% | 35 | 5% | ||
| No | 660 | 78% | 129 | 72% | 531 | 80% | ||
| Not recorded | 125 | 15% | 30 | 17% | 95 | 14% | ||
| Other lung disease | 0.77 | NS | ||||||
| Yes | 109 | 13% | 24 | 13% | 85 | 13% | ||
| No | 605 | 72% | 126 | 70% | 479 | 72% | ||
| Not recorded | 127 | 15% | 30 | 17% | 97 | 15% | ||
1 p values based on chi-square test of independence, with df = 2, uncorrected for multiple comparisons. The * denotes comparisons that are significant based on a Bonferroni-adjusted α of 0.005.
Figure 3ROC curves showing the performance of prediction models. (a) Performance of the model using demographic and non-imaging clinical variables only; (b) performance of the model using imaging data and demographic and other clinical variables.
Figure 4Pareto chart for the LIME analysis results for the predictive model using imaging and demographic and other clinical variables. Predictors are shown in order of descending mean contributions.
Effects of specific values of the most predictive clinical variables on increasing or decreasing mortality.
| Reduced Mortality | Increased Mortality | |
|---|---|---|
| Age | 18–59 | 74–90 |
| Gender | Female | Male |
| BMI | Below 30 | Over 30 |
| HbA1c | <6.6 | >6.6 |
Figure 5ROC curves showing performance of the prediction model utilizing combined imaging and non-imaging clinical data following partitioning based on ICU admission status. (a) Test performance with the ICU subgroup; (b) test performance with the non-ICU subgroup.