| Literature DB >> 32864270 |
Joseph Paul Cohen1, Lan Dao2, Karsten Roth3, Paul Morrison4, Yoshua Bengio1, Almas F Abbasi5, Beiyi Shen5, Hoshmand Kochi Mahsa5, Marzyeh Ghassemi6, Haifang Li5, Tim Duong5.
Abstract
Introduction The need to streamline patient management for coronavirus disease-19 (COVID-19) has become more pressing than ever. Chest X-rays (CXRs) provide a non-invasive (potentially bedside) tool to monitor the progression of the disease. In this study, we present a severity score prediction model for COVID-19 pneumonia for frontal chest X-ray images. Such a tool can gauge the severity of COVID-19 lung infections (and pneumonia in general) that can be used for escalation or de-escalation of care as well as monitoring treatment efficacy, especially in the ICU. Methods Images from a public COVID-19 database were scored retrospectively by three blinded experts in terms of the extent of lung involvement as well as the degree of opacity. A neural network model that was pre-trained on large (non-COVID-19) chest X-ray datasets is used to construct features for COVID-19 images which are predictive for our task. Results This study finds that training a regression model on a subset of the outputs from this pre-trained chest X-ray model predicts our geographic extent score (range 0-8) with 1.14 mean absolute error (MAE) and our lung opacity score (range 0-6) with 0.78 MAE. Conclusions These results indicate that our model's ability to gauge the severity of COVID-19 lung infections could be used for escalation or de-escalation of care as well as monitoring treatment efficacy, especially in the ICU. To enable follow up work, we make our code, labels, and data available online.Entities:
Keywords: chest x-ray; covid-19 pneumonia; deep learning artificial intelligence; severity scoring
Year: 2020 PMID: 32864270 PMCID: PMC7451075 DOI: 10.7759/cureus.9448
Source DB: PubMed Journal: Cureus ISSN: 2168-8184
Figure 1Diagram of features extracted from the images
The two dataset blocks show that COVID-19 images were not used to train the neural network. The network diagram is split into three sections. The feature extraction layers are convolutional layers which transform the image into a 1024 dimensional vector which is called the intermediate network features. These features are then transformed using the task prediction layer (a sigmoid function for each task) into the outputs for each task. The different groupings of outputs used in this work are shown.
Performance metrics of each set of features for the Opacity Score and Geographic Extent prediction
Evaluation is performed on 50 randomly chosen train test splits and the metrics here are computed on a hold out test set.
R2: coefficient of determination; MAE: mean absolute error; MSE: mean squared error. “four outputs” refers to lung opacity, pneumonia, infiltration, and consolidation
| Task | Using features: | # parameters (fewer is better) | Pearson Correlation | R2 | MAE | MSE |
| Opacity Score | "lung opacity" output | 1+1 | 0.78±0.04 | 0.58±0.09 | 0.78±0.05 | 0.86±0.11 |
| 4 outputs | 4+1 | 0.78±0.04 | 0.58±0.09 | 0.76±0.05 | 0.87±0.12 | |
| 18 outputs | 18+1 | 0.73±0.09 | 0.44±0.16 | 0.86±0.11 | 1.15±0.33 | |
| Intermediate network features | 1024+1 | 0.66±0.08 | 0.25±0.21 | 1.01±0.09 | 1.54±0.28 | |
| No data | 0+1 | -0.00±0.00 | -0.08±0.10 | 1.24±0.10 | 2.26±0.36 | |
| Geographic Extent | "lung opacity" output | 1+1 | 0.80±0.05 | 0.60±0.09 | 1.14±0.11 | 2.06±0.34 |
| 4 outputs | 4+1 | 0.79±0.05 | 0.57±0.10 | 1.19±0.11 | 2.17±0.37 | |
| 18 outputs | 18+1 | 0.76±0.08 | 0.47±0.16 | 1.32±0.17 | 2.73±0.89 | |
| Intermediate network features | 1024+1 | 0.74±0.08 | 0.43±0.16 | 1.36±0.13 | 2.88±0.58 | |
| No data | 0+1 | 0.00±0.00 | -0.08±0.10 | 2.00±0.17 | 5.60±0.95 |
Figure 2Scatter plots showing alignment between our best model predictions and human annotation (ground truth) for Geographical Extent and Opacity scores
Evaluation is on a hold out test set. The grey dashed line is a perfect prediction. Red lines indicate error from a perfect prediction.
R2: coefficient of determination.
Figure 3Feature representations visualized with geographic extent prediction and survival outcomes
A spatial representation of pneumonia specific features (lung opacity, pneumonia, infiltration, and consolidation) when projected into 2 dimensions (2D) using a t-distributed stochastic neighbor embedding (t-SNE). In this 2D space, the high dimensional (4D) distances are preserved, specifically what is nearby. CXR images which have similar outputs are close to each other. Features are extracted for all 208 images in the dataset and the geographic extent prediction is shown for each image. The survival information available in the dataset represented by the shape of the marker.
Figure 4Saliency maps of model predictions
Examples of correct (a,b) and incorrect (c,d) predictions by the model are shown with a saliency map generated by computing the gradient of the output prediction with respect to the input image and then blurred using a 5x5 Gaussian kernel. The assigned and predicted scores for Geographic Extent are shown to the right.