| Literature DB >> 33091743 |
Hanqing Chao1, Xi Fang1, Jiajin Zhang1, Fatemeh Homayounieh2, Chiara D Arru2, Subba R Digumarthy2, Rosa Babaei3, Hadi K Mobin3, Iman Mohseni3, Luca Saba4, Alessandro Carriero5, Zeno Falaschi5, Alessio Pasche5, Ge Wang1, Mannudeep K Kalra6, Pingkun Yan7.
Abstract
While image analysis of chest computed tomography (CT) for COVID-19 diagnosis has been intensively studied, little work has been performed for image-based patient outcome prediction. Management of high-risk patients with early intervention is a key to lower the fatality rate of COVID-19 pneumonia, as a majority of patients recover naturally. Therefore, an accurate prediction of disease progression with baseline imaging at the time of the initial presentation can help in patient management. In lieu of only size and volume information of pulmonary abnormalities and features through deep learning based image segmentation, here we combine radiomics of lung opacities and non-imaging features from demographic data, vital signs, and laboratory findings to predict need for intensive care unit (ICU) admission. To our knowledge, this is the first study that uses holistic information of a patient including both imaging and non-imaging data for outcome prediction. The proposed methods were thoroughly evaluated on datasets separately collected from three hospitals, one in the United States, one in Iran, and another in Italy, with a total 295 patients with reverse transcription polymerase chain reaction (RT-PCR) assay positive COVID-19 pneumonia. Our experimental results demonstrate that adding non-imaging features can significantly improve the performance of prediction to achieve AUC up to 0.884 and sensitivity as high as 96.1%, which can be valuable to provide clinical decision support in managing COVID-19 patients. Our methods may also be applied to other lung diseases including but not limited to community acquired pneumonia. The source code of our work is available at https://github.com/DIAL-RPI/COVID19-ICUPrediction.Entities:
Keywords: Artificial intelligence; COVID-19; Chest CT; Outcome prediction
Mesh:
Year: 2020 PMID: 33091743 PMCID: PMC7553063 DOI: 10.1016/j.media.2020.101844
Source DB: PubMed Journal: Med Image Anal ISSN: 1361-8415 Impact factor: 8.545
Statistics (mean ± std, except for gender) of DVB features for Site A dataset.
| ICU admission | Not admitted | ICU admitted | Data # |
|---|---|---|---|
| Gender (M:F) | 43: 28 | 29: 13 | 113 |
| Age (year) | 56.7 ± 16.0 | 66.9 ± 16.2 | 113 |
| Lym_r (%) | 22.7 ± 8.3 | 15.6 ± 12.8 | 113 |
| WBC | 5831.0 ± 1848.9 | 7966.7 ± 4556.2 | 113 |
| Lym | 1244.7 ± 482.8 | 1010.4 ± 943.7 | 113 |
| Temperature (∘) | 37.3 ± 0.6 | 37.6 ± 0.6 | 98 |
| SpO2 (%) | 91.9 ± 7.41 | 86.5 ± 8.53 | 100 |
Statistics (mean ± std, except for gender) of DVB features for Site B dataset.
| ICU admission | Not admitted | ICU admitted | Data # |
|---|---|---|---|
| Gender (M:F) | 23: 24 | 39: 39 | 125 |
| Age (year) | 74.8 ± 15.0 | 72.7 ± 11.1 | 125 |
| Lym_r (%) | 18.6 ± 12.7 | 13.0 ± 12.8 | 125 |
| WBC | 7175.7 ± 4288.9 | 11722.3 ± 7249.3 | 125 |
| Lym | 1058.1 ± 596.7 | 1613.8 ± 3872.7 | 125 |
Statistics (mean ± std, except for gender) of DVB features for Site C dataset.
| ICU admission | Not admitted | ICU admitted | Data # |
|---|---|---|---|
| Gender (M:F) | 13: 8 | 24: 12 | 57 |
| Age (year) | 70.0 ± 13.7 | 66.9 ± 12.3 | 57 |
| Temperature (∘) | 39.0 ± 1.0 | 37.8 ± 0.9 | 50 |
| SpO2 (%) | 92.3 ± 5.25 | 84.5 ± 7.74 | 31 |
Fig. 1Framework of the proposed methods including the utilized inputs and expected output.
Fig. 2Lung lobes and pulmonary opacities segmentation results. Areas colored in magenta indicate the segmented lesions. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Radiomics feature types and number of each kind of features.
| Group | Feature type | # features | Sum |
|---|---|---|---|
| Texture | First order | 18 | 93 |
| GLCM | 24 | ||
| GLRLM | 16 | ||
| GLSZM | 16 | ||
| NGTDM | 5 | ||
| GLDM | 14 | ||
| Shape | Shape (3D) | 17 | 17 |
Image filter types and extracted radiomics features from each type of filtered images.
| Image filter type | Extracted features | # features |
|---|---|---|
| No filter (Original image) | Texture + Shape | |
| Square filter | Texture | 93 |
| Square-root(Sqrt) filter | Texture | 93 |
| Logarithm filter | Texture | 93 |
| Exponential filter | Texture | 93 |
| Wavelet filters (HHH, HHL, HLH, LHH, HLL, LHL, LLH, LLL) | Texture | |
| Laplacian of Gaussian (LoG) filters | Texture |
Fig. 3Architecture of the Wide & Deep Net (Cheng et al., 2016) based deep neural network (DNN). Three different kinds of features are first processed separately by one or two fully connected layers. Then the learned features are concatenated for the final prediction.
Fig. 4ROC curves of various feature combinations on Site A dataset. DVB: non-imaging features including Demographic data, Vital signals and Blood test results; HLQ: Hierarchical Lobe-wise Quantification features; WLR: Whole Lung Radiomics features.
Comparison among the features used in exist state-of-the-art works and different combinations of the proposed features on ICU admission prediction on Site A dataset. One-tailed t-test is used to evaluate the statistical significance between a feature combination and the best performer.
| Features | AUC | Sensitivity (PPV = 70%) | K | ||||
|---|---|---|---|---|---|---|---|
| Mean | 95% CI | Mean | 95% CI | ||||
| Img feature (Tang 2020) | 0.818 | (0.796, 0.839) | 51.0% | (39.3%, 62.6%) | 8 | ||
| Img feature (Zhu 2020) | 0.776 | (0.762, 0.790) | 48.6% | (35.4%, 61.7%) | 46 | ||
| DVB | 0.855 | (0.844, 0.866) | 76.7% | (73.2%, 80.1%) | 1 | ||
| HLQ | 0.789 | (0.781, 0.797) | 51.4% | (45.3%, 57.5%) | 21 | ||
| WLR | 0.859 | (0.843, 0.873) | 71.4% | (60.5%, 82.3%) | 70 | ||
| WLR + HLQ | 0.866 | (0.857, 0.875) | 68.6% | (57.6%, 79.5%) | 61 | ||
| WLR + DVB | (0.867, 0.886) | (76.0%, 86.8%) | 4 | ||||
| HLQ + DVB | (0.844, 0.885) | 70.0% | (60.9%, 79.1%) | 4 | |||
| WLR + HLQ + DVB | (0.875, 0.893) | – | (79.9%, 88.7%) | – | 52 | ||
Fig. 5Variation of AUC along choosing the top K features.
The top 52 features ranked by feature ranking strategy introduced in Section 3.5 on Site A dataset. The third and sixth columns show the Gini importance of the corresponding feature averaged in the 5-fold cross validation.
Red text indicates non-imaging features. Green text indicates lobe-wise quantification features, HU1-HU4 are the four HU intervals. Blue text indicates whole lung radiomics features encoded as Filter-FeatureType-Parameter.
Comparison of different machine learning methods with selected features on Site A dataset. One-tailed t-test is used to evaluate the statistical significance between the results of random forest and other methods.
| Methods | AUC | Sensitivity (PPV = 70%) | ||||
|---|---|---|---|---|---|---|
| Mean | 95% CI | Mean | 95% CI | |||
| Random Forests | (0.875, 0.893) | – | (79.9%, 88.7%) | – | ||
| SVM | 0.867 | (0.855, 0.880) | 71.0% | (64.9%, 77.0%) | ||
| Logistic Regression | 0.785 | (0.758, 0.812) | 31.0% | (14.8%, 47.1%) | ||
| DNN (small) | 0.816 | (0.804, 0.828) | – | – | ||
| DNN w/ all features | 0.751 | (0.723, 0.779) | 25.7% | (0.8%, 50.6%) | ||
| WD Net w/ all features | 0.823 | (0.807, 0.838) | 58.1% | (40.7%, 75.5%) | ||
Comparison among the features used in exist state-of-the-art works and different combinations of the proposed features on ICU admission prediction on Site B dataset.
| Features | AUC | Sensitivity (PPV = 70%) | K | ||||
|---|---|---|---|---|---|---|---|
| Mean | 95% CI | Mean | 95% CI | ||||
| Img feature (Tang 2020) | 0.770 | (0.745, 0.796) | 83.1% | (75.8%, 90.4%) | 10 | ||
| Img feature (Zhu 2020) | 0.767 | (0.752, 0.781) | 83.8% | (82.2%, 85.5%) | 39 | ||
| DVB | 0.671 | (0.643, 0.700) | 78.7% | (69.7%, 87.7%) | 4 | ||
| HLQ | 0.791 | (0.774, 0.809) | 84.6% | (81.3%, 88.0%) | 3 | ||
| WLR | 0.841 | (0.827, 0.855) | (93.4%, 96.3%) | – | 55 | ||
| WLR + HLQ | (0.833, 0.861) | – | (89.5%, 95.7%) | 12 | |||
| WLR+DVB | (0.828, 0.854) | 91.8% | (90.5%, 93.1%) | 33 | |||
| HLQ + DVB | 0.796 | (0.777, 0.815) | 84.4% | (80.7%, 88.0%) | 4 | ||
| WLR + HLQ + DVB | (0.833, 0.855) | 92.6% | (90.0%, 95.1%) | 12 | |||
Fig. 6ROC curves on Site B dataset.
The top 12 WLR + HLQ features ranked by feature ranking strategy introduced in Section 3.5 on Site B dataset. The third and sixth columns show the Gini importance of the corresponding feature averaged in the 5-fold cross validation.
Green text indicates lobe-wise quantification features, HU1-HU4 are the four HU intervals. Blue text indicates whole lung radiomics features encoded as Filter-FeatureType-Parameter.
Comparison of different machine learning methods with selected features on Site B dataset.
| Methods | AUC | Sensitivity (PPV = 70%) | ||||
|---|---|---|---|---|---|---|
| Mean | 95% CI | Mean | 95% CI | |||
| Random Forests | 0.844 | (0.833, 0.855) | – | 92.6% | (90.0%, 95.1%) | – |
| SVM | (0.838, 0.866) | (93.2%, 97.5%) | ||||
| Logistic Regression | 0.798 | (0.783, 0.812) | (88.6%, 91.9%) | |||
| DNN (small) | (0.805, 0.858) | 86.9% | (83.5%, 90.3%) | |||
| DNN w/ all features | 0.704 | (0.670, 0.738) | 75.38% | (71.6%, 79.2%) | ||
| WD Net w/ all features | 0.769 | (0.753, 0.786) | 85.6% | (82.6%, 88.7%) | ||
The 12 best WLR + HLQ + DVB features used for the experiments in Table 10 ranked by feature ranking strategy introduced in Section 3.5. The third and sixth columns show the Gini importance of the corresponding feature averaged in the 5-fold cross validation.
Red text indicates non-imaging features. Green text indicates lobe-wise quantification features, HU1-HU4 are the four HU intervals. Blue text indicates whole lung radiomics features encoded as Filter-FeatureType-Parameter.
Comparison among the features used in exist state-of-the-art works and different combinations of the proposed features on ICU admission prediction on Site C dataset.
| Features | AUC | Sensitivity (PPV = 70%) | K | ||||
|---|---|---|---|---|---|---|---|
| Mean | 95% CI | Mean | 95% CI | ||||
| Img feature (Tang 2020) | 0.763 | (0.670, 0.856) | 85.0% | (76.4%, 93.6%) | 10 | ||
| Img feature (Zhu 2020) | 0.675 | (0.645, 0.706) | 73.9% | (59.0%, 88.8%) | 39 | ||
| DVB | 0.595 | (0.524, 0.665) | 63.3% | (36.1%, 90.5%) | 4 | ||
| HLQ | 0.691 | (0.660, 0.722) | 86.7% | (80.3%, 93.0%) | 7 | ||
| WLR | 0.815 | (0.782, 0.848) | (92.8%, 98.3%) | 12 | |||
| WLR + HLQ | (0.813, 0.839) | (94.4%, 97.8%) | – | 20 | |||
| WLR+DVB | (0.809, 0.861) | (91.6%, 98.4%) | 15 | ||||
| HLQ + DVB | 0.760 | (0.705, 0.815) | 85.6% | (77.6%, 93.5%) | 2 | ||
| WLR + HLQ + DVB | (0.804, 0.876) | – | 94.4% | (92.3%, 96.6%) | 35 | ||
Fig. 7ROC curves on Site C dataset.
The top 35 features ranked by feature ranking strategy introduced in Section 3.5 on Site C dataset. The third and sixth columns show the Gini importance of the corresponding feature averaged in the 5-fold cross validation.
Red text indicates non-imaging features. Green text indicates lobe-wise quantification features, HU1-HU4 are the four HU intervals. Blue text indicates whole lung radiomics features encoded as Filter-FeatureType-Parameter.
Comparison of different machine learning methods with selected features on Site C dataset.
| Methods | AUC | Sensitivity (PPV = 70%) | ||||
|---|---|---|---|---|---|---|
| Mean | 95% CI | Mean | 95% CI | |||
| Random Forests | (0.804, 0.876) | – | (92.3%, 96.6%) | – | ||
| SVM | 0.811 | (0.782, 0.839) | (89.2%, 97.5%) | |||
| Logistic Regression | 0.717 | (0.666, 0.768) | 86.7% | (79.3%, 94.0%) | ||
| DNN (small) | 0.695 | (0.619, 0.771) | (53.6%, 100.0%) | |||
| DNN w/ all features | 0.568 | (0.550, 0.587) | 52.8% | (29.2%, 76.4%) | ||
| WD Net w/ all features | 0.528 | (0.479, 0.578) | 34.4% | (24.1%, 44.8%) | ||
Transferring WLR + HLQ features across the three datasets.
| Methods | A → B | A → C | B → A | B → C | C → A | C → B | Mean |
|---|---|---|---|---|---|---|---|
| Random Forests | 0.740 (36) | 0.685 (29) | 0.754 (12) | 0.633 (11) | 0.591 (17) | 0.717 (20) | 0.687 |
| SVM | 0.694 (3) | 0.715 | |||||
| Logistic Regression | 0.744 (11) | 0.706 (23) | 0.756 (1) | 0.698 (1) | 0.642 (19) | 0.716 |