| Literature DB >> 35418698 |
Daniel Gourdeau1,2, Olivier Potvin3, Jason Henry Biem4, Florence Cloutier5,6, Lyna Abrougui6, Patrick Archambault5,6,7, Carl Chartrand-Lefebvre8, Louis Dieumegarde3, Christian Gagné9, Louis Gagnon4, Raphaelle Giguère6, Alexandre Hains9, Huy Le10,11, Simon Lemieux4,12, Marie-Hélène Lévesque4,12, Simon Nepveu8, Lorne Rosenbloom10,11, An Tang8, Issac Yang10,11, Nathalie Duchesne4,13, Simon Duchesne3,4.
Abstract
The COVID-19 pandemic repeatedly overwhelms healthcare systems capacity and forced the development and implementation of triage guidelines in ICU for scarce resources (e.g. mechanical ventilation). These guidelines were often based on known risk factors for COVID-19. It is proposed that image data, specifically bedside computed X-ray (CXR), provide additional predictive information on mortality following mechanical ventilation that can be incorporated in the guidelines. Deep transfer learning was used to extract convolutional features from a systematically collected, multi-institutional dataset of COVID-19 ICU patients. A model predicting outcome of mechanical ventilation (remission or mortality) was trained on the extracted features and compared to a model based on known, aggregated risk factors. The model reached a 0.702 area under the curve (95% CI 0.707-0.694) at predicting mechanical ventilation outcome from pre-intubation CXRs, higher than the risk factor model. Combining imaging data and risk factors increased model performance to 0.743 AUC (95% CI 0.746-0.732). Additionally, a post-hoc analysis showed an increase performance on high-quality than low-quality CXRs, suggesting that using only high-quality images would result in an even stronger model.Entities:
Mesh:
Year: 2022 PMID: 35418698 PMCID: PMC9007057 DOI: 10.1038/s41598-022-10136-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Training and testing set statistics. The * denotes a comorbidity where information about the comorbidity was not available for a single patient.
Figure 2Study flowchart. The CheXpert training set is used to pre-train a general CXR feature extractor that is subsequently used on COVID patients. The CheXpert test set is not publicly available, and results were reported on the validation set.
Figure 3Distribution of the mortality score in the train and test split of the dataset. The split is done using stratified sampling (on the basis of the mortality score) to limit small-sample variance. Deaths and survivals were evenly split between training and testing.
Figure 4Cross-validated performances of the developed models. The best-performing classifier for the imaging features was a support vector classifier using 2 features with a linear kernel, achieving a 0.88 AUC. The mortality score model obtains a 0.62 AUC with a logistic regression model. There are 152 positive samples and 126 negative samples in the validation set.
Figure 5Testing performances of the developed models. The model using the deep imaging features performed worse on the testing set than on the validation set (0.70 compared to 0.88), pointing to some overfitting. In contrast, the mortality score model similarly (0.66 compared to 0.62). Combining the imaging features with the mortality score yielded a slightly higher AUC of 0.74, using the same SVM classifier. There are 145 positive samples (survival) and 39 negative samples (Deaths) in the testing set.
Figure 6Testing performances of the imaging features on low and high-quality images. The model’s prediction on high-quality images (Score of 6-7) have a slightly higher AUC than on lower quality images.
Binary classification performances of the proposed models on the testing set.
| Model | Accuracy (%) | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) |
|---|---|---|---|---|---|
| Imaging features | 75.0 | 51.3 | 81.4 | 43.6 | 86.1 |
| Mortality Score | 67.4 | 51.3 | 71.7 | 32.8 | 85.6 |
| Combined | 75.5 | 48.7 | 82.8 | 43.3 | 85.7 |
Figure 7Average patient trajectory in the testing set by outcome. The cumulative sum of the imaging model predictions over repeated CXRs enabled the tracking over time, post-intubation, of the potential recovery of patients. The shaded area represents a standard deviation, with discontinuities in the curve when the number of patients change. The curves are stopped when only three patients are left in each outcome group.