| Literature DB >> 35788637 |
Abhishar Sinha1, Swati Purohit Joshi2, Purnendu Sekhar Das3, Soumya Jana4, Rahuldeb Sarkar5,6.
Abstract
Outcome prediction for individual patient groups is of paramount importance in terms of selection of appropriate therapeutic options, risk communication to patients and families, and allocating resource through optimum triage. This has become even more necessary in the context of the current COVID-19 pandemic. Widening the spectrum of predictor variables by including radiological parameters alongside the usually utilized demographic, clinical and biochemical ones can facilitate building a comprehensive prediction model. Automation has the potential to build such models with applications to time-critical environments so that a clinician will be able to utilize the model outcomes in real-time decision making at bedside. We show that amalgamation of computed tomogram (CT) data with clinical parameters (CP) in generating a Machine Learning model from 302 COVID-19 patients presenting to an acute care hospital in India could prognosticate the need for invasive mechanical ventilation. Models developed from CP alone, CP and radiologist derived CT severity score and CP with automated lesion-to-lung ratio had AUC of 0.87 (95% CI 0.85-0.88), 0.89 (95% CI 0.87-0.91), and 0.91 (95% CI 0.89-0.93), respectively. We show that an operating point on the ROC can be chosen to aid clinicians in risk characterization according to the resource availability and ethical considerations. This approach can be deployed in more general settings, with appropriate calibrations, to predict outcomes of severe COVID-19 patients effectively.Entities:
Mesh:
Year: 2022 PMID: 35788637 PMCID: PMC9252998 DOI: 10.1038/s41598-022-15327-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flowchart. Top panel shows the flow chart of the project. This shows the experiment of training 3 models on different feature sets. Model-CP was trained only on clinical parameters. Model-CTSS was trained on clinical features appended with CT severity score. Model-ALLR was trained on clinical features appended with ALLR. The bottom panel is the expanded view of lung and inflammation segmentation. The lungs mask are produced by a segmentation model from CT scan. This mask is used to get the region of interest (ROI). The ROI is given as input to another segmentation model that produces the inflammation masks.
Summary of parameters in the MGMCH dataset and the number of records that have the parameter recorded.
| Parameter | Number of records with parameter value included | Summary |
|---|---|---|
| Age (in years) | 302 | 59 (40-79) |
| Sex | 302 | 75.16% male |
| Day of | 264 | 4 (3.75-5) |
| CT severity score | 302 | 15 (10-20) |
| CRP (mg/L) | 202 | 34.9 (13.77-78.5) |
| D Dimer (ng/mL) | 191 | 278 (221.5-483) |
| Ferritin ( | 159 | 343 (155.75-713.95) |
| Prevalence | ||
| Diabetes | 300 | 19.53% |
| Obesity | 300 | 16.22% |
| Hypertension | 300 | 30.13% |
| Need for oxygen | 302 | 39.74% |
| ICU admission | 302 | 21.52% |
| Need for ventilation | 302 | 11.59% required |
| Death | 302 | 5.96% mortality |
Each summary data is presented as either a percentage or a median value (interquartile range). aNumber of days passed from the symptom onset.
Validation AUCs for 50 iterations for the three models.
| Random forest | XGBoost | |||
|---|---|---|---|---|
| Mean (std) | 95% CI | Mean (std) | 95% CI | |
| Model-CP | 0.87 (0.06) | 0.85–0.88 | 0.84 (0.09) | 0.81–0.87 |
| Model-CTSS | 0.89 (0.06) | 0.87–0.91 | 0.88 (0.07) | 0.86–0.90 |
| Model-ALLR | 0.91 (0.06) | 0.89–0.93 | 0.89 (0.07) | 0.87–0.91 |
Model-CP model with clinical parameters as input, Model-CTSS model with clinical parameters and CT severity score as input, Model-ALLR model with clinical parameters and ALLR as input.
Figure 2Relative feature importance. Panel (A) shows mean feature importance of the model having CT severity score and clinical parameters as the input. Panel (B) shows mean feature importance of the model having ALLR and clinical parameters as the input. The error bars show the standard deviations.
Figure 3Mean ROC curve. The figure shows the ROC curves obtained by taking the mean of validation ROC curves over 50 iterations for predicting need for ventilation for the three models. The blue markers on the curves show the operating points where the mentioned cost function is minimized.
Confusion matrices of predicting need for ventilation for validation sets of 50 iterations.
| Model-CP | Model-CTSS | Model-ALLR | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Pred -ve | Pred +ve | std | Pred -ve | Pred +ve | std | Pred -ve | Pred +ve | std | |
|
| |||||||||
| Actual -ve | 47.66 (0.882) | 6.34 (0.118) | 3.94 (0.07) | 47.34 (0.877) | 6.66 (0.123) | 4.60 (0.08) | 47.74 (0.884) | 6.26 (0.116) | 3.54 (0.06) |
| Actual +ve | 1.94 (0.28) | 5.06 (0.72) | 1.40 (0.2) | 1.68 (0.24) | 5.32 (0.76) | 1.08 (0.15) | 1.44 (0.21) | 5.56 (0.79) | 1.15 (0.16) |
|
| |||||||||
| Actual -ve | 43.34 (0.80) | 10.66 (0.20) | 5.35 (0.1) | 42.9 (0.79) | 11.1 (0.21) | 5.76 (0.1) | 44.16 (0.82) | 9.84 (0.18) | 5.05 (0.09) |
| Actual +ve | 1.18 (0.17) | 5.82 (0.83) | 1.07 (0.15) | 0.9 (0.13) | 6.1 (0.87) | 0.96 (0.14) | 0.74 (0.11) | 6.26 (0.89) | 0.87 (0.12) |
|
| |||||||||
| Actual -ve | 35.66 (0.66) | 18.34 (0.34) | 12.87 (0.2) | 38.94 (0.72) | 15.06 (0.28) | 6.52 (0.12) | 39.36 (0.73) | 14.64 (0.27) | 8.81 (0.16) |
| Actual +ve | 0.56 (0.08) | 6.44 (0.92) | 0.64 (0.09) | 0.5 (0.07) | 6.5 (0.93) | 0.70 (0.1) | 0.3 (0.04) | 6.7 (0.96) | 0.54 (0.07) |
The values outside the parenthesis show the number of records and the values in parenthesis are the normalized values. The confusion matrices were calculated using thresholds that minimize the cost function mentioned in the first column.