| Literature DB >> 34993693 |
Piergiuseppe Liuzzi1,2, Silvia Campagnini3,4, Chiara Fanciullacci2, Chiara Arienti5, Michele Patrini5, Maria Chiara Carrozza1, Andrea Mannini1,2.
Abstract
COVID-19 cases are increasing around the globe with almost 5 million of deaths. We propose here a deep learning model capable of predicting the duration of the infection by means of information available at hospital admission. A total of 222 patients were enrolled in our observational study. Anagraphical and anamnestic data, COVID-19 signs and symptoms, COVID-19 therapy, hematochemical test results, and prior therapies administered to patients are used as predictors. A set of 55 features, all of which can be taken in the first hours of the patient's hospitalization, was considered. Different solutions were compared achieving the best performance with a sequential convolutional neural network-based model merged in an ensemble with two different meta-learners linked in cascade. We obtained a median absolute error of 2.7 days (IQR = 3.0) in predicting the duration of the infection; the error was equally distributed in the infection duration range. This tool could preemptively give an outlook of the COVID-19 patients' expected path and the associated hospitalization effort. The proposed solution could be viable in tackling the huge burden and the logistics complexity of hospitals or rehabilitation centers during the pandemic waves. With data taken ad admission, entering a PCA-based feature selection, a k-fold cross-validated CNN-based model was implemented. After external texting, a median absolute error of 2.7 days [IQR = 3 days].Entities:
Keywords: Artificial intelligence; COVID-19; Convolutional neural network; Duration of infection; Prognostic models; Rehabilitation
Mesh:
Year: 2022 PMID: 34993693 PMCID: PMC8739354 DOI: 10.1007/s11517-021-02479-8
Source DB: PubMed Journal: Med Biol Eng Comput ISSN: 0140-0118 Impact factor: 3.079
Fig. 1Distribution of the age with respect to infection duration and their respective box-plots
Predictors used in the final model, before entering the PCA analysis. aCumulative Illness Rating Scale (CIRS), bCOVID-19 therapy prescribed prior to admission. Numerical features are italicized while categorical features are reported with regular font
| Data group | Feature name | Median and IQR (numerical) or relative positive frequency (binary) |
|---|---|---|
| Anagraphical data (3) | ||
| Sex [1 female] | 45.79% | |
| RSA [1 if patients comes from residential care unit] | 0.52% | |
| Admission clinical scales (3) | ||
| Admission signs and symptoms (2) | Fever | 58.42% |
| Dyspnea | 44.74% | |
| Admission supports (8) | Invasive mechanical ventilation (IMV) | 36.84% |
| O2 therapy | 56.32% | |
| IMV or O2 therapy | 62.22% | |
| Extracorporeal membrane oxygenation (ECMO) | 1.58% | |
| Urinary catheter | 42.63% | |
| Tracheal cannulation | 13.68% | |
| Artificial alimentation | 14.21% | |
| Venous cannulation | 33.68% | |
| COVID-19 therapyb (17) | Favipiravir | 8.42% |
| Avigan | 7.59% | |
| Tocilizumab | 2.11% | |
| Remdesivir | 37.89% | |
| Lopinavir-ritonavir association | 20% | |
| Darunavir | 65.79% | |
| Cobicistat | 65.79% | |
| Ruxolitinib | 0.52% | |
| Ribavirin | 1.05% | |
| Hydroxychloroquine | 40.52% | |
| Azithromycin | 0.52% | |
| Colchicine | 3.16% | |
| Heparin | 66.32% | |
| Enoxaparin sodium | 0.52% | |
| Baricitinib | 36.84% | |
| Corticosteroids | 62.11% | |
| Other antibiotics different from azithromycin | 27.90% | |
| Therapy prior to COVID-19 (17) | ACE inhibitors | 11.05% |
| Sartans | 7.37% | |
| Antimineralocorticoid | 11.58% | |
| Antiplatelet | 19.47% | |
| Anticoagulant | 35.79% | |
| Statin | 26.84% | |
| Beta-blockers | 26.32% | |
| Calcium channel blockers | 1.05% | |
| Amiodarone | 1.05% | |
| Non-steroidal anti-inflammatory drug | 3.16% | |
| Steroid therapy | 1.58% | |
| Levodopa | 18.42% | |
| Immunosuppression | 0% | |
| Anxiolytic-antidepressant | 33.68% | |
| Proton-pump inhibitor | 6.84% | |
| Vitamines | 3.68% | |
| Other therapies | 3.16% | |
| Hematochemics (5) | ||
Fig. 2Panel (a): CNN-core model and its integration in CNN-MLP and CNN-LR ensembles involving MLP and LR meta-learners (green and blue boxes respectively). Panel (b): pseudo-code for the training of the metalearners. Panel (c): pseudo-code of the validation and testing phase
Grid values for the optimization of the ridge linear regression (A), random forest (B), and convolutional NN (C). Subscripts refer respectively to the FCL layers (for the number of neurons) and to the convolutional layers (for the number of filters). The output FCL layer is a single-output neuron, being this a single-output regression
| Optimized variable | Search range | Best A |
|---|---|---|
| [0–10] with step 0.1 | 1.1 | |
| [1,5,50,100] | 5 | |
| [1,5,10,15,20] | 20 | |
| [0.00001, 0.0001,0.01,0.1] | 0.001 | |
| [32,64,128,256,512] | 256 | |
| [8,32,64,128] | 128 | |
| [16,32,64] | 32 | |
| [12,32,64] | 64 | |
| [5,10,20,50] | 5 |
Fig. 3Graphical representation of the infection duration versus significant variables. Results from the CIRS severity and comorbidity index correlation with infection duration are reported in the upper panels while group comparisons are reported in the middle and lower panels
Fig. 4Box plot of the absolute error (days). The CNN result is referred to the CNN-core, while the CNN after voting refers to the model after being combined in the ensemble/meta learning step and the voting procedure (with no detrending). The CNN detrended + voting plot corresponds to the final result with detrending, preceding ensembling, and voting (absolute test error calculated on ypred after voting between CNN-MLP and CNN-LR)
Fig. 5Left panel: Scatter plot of predicted infection duration with respect to the real value. Values obtained by aggregating together the output of 10 different runs of the procedure with fixed hyper-parameters. Results obtained with three different methods. Right panel: Absolute prediction error (calculated from the 10 different run aggregated together) of the best performing solution (green dots in the left panel)
Summary of literature findings on predicting COVID-19 length of stay compared with our solution
| Training and validation (# patients) | Testing (# patients) | Outcome | Results | |
|---|---|---|---|---|
| Nemati et al. [ | 1182 | – | Discharge-time probability (survival analysis) | Discharge probability = 1 after ~ 27 days. C-index from Stagewise GB = 71.47% |
| Qi et al. [ | 31 | – | Short- and long-term hospital stay (≤ 10 days) | Data at admission, AUC = 0.97 (95% CI 0.83–1) |
| Ebinger et al. [ | 772 | 193 | Short- and long-term hospital stay (≤ 8 days) | Models trained on hospital day 1–2-3. Increasing accuracies over time with an accuracy of 0.765 (AUC = 0.819) if trained on day 3 |
| Chiari et al. [ | 524 | 132 | Length of stay | Models trained on hospital day 2–4-6–8. Best results trained after 8 hospitalization days with a mean absolute error of 4.11 days |
| Setti et al. [ | 62 | 25 | Length of stay, post-COVID rehabilitation | Data taken in the first week from admission to rehabilitation, median test error of 7.04 days [IQR = 10.7] |