| Literature DB >> 35721825 |
Ghadeer O Ghosheh1, Bana Alamad1, Kai-Wen Yang1, Faisil Syed2, Nasir Hayat1, Imran Iqbal2, Fatima Al Kindi2, Sara Al Junaibi2, Maha Al Safi2, Raghib Ali1, Walid Zaher3, Mariam Al Harbi2, Farah E Shamout1.
Abstract
Clinical evidence suggests that some patients diagnosed with coronavirus disease 2019 (COVID-19) experience a variety of complications associated with significant morbidity, especially in severe cases during the initial spread of the pandemic. To support early interventions, we propose a machine learning system that predicts the risk of developing multiple complications. We processed data collected from 3,352 patient encounters admitted to 18 facilities between April 1 and April 30, 2020, in Abu Dhabi (AD), United Arab Emirates. Using data collected during the first 24 h of admission, we trained machine learning models to predict the risk of developing any of three complications after 24 h of admission. The complications include Secondary Bacterial Infection (SBI), Acute Kidney Injury (AKI), and Acute Respiratory Distress Syndrome (ARDS). The hospitals were grouped based on geographical proximity to assess the proposed system's learning generalizability, AD Middle region and AD Western & Eastern regions, A and B, respectively. The overall system includes a data filtering criterion, hyperparameter tuning, and model selection. In test set A, consisting of 587 patient encounters (mean age: 45.5), the system achieved a good area under the receiver operating curve (AUROC) for the prediction of SBI (0.902 AUROC), AKI (0.906 AUROC), and ARDS (0.854 AUROC). Similarly, in test set B, consisting of 225 patient encounters (mean age: 42.7), the system performed well for the prediction of SBI (0.859 AUROC), AKI (0.891 AUROC), and ARDS (0.827 AUROC). The performance results and feature importance analysis highlight the system's generalizability and interpretability. The findings illustrate how machine learning models can achieve a strong performance even when using a limited set of routine input variables. Since our proposed system is data-driven, we believe it can be easily repurposed for different outcomes considering the changes in COVID-19 variants over time.Entities:
Year: 2022 PMID: 35721825 PMCID: PMC9188985 DOI: 10.1016/j.ibmed.2022.100065
Source DB: PubMed Journal: Intell Based Med ISSN: 2666-5212
Examples of machine learning studies that aim to predict various outcomes for in-patients with confirmed COVID-19 diagnosis. We refer the readers to extensive published literature reviews [2,23,24].
| Reference | Outcome | Input Data | Models | Study Location |
|---|---|---|---|---|
| [ | Deterioration (intubation or ICU admission or mortality) | Chest X-ray images and clinical data (patient demographics, seven vital-sign variables, and 24 laboratory-test results) | Convolutional neural network for chest X-ray images and gradient boosting model for clinical data | United States |
| [ | Mortality | Five laboratory-test results | Support vector machine | United States |
| [ | Severe progression (high oxygen flow rate, mechanical ventiliation or mortality) | Chest CT scans, patient demographics, five vital-sign variables, symptoms, comorbidities, 14 laboratory-test results, and chest CT radiology report findings | Deep neural network and logistic regression | France |
| [ | Prognostication (intubation or hospital admission, or mortality) | Chest X-ray images, two vital-sign variables, and nine laboratory-test results | Convolutional neural network | United States |
| [ | Sepsis | Eight laboratory-test results | Gradient boosting model | China |
| [ | AKI | Findings of abdominal CT scans, demographics, vital signs, comorbidities, and three laboratory-test results | Logistic regression | United States |
| [ | ARDS | Demgraphics, interventions, comorbidities, 17 laboratory-test results, and eight vital signs | Gradient boosting model | United States |
Fig. 1Overview of our proposed model development approach and expected application in practice. As shown in the first row, we develop our complication-specific models by first preprocessing the data, identifying the occurrences of the complications based on the criteria shown in Table 2, training and selecting the best-performing models on the validation set, and then evaluating the performance on the test set, retrospectively. As for the application (second row), we expect our system to predict the risk of developing any of the three complications for any patient after 24 h of admission.
Criteria used to define the occurrence of complications.
| Complication | Definition | Reference |
|---|---|---|
| SBI | Positive blood, urine, throat or sputum cultures within 24 h of sample collection | |
| AKI | Based on the Kidney Disease Improving Global Guidelines (KDIGO) classification, increase in Serum Creatinine by ≥ 0.3 mg/dl within 48 h | [ |
| Increase in Serum Creatinine by ≥ to 1.5 times | ||
| Urine volume | ||
| ARDS | Based on the Berlin definition, presence of bilateral opacity in radiology reports | [ |
| Oxygenation: PaO2/FiO2 ≤ 300 mm Hg | ||
| Timing: ≤ one week | ||
| Origin: pulmonary |
Based on SEHA's clinical standards.
Urine output was not measured in our dataset because it is collected in the intensive care unit.
Fig. 2(a) The UAE map showcasing the location of the healthcare facilities included in this study. (b) Flowchart for the overall dataset showing how the inclusion and exclusion criteria were applied to obtain the final training and test sets, where n represents the number of patient encounters, and p represents the number of unique patients.
Summary of the baseline characteristics of the patient cohort in the training sets and test sets and the prevalence of the predicted complications. Note that n represents the total number of patients while % is the proportion of patients within the respective dataset.
| Training set A | Test set A | Training set B | Test set B | |
|---|---|---|---|---|
| Encounters, n | 1829 | 587 | 711 | 225 |
| Age, mean (IQR) | 41.7 (17.0) | 45.5 (18.0) | 39.3 (17.0) | 42.7 (20.0) |
| Male, n (%) | 1582 (86.5) | 522 (88.9) | 622 (87.5) | 191 (84.8) |
| Arab, n (%) | 295 (16.1) | 89 (15.2) | 120 (16.9) | 43 (19.1) |
| Non-Arab, n (%) | 1534 (83.9) | 498 (84.8) | 591 (83.1) | 182 (80.9) |
| Mortality, n (%) | 36 (2.0) | 22 (3.7) | 9 (1.3) | 3 (1.3) |
| SBI, n (%) | 92 (5.0) | 45 (7.7) | 23 (3.2) | 17 (7.6) |
| Developed within 24 h from admission, n (%) | 1 (0.1) | 3 (0.5) | 1 (0.1) | 1 (0.4) |
| Developed after 24 h from admission, n (%) | 91 (5.0) | 42 (7.2) | 22 (3.1) | 16 (7.1) |
| AKI, n (%) | 126 (6.9) | 52 (8.9) | 32 (4.5) | 16 (7.1) |
| Developed within 24 h from admission, n (%) | 28 (1.5) | 9 (1.5) | 14 (2.0) | 3 (1.3) |
| Developed after 24 h from admission, n (%) | 98 (5.4) | 43 (7.3) | 18 (2.5) | 13 (5.8) |
| ARDS, n (%) | 117 (6.4) | 57 (9.7) | 45 (6.3) | 24 (10.7) |
| Developed within 24 h from admission, n (%) | 61 (3.3) | 26 (4.4) | 23 (3.2) | 13 (5.8) |
| Developed after 24 h from admission, n (%) | 56 (3.1) | 31 (5.3) | 22 (3.1) | 11 (4.9) |
Characteristics of the variables that were used as input features to our models. The mean and interquartile ranges are shown for the demographic features, and vital-sign measurements. For the comorbidities and symptoms admission, n denotes the number of patients and % denotes the percentage of patients per the respective dataset.
| Variable, unit | Training set A | Test set A | Training set B | Test set B |
|---|---|---|---|---|
| Age | 41.7 (17.0) | 45.5 (18.0) | 39.3 (17.0) | 42.7 (20.0) |
| BMI | 26.9 (5.2) | 26.7 (5.7) | 26.5 (5.7) | 27.9 (6.2) |
| Male, n (%) | 1582 (86.5) | 522 (88.9) | 622 (87.5) | 191 (84.8) |
| Hypertension | 550 (30.1) | 213 (36.3) | 168 (23.6) | 71 (31.6) |
| Diabetes | 427 (23.3) | 221 (37.6) | 121 (17.0) | 73 (32.4) |
| Chronic kidney disease | 68 (3.7) | 30 (5.1) | 20 (2.8) | 7 (3.1) |
| Cancer | 30 (1.6) | 7 (1.2) | 12 (1.7) | 8 (3.6) |
| Cough | 851 (46.5) | 338 (57.6) | 259 (36.4) | 99 (44.0) |
| Fever | 28 (1.5) | 20 (3.4) | 3 (0.4) | 3 (1.3) |
| Shortness of breath | 190 (10.4) | 99 (16.9) | 71 (10.0) | 34 (15.1) |
| Sore throat | 238 (13.0) | 89 (15.2) | 118 (16.6) | 28 (12.4) |
| Rash | 29 (1.6) | 10 (1.7) | 15 (2.1) | 5 (2.2) |
| Systolic blood pressure, | 126.3 (15.0) | 126.8 (16.0) | 128.8 (15.5) | 128.2 (15.7) |
| Diastolic blood pressure, | 77.5 (9.8) | 76.9 (9.9) | 77.9 (10.3) | 77.5 (10.7) |
| Respiratory rate, | 18.9 (1.0) | 20.2 (2.5) | 18.1 (0.7) | 18.7 (0.8) |
| Peripheral pulse rate, | 82.6 (11.5) | 85.4 (11.6) | 81.7 (13.4) | 82.5 (12.5) |
| Oxygen saturation, | 98.4 (1.6) | 97.5 (2.1) | 98.5 (1.0) | 98.2 (1.4) |
| Temperature auxiliary, ° | 36.9 (0.4) | 37.0 (0.7) | 36.9 (0.4) | 37.1 (0.6) |
| Glasgow Coma Score | 14.8 (0.0) | 15.0 (0.0) | 14.8 (0.0) | 14.8 (0.0) |
Performance evaluation of the best performing models on test sets A & B, which were selected based on the average AUROC performance on the validation sets, as shown in Supplementary Section C. Model type indicates the type of the base learners within the final selected ensemble. All the metrics were computed using bootstrapping with 1,000 iterations [46].
| Complication | Result | Test Set A | Test Set B |
|---|---|---|---|
| SBI | Model Type | LR | LR |
| AUROC | 0.902 (0.862, 0.939) | 0.859 (0.762, 0.932) | |
| AUPRC | 0.436 (0.297, 0.609) | 0.387 (0.188, 0.623) | |
| Calibration Slope | 0.933 (0.321, 1.370) | 1.031 (−0.066, 1.550) | |
| Calibration Intercept | 0.031 (−0.111, 0.213) | 0.010 (−0.164, 0.273) | |
| AKI | Model Type | LR | LR |
| AUROC | 0.906 (0.856, 0.948) | 0.891 (0.804, 0.961) | |
| AUPRC | 0.436 (0.278, 0.631) | 0.387 (0.115, 0.679) | |
| Calibration Slope | 0.655 (0.043, 1.292) | 1.370 (−0.050, 2.232) | |
| Calibration Intercept | 0.059 (−0.136, 0.251) | −0.072 (−0.183, 0.154) | |
| ARDS | Model Type | LR | LGBM |
| AUROC | 0.854 (0.789, 0.909) | 0.827 (0.646, 0.969) | |
| AUPRC | 0.288 (0.172, 0.477) | 0.399 (0.150, 0.760) | |
| Calibration Slope | 0.598 (0.028, 1.149) | 0.742 (−0.029, 1.560) | |
| Calibration Intercept | 0.000 (−0.159, 0.164) | 0.050 (−0.166, 0.243) |
Fig. 3The (a) ROC curves, (b) PRC curves, and (c) calibration curves are shown for all model ensembles evaluated on test set A (top) and test set B (bottom). The color legend for all figures is shown on the right. The numerical values for the AUROC, AUPRC, calibration slopes and intercepts can be found in Table 5. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
Fig. 4The four most important features are shown for each complication in (a) test set A and (b) test set B. Feature importance was computed using the average SHAP values of the six models per ensemble.
Fig. 5Timeline showing the development of complications with respect to number of days from admission (x-axis) for two sample patients. (a) For [ySBI, yAKI, yARDS], our system predictions (multiplied by a 100 to obtain percentages) were [64%, 73%, 51%]. (b) This patient did not develop and complications and our model predictions were [0.2%, 0.2%, 2%].