| Literature DB >> 35147843 |
Sara Saadatmand1, Khodakaram Salimifard2, Reza Mohammadi3, Maryam Marzban4, Ahmad Naghibzadeh-Tahami5.
Abstract
Medical oxygen is a critical element in the treatment process of COVID-19 patients which its shortage impacts the treatment process adversely. This study aims to apply machine learning (ML) to predict the requirement for oxygen-based treatment for hospitalized COVID-19 patients. In the first phase, demographic information, symptoms, and patient's background were extracted from the databases of two local hospitals in Iran, and preprocessing actions were applied. In the second step, the related features were selected. Lastly, five ML models including logistic regression (LR), random forest (RF), XGBoost, C5.0, and neural networks (NNs) were implemented and compared based on their accuracy and capability. Among the variables related to the patient's background, consuming opium due to the high rate of opium users in Iran was considered in the models. Of the 398 patients included in the study, 112 (28.14%) received oxygen-based treatment. Shortness of breath (71.42%), fever (62.5%), and cough (59.82%) had the highest frequency in patients with oxygen requirements. The most important variables for prediction were shortness of breath, cough, age, and fever. For opioid-addicted patients, in addition to the high mortality rate (23.07%), the rate of oxygen-based treatment was twice as high as non-addicted patients. XGBoost and LR obtained the highest area under the curve with values of 88.7% and 88.3%, respectively. For accuracy, LR and NNs achieved the best and same accuracy (86.42%). This approach provides a tool that accurately predicts the need for oxygen in the treatment process of COVID-19 patients and helps hospital resource management.Entities:
Keywords: COVID-19; Machine learning; Opioid addiction; Oxygen treatment; Prediction; XGBoost
Mesh:
Substances:
Year: 2022 PMID: 35147843 PMCID: PMC8832434 DOI: 10.1007/s11517-022-02519-x
Source DB: PubMed Journal: Med Biol Eng Comput ISSN: 0140-0118 Impact factor: 3.079
Fig. 1Steps of building machine learning approach for predicting oxygen-requirement treatment in hospitalized COVID-19 patients
Fig. 2Flow chart of selecting COVID-19 patients for predicting the necessity of oxygen-based treatment considering opioid and non-opioid addicted cases
Parameters’ values of five applied ML algorithms
| Algorithm | Parameter | Value/setting |
|---|---|---|
| Logistic regression | Fitting method | Iteratively reweighted least squares |
| Neural networks | Hidden layer | 1 |
| Input layer | 1 | |
| Output layer | 1 | |
| Fitting method | Entropy | |
| Maximum number of iteration | 100 | |
| Maximum number of weight | 1000 | |
| C5.0 | Boosting iterations | 10 |
| Decomposition model | Rule-based | |
| Random forest | Number of trees to grow | 200 |
| Number of variables randomly sampled | 10 | |
| Minimum size of terminal nodes | 1 | |
| XGboost | Max depth | 1 |
| Number of rounds | 150 | |
| Minimum child weight | 1 | |
| Eta | 0.3 | |
| Subsample ratio of columns | 0.8 | |
| Subsample | 0.5 |
Statistics of oxygen and non-oxygen required patients based on model variables
| Variable | All hospitalized patients ( | Non-oxygen required patients ( | Oxygen required patients ( | |
|---|---|---|---|---|
| Demographics | Age < = 18 | 34 (8.54%) | 31 (10.83%) | 3 (2.67%) |
| 19–60 | 296 (74.37%) | 227 (79.37%) | 69 (61.60%) | |
| Age > = 60 | 68 (17.08%) | 28 (9.79%) | 40 (35.71%) | |
| Sex = male | 217 (54.52%) | 154 (53.84%) | 63 (56.25%) | |
| Sex = female | 181 (45.47%) | 131 (45.80%) | 50 (44.64) | |
| Patient background | Hypothyroidism or hyperthyroidism | 28 (7.03%) | 15 (5.24%) | 13 (11.60%) |
| Diabetes | 53 (13.31%) | 22 (7.69%) | 31 (27.67%) | |
| Blood pressure | 61 (15.32%) | 25 (8.74%) | 36 (32.14%) | |
| Obesity | 15 (3.76%) | 14 (4.89%) | 1 (0.89%) | |
| Lung disease | 49 (12.31%) | 17 (5.94%) | 32 (28.57%) | |
| Cardiovascular disease | 37 (9.29%) | 11 (3.84%) | 26 (23.21%) | |
| Acute renal impairment | 5 (1.25%) | 2 (0.69%) | 3 (2.67%) | |
| Cigarettes | 18 (4.52%) | 11 (3.84%) | 7 (6.25%) | |
| Opioid addiction | 26 (6.53%) | 11 (3.84%) | 15 (13.39%) | |
| Symptoms | Fever | 197 (49.49%) | 127 (44.40%) | 70 (62.5%) |
| Cough | 145 (36.43%) | 78 (27.27%) | 67 (59.82%) | |
| Sore throat | 72 (18.09%) | 66 (23.07%) | 6 (5.35%) | |
| Runny nose | 35 (8.79%) | 24 (8.39%) | 11 (9.82%) | |
| Chest pain | 81 (20.35%) | 47 (16.43%) | 34 (30.35%) | |
| Diarrhea | 65 (16.33%) | 53 (18.53%) | 12 (10.71%) | |
| Shortness of breath | 140 (35.17%) | 60 (20.97%) | 80 (71.42%) | |
| Nausea and vomiting | 81 (20.35%) | 75 (26.22%) | 6 (5.35%) |
Fig. 3(a) Percentage of opium-addicted and non-addicted hospitalized COVID-19 patients among females and males in two local hospitals in Iran. (b) Normalized percentage of opium-addicted and non-addicted hospitalized COVID-19 patients among female and male
Fig. 4(a) Fatality percentage for opium-addicted and non-addicted hospitalized COVID-19 patients in two local hospitals in Iran. (b) Normalized Fatality percentage for opium-addicted and non-addicted hospitalized COVID-19 patients
Fig. 5Comparison of accuracy and kappa of five applied ML algorithms in train set using box plot
Fig. 6Receiver operating characteristic (ROC) curves of five applied ML models to predict the oxygen-based treatment for COVID-19 patients. Area under the curve (AUC) and confidence interval are specified for each model
Fig. 7Confusion matrices of five ML models (a-e). Each plot represents the true positive (sensitivity) and true negatives (specificity) for predicting oxygen-based treatment for hospitalized COVID-19 patients
Performance measures of the five ML models in test set
| Model | Accuracy | Kappa | Sensitivity | Specificity | Balanced accuracy |
|---|---|---|---|---|---|
| Logistic regression | 0.8642 | 0.6787 | 0.9273 | 0.7308 | 0.8290 |
| Neural networks | 0.8642 | 0.6787 | 0.9273 | 0.7308 | 0.8290 |
| Random forest | 0.8148 | 0.5427 | 0.9273 | 0.5769 | 0.7521 |
| DecisionTree-C5.0 | 0.7901 | 0.4817 | 0.9091 | 0.5385 | 0.7238 |
| XGBoost | 0.8519 | 0.6531 | 0.9091 | 0.7308 | 0.8199 |