| Literature DB >> 33521226 |
Mohammad Pourhomayoun1, Mahdi Shakibi1.
Abstract
In the wake of COVID-19 disease, caused by the SARS-CoV-2 virus, we designed and developed a predictive model based on Artificial Intelligence (AI) and Machine Learning algorithms to determine the health risk and predict the mortality risk of patients with COVID-19. In this study, we used a dataset of more than 2,670,000 laboratory-confirmed COVID-19 patients from 146 countries around the world including 307,382 labeled samples. This study proposes an AI model to help hospitals and medical facilities decide who needs to get attention first, who has higher priority to be hospitalized, triage patients when the system is overwhelmed by overcrowding, and eliminate delays in providing the necessary care. The results demonstrate 89.98% overall accuracy in predicting the mortality rate. We used several machine learning algorithms including Support Vector Machine (SVM), Artificial Neural Networks, Random Forest, Decision Tree, Logistic Regression, and K-Nearest Neighbor (KNN) to predict the mortality rate in patients with COVID-19. In this study, the most alarming symptoms and features were also identified. Finally, we used a separate dataset of COVID-19 patients to evaluate our developed model accuracy, and used confusion matrix to make an in-depth analysis of our classifiers and calculate the sensitivity and specificity of our model.Entities:
Keywords: COVID-19; Coronavirus; Data analytics; Machine learning; Predictive analytics
Year: 2021 PMID: 33521226 PMCID: PMC7832156 DOI: 10.1016/j.smhl.2020.100178
Source DB: PubMed Journal: Smart Health (Amst) ISSN: 2352-6483
Fig. 1High-level system architecture.
Fig. 2Feature Selection: (a)Wrapper method, (b)Filter method.
The list of features used in the machine learning algorithm.
| Symptoms | o anorexia | o fever | o shortness of breath |
| o chest pain | o gasp | o somnolence | |
| o chills | o headache | o sore throat | |
| o conjunctivitis | o kidney failure | o sputum | |
| o cough | o lesions on chest radiographs | o septic shock | |
| o diarrhea | o hypertension | o Heart attack | |
| o dizziness | o Myalgia | o cold | |
| o dyspnea | o obnubilation | o cardiac disease | |
| o emesis | o pneumonia | o hypoxia | |
| o expectoration | o myelofibrosis | o fatigue | |
| o eye irritation | o respiratory distress | o rhinorrhea | |
| o diabetes | o COPD | o coronary heart disease | |
| o hypertensio | o Parkinson's disease | o prostate hypertrophy | |
| o chronic kidney disease | o asthma | o Tuberculosis | |
| o hypothyroidism | o cancer | o hepatitis B | |
| o cerebral infarction | o HIV positive | o chronic bronchitis | |
| o cardiac disease | o dyslipidemia | o any chronic disease | |
| o age | o country | o province | |
| o gender | o city | o travel history |
Fig. 3(a) Correlation heatmap for the most correlated features to the mortality risk: (a-I) Chronic diseases (pre-existing conditions); (a-II) Symptoms. (b) Correlation heatmap for the correlation between features: (b-I) Chronic diseases (pre-existing conditions); (b-II) Symptoms.
The accuracy of mortality prediction in patients with COVID-19 using 10-fold cross-validation.
Fig. 4ROC Curve comparison for all algorithms.
Fig. 5Neural Network confusion matrix for mortality prediction.
Fig. 6Sample results for predicting the risk of mortality (the probability of death).