| Literature DB >> 34250465 |
Elham Jamshidi1, Amirhossein Asgary2, Nader Tavakoli3, Alireza Zali1, Farzaneh Dastan4, Amir Daaee5, Mohammadtaghi Badakhshan6, Hadi Esmaily4, Seyed Hamid Jamaldini7, Saeid Safari1, Ehsan Bastanhagh8, Ali Maher9, Amirhesam Babajani10, Maryam Mehrazi3, Mohammad Ali Sendani Kashi11, Masoud Jamshidi12, Mohammad Hassan Sendani13, Sahand Jamal Rahi14, Nahal Mansouri15,16,17.
Abstract
Background: Early prediction of symptoms and mortality risks for COVID-19 patients would improve healthcare outcomes, allow for the appropriate distribution of healthcare resources, reduce healthcare costs, aid in vaccine prioritization and self-isolation strategies, and thus reduce the prevalence of the disease. Such publicly accessible prediction models are lacking, however.Entities:
Keywords: COVID-19; artificial intelligence; machine learning; mortality; symptom
Year: 2021 PMID: 34250465 PMCID: PMC8262614 DOI: 10.3389/frai.2021.673527
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
List of predictors. Predictor variables for mortality risk and symptom prediction of COVID-19.
| Category | Variable | Description |
|---|---|---|
| Demographic | Age | In years |
| Sex | Male or female | |
| Past/Current Medical Conditions |
| Current chemotherapy, radiotherapy, immunotherapy, bone marrow or stem cell transplantation |
| Liver disorders | Chronic hepatitis (type B or C), alcohol-related liver disease, primary biliary cirrhosis, primary sclerosing cholangitis, hemochromatosis, cirrhosis | |
| Blood disorders | Anemia (iron deficiency, thalassemia minor and major, sickle cell disease), coagulopathies (hemophilia and platelet disorders) | |
| Immune disorders | Immune deficiency (acquired immunodeficiency syndrome, treatment with steroids and immune suppressors), autoimmune disease (rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis, vasculitis). | |
| Cardiovascular disease | Congestive heart failure, cardiovascular events (myocardial infarction, stroke, angina), valvular heart disease, arrhythmia (e.g. atrial fibrillation) | |
| Kidney disorders | Chronic kidney disease (stage 3, 4, and end-stage renal disease) | |
| Respiratory disorders | Asthma, chronic obstructive pulmonary disease (emphysema and chronic bronchitis), extrinsic allergic alveolitis, cystic fibrosis, interstitial lung disease, sarcoidosis, bronchiectasis, pulmonary hypertension | |
| Neurological disorders | Epilepsy, Parkinson’s disease, motor neuron disease, cerebral palsy, dementia, multiple sclerosis | |
| Endocrine disorders | Hyperthyroidism, hypothyroidism, cushing syndrome, pheochromocytoma, thyroiditis, hyperaldosteronism | |
| Diabetes mellitus | Type 1 and type 2 diabetes, maturity onset diabetes of the young, insipidus, gestational diabetes | |
| Hypertension | Primary and secondary | |
| Psychiatric disorders (removed due to low prevalence) | Bipolar disorder, psychosis, schizophrenia, major depression disorder | |
| Thrombosis (removed due to low prevalence) | Venous thromboembolism, pulmonary thromboembolism |
Patient characteristics and symptoms. Baseline characteristics, symptoms, and death outcomes for COVID-19 patients.
| Continuous variables | |
|---|---|
| Variable | Median (±IQR) |
| Age | 52 (±29) |
| Categorical/Binary variables | |
| Variable | Count (percent) |
| Sex | |
| Male | 12,597 (53.04%) |
| Female | 11,152 (46.96%) |
| Cardiovascular disease | 2,471 (10.4%) |
| Diabetes | 2,068 (8.71%) |
| Hypertension | 2,004 (8.44%) |
| Respiratory diseases | 546 (2.3%) |
| Cancer | 477 (2.01%) |
| Kidney disorders | 416 (1.75%) |
| Neurological disorders | 264 (1.11%) |
| Immune disorders | 178 (0.75%) |
| Blood disorders | 152 (0.64%) |
| Current pregnancy | 139 (0.59%) |
| Liver disorders | 119 (0.5%) |
| Endocrine disorders | 97 (0.41%) |
| Organ or bone marrow transplant | 29 (0.12%) |
| Mental illnesses | 19 (0.08%) |
| Thrombosis | 15 (0.06%) |
| Past COVID-19 infection | 10 (0.04%) |
| Outcomes | |
| Survived | 21,309 (89.73%) |
| Dead | 2,440 (10.27%) |
| Symptoms | |
| Cough | 11,995 (50.51%) |
| Respiratory distress | 10,342 (43.55%) |
| Muscular pain or fatigue | 9,249 (38.94%) |
| Fever or chill | 8,553 (36.01%) |
| Gastrointestinal problems | 2,469 (10.4%) |
| Headache | 1,120 (4.72%) |
| Chest pain | 745 (3.14%) |
| Consciousness disorders | 698 (2.94%) |
| Loss of smell or taste | 659 (2.77%) |
| Vertigo | 501 (2.11%) |
| Sore throat | 157 (0.66%) |
| Paresis or paralysis | 121 (0.51%) |
Comparison between survived and deceased patient groups. Comparative evaluation of the characteristics of survived and deceased COVID-19 patients.
| Continuous variables | ||||
|---|---|---|---|---|
| Variable | Median in survivors (±IQR) | Median in deceased (±IQR) | F-test statistics | F-test |
| Age | 49 (±27) | 70 (±21) | 2,039.47 | <0.001 |
| Categorical/Binary variables | ||||
| Variable | Count in survivors (percent in survivors) | Count in deceased (percent in deceased) | Chi2 statistics | Chi2 |
| Sex | ||||
| Male | 11,163 (52.39%) | 1,434 (58.77%) | 16.82 | <0.001 |
| Female | 10,146 (47.61%) | 1,006 (41.23%) | 19 | <0.001 |
| Cardiovascular disease | 2,039 (9.57%) | 432 (17.7%) | 139.29 | <0.001 |
| Diabetes | 1,693 (7.94%) | 375 (15.37%) | 138.57 | <0.001 |
| Hypertension | 1,676 (7.87%) | 328 (13.44%) | 80.71 | <0.001 |
| Respiratory disorder | 462 (2.17%) | 84 (3.44%) | 15.47 | <0.001 |
| Cancer | 343 (1.61%) | 134 (5.49%) | 164.28 | <0.001 |
| Kidney disorders | 317 (1.49%) | 99 (4.06%) | 82.54 | <0.001 |
| Neurological disorders | 207 (0.97%) | 57 (2.34%) | 36.68 | <0.001 |
| Immune disorders | 152 (0.71%) | 26 (1.07%) | 3.62 | 0.057 |
| Blood disorders | 112 (0.53%) | 40 (1.64%) | 42.43 | <0.001 |
| Current pregnancy | 133 (0.62%) | 6 (0.25%) | 5.35 | 0.021 |
| Liver disorders | 101 (0.47%) | 18 (0.74%) | 3.04 | 0.081 |
| Endocrine disorders | 88 (0.41%) | 9 (0.37%) | 0.1 | 0.747 |
| Organ or bone marrow transplant | 25 (0.12%) | 4 (0.16%) | 0.39 | 0.533 |
| Psychiatric disorders | 16 (0.08%) | 3 (0.12%) | 0.63 | 0.428 |
| Thrombosis | 13 (0.06%) | 2 (0.08%) | 0.15 | 0.696 |
| Past COVID-19 infection | 10 (0.05%) | 0 (0.0%) | 1.15 | 0.285 |
Machine learning methods and hyperparameters used.
| The Mortality Prediction Model | ||
|---|---|---|
| Method | Parameter | Value(s) |
| Logistic Regression | C | 1.0 |
| Random Forest | Number of trees | 500 |
| Min. Number of samples at a leaf node | %0.1 of all samples | |
| Criterion | Gini | |
| Artificial Neural Networks | Number of layers | 3 |
| Output space dimensionality for each layer | 32, 16, 1 | |
| Activation function for each layer | Tanh, tanh, sigmoid | |
| K-Nearest Neighbors | K | 10 |
| Weight function | Distance | |
| Linear Discriminant Analysis | Solver | SVD |
| Naive Bayes | Interval size of age categories | 0.1 |
| The Symptom Prediction Model | ||
| Method | Parameter | Value |
| Logistic Regression | C | 1.0 |
| Random Forest | Number of trees | 200 |
| Min. Number of samples at a leaf node | %0.1 of all samples | |
| Criterion | Gini | |
| Artificial Neural Network | Number of layers | 4 |
| Output space’s dimensionality for each layer | 32, 32, 32, 12 | |
| Activation function for each layer | Tanh, tanh, tanh, tanh, sigmoid | |
| K-Nearest Neighbors | K | 5 |
| Weight function | Distance | |
| Linear Discriminant Analysis | Solver | SVD |
| Naive Bayes | Interval size of age categories | 0.1 |
FIGURE 1Prevalence-weighted means ROC-AUCs for different ML models. The models were used to implement the Symptom Prediction Model (SPM). Error bars denote the standard deviation over different cross-validation folds.
FIGURE 2ROC-AUCs of different ML models which were used to implement the MPM. The Random Forest (RF) model outperformed the other approaches.
FIGURE 3ROC chart for Prediction of RF models in different timelines. An indicator of model performance on validation dataset in the peak months of COVID-19 outbreak in Iran.