| Literature DB >> 35281291 |
Himanshu Gupta1, Om Prakash Verma1.
Abstract
Background The COVID-19 pandemic has badly affected people of all ages globally. Therefore, its vaccine has been developed and made available for public use in unprecedented times. However, because of various levels of hesitancy, it did not have general acceptance. The main objective of this work is to identify the risk associated with the COVID-19 vaccines by developing a prognosis tool that will help in enhancing its acceptability and therefore, reducing the lethality of SARS-CoV-2.Entities:
Keywords: COVID-19; Machine Learning; Predictive Analysis; SARS-CoV-2; Statistical Analysis; VAERS
Year: 2022 PMID: 35281291 PMCID: PMC8904170 DOI: 10.1007/s12065-022-00704-3
Source DB: PubMed Journal: Evol Intell ISSN: 1864-5909
Description of cleaned VAERS dataset
| SN | Features | Description | Range | Mean | Standard Deviation |
|---|---|---|---|---|---|
| 1. | Age (A) | Age in Years | 0–119 | 49.66 | 18.31 |
| 2. | Sex (S) | Gender information (0: Females, 1: Male and 2: Unknown) | 0–2 | 0.31 | 0.51 |
| 3. | L_Threat (L) | Life-threatening illness | 0–1 | 0.02 | 0.13 |
| 4. | Hospital (H) | Hospitalized | 0–1 | 0.06 | 0.24 |
| 5. | Hospital Days (HD) | Number of days Hospitalized | 0–120 | 0.19 | 1.51 |
| 6. | Disable (Di) | Disability status | 0–1 | 0.01 | 0.12 |
| 7. | Num_Days (ND) | Number of days after vaccination | 0–456 | 4.46 | 14.15 |
| 8. | Other Medicine (OM) | Currently using any other medicine | 0–1 | 0.68 | 0.46 |
| 9. | Current Illness (C) | Illnesses at time of vaccination | 0–1 | 0.11 | 0.32 |
| 10. | Prior vaccination (P) | Any prior vaccination | 0–1 | 0.06 | 0.23 |
| 11. | Allergic History (Al) | Any known allergic history | 0–1 | 0.35 | 0.48 |
| 12. | Medical History (M) | Number of diseases | 0–16 | 0.51 | 0.99 |
| 13. | Emergency (E) | Visited emergency or not | 0–1 | 0.14 | 0.34 |
| 14. | Birth Defect (BD) | Any known birth defect | 0–1 | 0.00 | 0.02 |
| 15. | Died (D) | Died | 0–1 | 0.01 | 0.12 |
Fig. 1Impact of attributes on the COVID-19 outbreak
Fig. 2Correlation plot between different attributes of VAERS dataset
Fig. 3Layout of the proposed methodology for estimating serious complications of the SARS-CoV-2 vaccine
Developed MLP model summary
| Layer (type) | Output Shape | Number of Parameters | Activation Function |
|---|---|---|---|
| Dense | (None, 8) | 696 | relu |
| Dense | (None, 16) | 144 | selu |
| Dense | (None, 32) | 544 | selu |
| Dense | (None, 8) | 264 | relu |
| Dense | (None, 2) | 18 | relu |
| Dense | (None, 1) | 3 | sigmoid |
Fig. 4Developed MLP architecture
Hyperparameters of the developed MLP model
| SN | Hyperparameter | Description |
|---|---|---|
| Hidden Layers | 4 | |
| Hidden Layer Neurons | 16, 32, 8, 2 | |
| Learning rate | 0.01 | |
| Epochs | 1000 | |
| Activation Function | Selu, Relu, and Sigmoid |
Analysis of scenario 1
| Total = 354,451 | Died | Hospitalized | Covid positive |
|---|---|---|---|
| Died | 5,062 | 1,274 | 60 |
| Hospitalized | 1,274 | 21,926 | 209 |
| Covid positive | 60 | 209 | 3,486 |
Estimation of most significant attributes in scenario 1
| Outcome | Rank | Models | |||
|---|---|---|---|---|---|
| Chi-square | LR | RF | LGBM | ||
| Death | (1) | A | A | A | ND |
| (2) | S (M) | C | H | A | |
| (3) | HD | S (M) | S | HD | |
| (4) | C | COVID-19 | C | S (M) | |
| (5) | E | H | P | OM | |
| (6) | OM | E | HD | C | |
| (7) | Al | High Cholesterol | E | Al | |
| (8) | P | Anxiety | Al | E | |
| (9) | L | Diabetes | OM | Al | |
| (10) | Di | HD | ND | L | |
| Hospitalized | (1) | A | HD | HD | ND |
| (2) | S (M) | L | E | A | |
| (3) | HD | E | A | E | |
| (4) | E | A | Di | S (M) | |
| (5) | L | S (M) | L | L | |
| (6) | Di | Di | S (M) | C | |
| (7) | C | C | P | Al | |
| (8) | P | Kidney Disease | OM | Di | |
| (9) | ND | Heart Disease | ND | HD | |
| (10) | Al | Birth Defect | C | P | |
| COVID-19 positive | (1) | ND | Pain | ND | ND |
| (2) | Other Abnormalities | Heart Disease | Pain | A | |
| (3) | Pain | Other Abnormalities | Hyperlipidemia | Heart Disease | |
| (4) | Heart Disease | Cancer | Kidney Disease | Hyperlipidemia | |
| (5) | Dementia | Hyperlipidemia | Heart Disease | Kidney Disease | |
| (6) | Hyperlipidemia | Kidney Disease | Other Abnormalities | Cancer | |
| (7) | Kidney Disease | Diabetes | Diabetes | Pain | |
| (8) | Cancer | Atrial Fibrillation | Cancer | Diabetes | |
| (9) | Atrial Fibrillation | Dementia | Dementia | OM | |
| (10) | Diabetes | Thyroid | Anemia | Other Abnormalities | |
Outcome prediction in scenario 1
| SN | Parameter | Outcome | ML models | ||||
|---|---|---|---|---|---|---|---|
| LR | RF | NB | LGBM | MLP | |||
| 1 | Precision | Death | 0.41 | 0.06 | 0.15 | 0.51 | 0.02 |
| Hospitalized | 0.97 | 0.60 | 0.65 | 0.97 | 0.50 | ||
| COVID-19 | 0 | 0.01 | 0.13 | 0.08 | 0.01 | ||
| 2 | Recall | Death | 0.01 | 0.84 | 0.06 | 0.04 | 0.93 |
| Hospitalized | 0.71 | 0.81 | 0.30 | 0.72 | 0.85 | ||
| COVID-19 | 0 | 0.52 | 0.03 | 0.01 | 0.92 | ||
| 3 | Accuracy | Death | 0.98 | 0.80 | 0.98 | 0.99 | 0.18 |
| Hospitalized | 0.98 | 0.96 | 0.95 | 0.98 | 0.94 | ||
| COVID-19 | 0.99 | 0.71 | 0.99 | 0.99 | 0.09 | ||
| 4 | F1 score | Death | 0.03 | 0.11 | 0.09 | 0.08 | 0.03 |
| Hospitalized | 0.82 | 0.69 | 0.41 | 0.83 | 0.63 | ||
| COVID-19 | 0 | 0.03 | 0.04 | 0.01 | 0.02 | ||
Fig. 5Scenario 1: ROC and PR curve for a Death b Hospitalization c COVID-19 positive
Analysis of scenario 2
| Total = 354,451 | Died | Hospitalized | Covid positive |
|---|---|---|---|
| Died | 4,506 | 1,097 | 268 |
| Hospitalized | 1,097 | 21,926 | 1,443 |
| Covid positive | 268 | 1,443 | 5,953 |
Estimation of most influential attributes in scenario 2
| Outcome | Rank | Models | |||
|---|---|---|---|---|---|
| Chi-square | LR | RF | LGBM | ||
| Death | (1) | A | Birth Defect | A | ND |
| (2) | S (M) | Abortion Spontaneous | S (M) | A | |
| (3) | HD | Intensive Care | Headache | HD | |
| (4) | Cardiac Arrest | Loss of Consciousness | Pain | S (M) | |
| (5) | Intensive Care | L | Rash | Pain | |
| (6) | Pain | Hyperhidrosis | Dizziness | E | |
| (7) | C | Feeling Abnormal | Cardiac Arrest | OM | |
| (8) | Headache | Muscular Weakness | Chills | Al | |
| (9) | Chills | Chest pain | Injection Site Swelling | C | |
| (10) | E | Injection Site Pain | Injection Site Erythema | L | |
| Hospitalized | (1) | A | HD | E | A |
| (2) | COVID-19 | E | HD | L | |
| (3) | Intensive Care | L | L | S (M) | |
| (4) | Dyspnoea | Chest Pain | A | OM | |
| (5) | Injection Site Pain | COVID-19 | Thrombosis | E | |
| (6) | S (M) | Thrombosis | Dyspnoea | Dyspnoea | |
| (7) | Headache | Dyspnoea | Di | COVID-19 | |
| (8) | Chills | Intensive Care | S (M) | Thrombosis | |
| (9) | D | A | Injection Site Pain | Pain | |
| (10) | Condition Aggravated | S (M) | Injection Site Erythema | HD | |
| COVID-19 positive | (1) | A | Cough | Cough | ND |
| (2) | HD | Intensive Care | Rash | A | |
| (3) | Cough | Cardiac Arrest | OM | HD | |
| (4) | OM | Diarrhea | HD | C | |
| (5) | Pain | Dyspnoea | Dizziness | OM | |
| (6) | Intensive Care | Malaise | Injection Site Pain | Al | |
| (7) | Dyspnoea | OM | Injection Site Erythema | Cough | |
| (8) | Rash | C | Injection Site pruritus | Pain | |
| (9) | Dizziness | Al | Pain | S (M) | |
| (10) | Injection Site Pain | Increased Blood Pressure | Arthralgia | Pyrexia | |
Performance analysis of developed ML models in scenario 2
| SN | Parameter | Outcome | ML models | ||||
|---|---|---|---|---|---|---|---|
| LR | RF | NB | LGBM | MLP | |||
| 1 | Precision | Death | 0.84 | 0.25 | 0.73 | 0.81 | 0.61 |
| Hospitalized | 0.96 | 0.78 | 0.53 | 0.96 | 0.52 | ||
| COVID-19 | 0.66 | 0.18 | 0.50 | 0.70 | 0.19 | ||
| 2 | Recall | Death | 0.93 | 0.95 | 0.92 | 0.90 | 0.95 |
| Hospitalized | 0.73 | 0.77 | 0.48 | 0.75 | 0.88 | ||
| COVID-19 | 0.48 | 0.81 | 0.66 | 0.61 | 0.86 | ||
| 3 | Accuracy | Death | 0.99 | 0.96 | 0.99 | 0.99 | 0.99 |
| Hospitalized | 0.98 | 0.97 | 0.94 | 0.98 | 0.94 | ||
| COVID-19 | 0.99 | 0.94 | 0.98 | 0.99 | 0.94 | ||
| 4 | F1 score | Death | 0.93 | 0.39 | 0.81 | 0.85 | 0.74 |
| Hospitalized | 0.83 | 0.78 | 0.50 | 0.84 | 0.65 | ||
| COVID-19 | 0.55 | 0.30 | 0.57 | 0.65 | 0.31 | ||
Fig. 6Scenario 2: ROC and PR curve for a Death b Hospitalization c COVID-19 positive
Analysis of scenario 3
| Total = 354,451 | Died | Hospitalized | Covid positive |
|---|---|---|---|
| Died | 5.327 | 1,363 | 340 |
| Hospitalized | 1,363 | 21,926 | 1,443 |
| Covid positive | 340 | 1,443 | 5,953 |
Analysis of most dominant feature in scenario 3
| Outcome | Rank | Models | |||
|---|---|---|---|---|---|
| Chi-square | LR | RF | LGBM | ||
| Death | (1) | A | Cardiac Arrest | A | A |
| (2) | S (M) | A | H | S (M) | |
| (3) | HD | C | Pain | C | |
| (4) | Cardiac Arrest | S (M) | Headache | Pain | |
| (5) | Intensive Care | Abortion Spontaneous | S (M) | H | |
| (6) | Pain | Dyspnoea | Chills | ND | |
| (7) | C | Malaise | Rash | HD | |
| (8) | Headache | Intensive Care | HD | OM | |
| (9) | E | Chest Pain | Dizziness | E | |
| (10) | Chills | Feeling Abnormal | Injection Site Pain | Cardiac Arrest | |
| Hospitalized | (1) | A | HD | HD | HD |
| (2) | S (M) | E | E | L | |
| (3) | HD | L | L | S (M) | |
| (4) | Cardiac Arrest | Chest Pain | A | OM | |
| (5) | Intensive Care | Thrombosis | Thrombosis | Dyspnoea | |
| (6) | Headache | COVID-19 | Dyspnoea | E | |
| (7) | E | Dyspnoea | Injection Site pruritus | Al | |
| (8) | Chills | Intensive Care | Injection Site Pain | A | |
| (9) | COVID-19 | A | Di | Pain | |
| (10) | Dyspnoea | S (M) | S (M) | ND | |
| COVID-19 positive | (1) | A | Cough | Rash | Cough |
| (2) | HD | C | Cough | ND | |
| (3) | Cough | S (M) | OM | Pain | |
| (4) | OM | Dyspnoea | Dizziness | HD | |
| (5) | Pain | Malaise | Injection Site Pain | A | |
| (6) | Intensive Care | Diarrhea | Pain | OM | |
| (7) | Dyspnoea | Pyrexia | A | C | |
| (8) | Rash | A | Injection Site Erythema | S (M) | |
| (9) | Dizziness | Intensive Care | Injection Site Swelling | E | |
| (10) | Injection Site Pain | High Cholesterol | HD | Dizziness | |
Performance evaluation of developed ML models in scenario 3
| SN | Parameter | Outcome | ML models | ||||
|---|---|---|---|---|---|---|---|
| LR | RF | NB | LGBM | MLP | |||
| 1 | Precision | Death | 0.68 | 0.07 | 0.18 | 0.65 | 0.06 |
| Hospitalized | 0.96 | 0.76 | 0.54 | 0.96 | 0.49 | ||
| COVID-19 | 0.69 | 0.16 | 0.45 | 0.64 | 0.05 | ||
| 2 | Recall | Death | 0.11 | 0.86 | 0.32 | 0.17 | 0.82 |
| Hospitalized | 0.72 | 0.79 | 0.49 | 0.75 | 0.88 | ||
| COVID-19 | 0.25 | 0.88 | 0.59 | 0.49 | 0.74 | ||
| 3 | Accuracy | Death | 0.98 | 0.81 | 0.97 | 0.98 | 0.80 |
| Hospitalized | 0.98 | 0.97 | 0.94 | 0.98 | 0.94 | ||
| COVID-19 | 0.98 | 0.92 | 0.94 | 0.96 | 0.74 | ||
| 4 | F1 score | Death | 0.19 | 0.12 | 0.23 | 0.26 | 0.11 |
| Hospitalized | 0.83 | 0.78 | 0.51 | 0.84 | 0.63 | ||
| COVID-19 | 0.36 | 0.27 | 0.51 | 0.55 | 0.09 | ||
Fig. 7Scenario 3: ROC and PR curve for a Death b Hospitalization c COVID-19 positive