| Literature DB >> 34703636 |
Julius R Migriño1,2, Ani Regina U Batangan1.
Abstract
OBJECTIVE: The aim of this study was to create a decision tree model with machine learning to predict the outcomes of COVID-19 cases from data publicly available in the Philippine Department of Health (DOH) COVID Data Drop.Entities:
Mesh:
Year: 2021 PMID: 34703636 PMCID: PMC8521127 DOI: 10.5365/wpsar.2021.12.3.831
Source DB: PubMed Journal: Western Pac Surveill Response J ISSN: 2094-7321
Demographic characteristics of resolved cases (recovered or died) from the Philippines COVID Data Drop from 25 August 2020
| - | Recovered | Died | CFR (%) | |
|---|---|---|---|---|
| Sex, | - | - | - | - |
| Male | 73 919 | 1863 | 2.46 | - |
| Female | 58 477 | 1175 | 1.97 | - |
| Age, | - | - | - | < 0.001 |
| Mean age (years) | 38.05 (± 15.93) | 61.33 (± 16.73) | - | - |
| Age group (years), | - | - | - | < 0.001 |
| 0–4 | 1830 | 32 | 1.72 | - |
| 5–17 | 5563 | 26 | 0.47 | - |
| 18–29 | 37 080 | 100 | 0.27 | - |
| 30–39 | 32 632 | 147 | 0.45 | - |
| 40–49 | 22 315 | 294 | 1.30 | - |
| 50–64 | 21 907 | 995 | 4.34 | - |
| 65–74 | 6148 | 839 | 12.01 | - |
| 75–84 | 2092 | 459 | 17.99 | - |
| 3 85 | 494 | 144 | 22.57 | - |
| Region, | - | - | - | < 0.001 |
| BARMM | 455 | 11 | 2.36 | - |
| CAR | 370 | 8 | 2.12 | - |
| CARAGA | 297 | 4 | 1.33 | - |
| NCR | 74 572 | 1430 | 1.88 | - |
| Repatriate | 6586 | 15 | 0.23 | - |
| Region I: Ilocos Region | 609 | 26 | 4.09 | - |
| Region II: Cagayan Valley | 483 | 3 | 0.62 | - |
| Region III: Central Luzon | 3850 | 81 | 2.06 | - |
| Region IV-A: CALABARZON | 17 201 | 253 | 1.45 | - |
| Region IV-B: MIMAROPA | 396 | 7 | 1.74 | - |
| Region V: Bicol Region | 773 | 22 | 2.77 | - |
| Region VI: Western Visayas | 1865 | 48 | 2.51 | - |
| Region VII: Central Visayas | 16 256 | 1006 | 5.83 | - |
| Region VIII: Eastern Visayas | 1302 | 8 | 0.61 | - |
| Region IX: Zamboanga Peninsula | 889 | 37 | 4.00 | - |
| Region X: Northern Mindanao | 766 | 16 | 2.05 | - |
| Region XI: Davao Region | 1480 | 56 | 3.65 | - |
| Region XII: SOCCSKSARGEN | 429 | 4 | 0.92 | - |
BARMM: Bangsamoro Autonomous Region in Muslim Mindanao; CAR: Cordillera Administrative Region; CARAGA: Caraga Administrative Region; NCR: National Capital Region; CALABARZON: Batangas, Cavite, Laguna, Quezon, Rizal and Lucena; MIMAROPA: Mindoro, Marinduque, Romblon and Palawan; SOCCSKSARGEN: South Cotabato, Cotabato, Sultan Kudarat, Sarangani and General Santos.
Resolved cases (recovered or died) from the Philippines COVID Data Drop from 25 August 2020 by age group and sex (n = 133 097)
| Age group (years) | Males ( | Females ( | CFR ratioa | ||||
|---|---|---|---|---|---|---|---|
| Recovered | Died | CFR (%) | Recovered | Died | CFR (%) | ||
| 0–4 | 978 | 18 | 1.81 | 852 | 14 | 1.62 | 1.12 |
| 5–17 | 2848 | 12 | 0.42 | 2715 | 14 | 0.51 | 0.82 |
| 18–29 | 19 967 | 61 | 0.30 | 17 113 | 39 | 0.23 | 1.30 |
| 30–39 | 18 995 | 97 | 0.51 | 13 637 | 50 | 0.37 | 1.38 |
| 40–49 | 13 397 | 191 | 1.41 | 8918 | 103 | 1.14 | 1.24 |
| 50–64 | 12 001 | 647 | 5.12 | 9906 | 348 | 3.39 | 1.51 |
| 65–74 | 3157 | 506 | 13.81 | 2991 | 333 | 10.02 | 1.38 |
| 75–84 | 1012 | 266 | 20.81 | 1080 | 193 | 15.16 | 1.37 |
| 3 85 | 178 | 64 | 26.45 | 316 | 80 | 20.20 | 1.31 |
a CFR ratio is computed as CFR males/CFR females
Performance metrics for the two machine learning models: decision tree and naïve Bayes using the modelling data set and optimized hyperparameters
| Model | AUC | Accuracy | F-score | Sensitivity | Specificity | ||||
|---|---|---|---|---|---|---|---|---|---|
| Decision tree | 0.876 | ± 0.010 | 81.42% | ± 1.01% | 16.74% | ± 0.55% | 81.65%a | ± 1.64% | 81.41% |
| Naïve Bayes | 0.881a | ± 0.006 | 81.68%a | ± 0.05% | 16.75%a | ± 0.33% | 80.63% | ± 1.17% | 81.71%a |
a Highest values for each metric across all models
Figure 1Receiver operating characteristic (ROC) curves for the two machine learning models: decision tree and naïve Bayesa
Figure 2Decision tree for predicted outcomes of resolved cases (recovered or died) from the Philippines COVID Data Drop from 25 August 2020a