| Literature DB >> 35948797 |
Zurki Ibrahim1, Pinar Tulay1, Jazuli Abdullahi2.
Abstract
Coronavirus disease 2019 (COVID-19) has produced a global pandemic, which has devastating effects on health, economy and social interactions. Despite the less contraction and spread of COVID-19 in Africa compared to some other continents in the world, Africa remains amongst the most vulnerable regions due to less technology and unequipped or poor health system. Recent happenings showed that COVID-19 may stay for years owing to the discoveries of new variants (such as Omicron) and new wave of infections in several countries. Therefore, accurate prediction of new cases is vital to make informed decisions and in evaluating the measures that should be implemented. Studies on COVID-19 prediction are limited in Africa despite the risks and dangers that the virus possessed. Hence, this study was performed to predict daily COVID-19 cases in 10 African countries spread across the north, south, east, west and central Africa considering countries with few and large number of daily COVID-19 cases. Machine learning (ML) models due to their nonlinearity and accurate prediction capabilities were employed for this purpose, including artificial neural network (ANN), adaptive neuro-fuzzy inference system (ANFIS), support vector machine (SVM) and conventional multiple linear regression (MLR) models. As any other natural process, the COVID-19 pandemic may contain both linear and nonlinear aspects. In such circumstances, neither nonlinear (ML) nor linear (MLR) models could be sufficient; hence, combining both ML and MLR models may produce better accuracy. Consequently, to improve the prediction efficiency of the ML models, novel ensemble approaches including ANN-E and SVM-E were employed. The advantage of using ensemble approaches is that they provide collective benefits of all the standalone models, thereby reducing their weaknesses and enhancing their prediction capabilities. The obtained results showed that ANFIS led to better prediction performance with MAD = 0.0106, MSE = 0.0003, RMSE = 0.0185 and R2 = 0.9059 in the validation step. The results of the proposed ensemble approaches demonstrated very high improvements in predicting the COVID-19 pandemic in Africa with MAD = 0.0073, MSE = 0.0002, RMSE = 0.0155 and R2 = 0.9616. The ANN-E improved the standalone models performance in the validation step up to 10%, 14%, 42%, 6%, 83%, 11%, 7%, 5%, 7% and 31% for Morocco, Sudan, Namibia, South Africa, Uganda, Rwanda, Nigeria, Senegal, Gabon and Cameroon, respectively. This study results offer a solid foundation in the application of ensemble approaches for predicting COVID-19 pandemic across all regions and countries in the world.Entities:
Keywords: Africa; COVID-19; Ensemble approaches; Machine learning; Modelling; Pandemic
Year: 2022 PMID: 35948797 PMCID: PMC9365685 DOI: 10.1007/s11356-022-22373-6
Source DB: PubMed Journal: Environ Sci Pollut Res Int ISSN: 0944-1344 Impact factor: 5.190
Fig. 1Location of the study countries in Africa
Statistical description of the daily-confirmed COVID-19 cases in some African countries
| Country | Period | Minimum | Maximum | Mean | St. deviation |
|---|---|---|---|---|---|
| Morocco | 1/3/2020–16/12/2021 | 0 | 12,039 | 1453 | 2095 |
| Sudan | 1/3/2020–16/12/2021 | 0 | 1215 | 69 | 101 |
| Uganda | 1/3/2020–16/12/2021 | 0 | 20,692 | 196 | 852 |
| Rwanda | 1/3/2020–16/12/2021 | 0 | 3141 | 154 | 325 |
| Nigeria | 1/3/2020–16/12/2021 | 0 | 3402 | 336 | 413 |
| Senegal | 1/3/2020–16/12/2021 | 0 | 1722 | 113 | 178 |
| Namibia | 1/3/2020–16/12/2021 | 0 | 3937 | 205 | 413 |
| South Africa | 1/3/2020–16/12/2021 | 0 | 37,875 | 4925 | 5677 |
| Gabon | 1/3/2020–16/12/2021 | 0 | 640 | 57 | 117 |
| Cameroon | 1/3/2020–16/12/2021 | 0 | 8681 | 164 | 689 |
Fig. 2Time series plots for the daily-confirmed cases for all countries
Fig. 3The k-fold validation used in the study
Cumulative cases, validation and data partitioning
| Country | Cumulative cases | No. of observation | Training sample (75%) | Validation sample (25%) | Validation type |
|---|---|---|---|---|---|
| Morocco | 951,763 | 656 | 492 | 164 | k-fold |
| Sudan | 45,112 | 656 | 492 | 164 | k-fold |
| Uganda | 128,369 | 656 | 492 | 164 | k-fold |
| Rwanda | 100,978 | 656 | 492 | 164 | k-fold |
| Nigeria | 220,109 | 656 | 492 | 164 | k-fold |
| Senegal | 74,105 | 656 | 492 | 164 | k-fold |
| Namibia | 134,160 | 656 | 492 | 164 | k-fold |
| South Africa | 3,231,039 | 656 | 492 | 164 | k-fold |
| Gabon | 37,681 | 656 | 492 | 164 | k-fold |
| Cameroon | 107,662 | 656 | 492 | 164 | k-fold |
Fig. 4COVID-19 studies carried out for world countries based on artificial intelligence
Fig. 5COVID-19 studies based on author keywords
Fig. 6The general ensemble procedure applied
Fig. 7The proposed ANN-E approach applied
Fig. 8The proposed SVM-E approach applied
Fig. 9The applied methodological approach of the study
Input variables selected for the study countries
| Country | Inputs | |
|---|---|---|
| North Africa | Morocco | |
| Sudan | ||
| South Africa | Namibia | |
| South Africa | ||
| East Africa | Uganda | |
| Rwanda | ||
| West Africa | Nigeria | |
| Senegal | ||
| Central Africa | Gabon | |
| Cameroon |
Results of the applied models for North Africa
| Training | Validation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Country | Model | MAD | MSE | RMSE |
| MAD | MSE | RMSE |
|
| Morocco | ANN | 0.0699 | 0.0122 | 0.1106 | 0.8486 | 0.0336 | 0.0019 | 0.0436 | 0.8302 |
| ANFIS | 0.0321 | 0.0035 | 0.0591 | 0.9515 | 0.0204 | 0.0011 | 0.0326 | 0.9154 | |
| SVM | 0.0423 | 0.0059 | 0.0767 | 0.9347 | 0.0185 | 0.0008 | 0.0287 | 0.9185 | |
| MLR | 0.0604 | 0.0115 | 0.0107 | 0.9078 | 0.0208 | 0.0001 | 0.0341 | 0.8405 | |
| Sudan | ANN | 0.0369 | 0.0032 | 0.0564 | 0.6000 | 0.0353 | 0.0028 | 0.0564 | 0.4854 |
| ANFIS | 0.0306 | 0.0029 | 0.0536 | 0.8315 | 0.0213 | 0.0012 | 0.0345 | 0.5343 | |
| SVM | 0.0299 | 0.0041 | 0.0642 | 0.5929 | 0.0242 | 0.0029 | 0.0537 | 0.3330 | |
| MLR | 0.0374 | 0.0048 | 0.0691 | 0.3248 | 0.0443 | 0.0055 | 0.0740 | 0.1135 | |
Results of the applied models for East Africa
| Training | Validation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Country | Model | MAD | MSE | RMSE |
| MAD | MSE | RMSE |
|
| Uganda | ANN | 0.0078 | 0.0002 | 0.0126 | 0.3552 | 0.0141 | 0.0060 | 0.0777 | -0.0025 |
| ANFIS | 0.0064 | 0.0001 | 0.0113 | 0.4807 | 0.0181 | 0.0056 | 0.0750 | 0.0650 | |
| SVM | 0.0036 | 0.0001 | 0.0090 | 0.6682 | 0.0080 | 0.0060 | 0.0774 | 0.0048 | |
| MLR | 0.0081 | 0.0002 | 0.0134 | 0.2712 | 0.0144 | 0.0060 | 0.0775 | 0.0015 | |
| Rwanda | ANN | 0.0319 | 0.0072 | 0.0851 | 0.9232 | 0.0112 | 0.0003 | 0.0182 | 0.7012 |
| ANFIS | 0.0233 | 0.0023 | 0.0478 | 0.9205 | 0.0106 | 0.0003 | 0.0185 | 0.9059 | |
| SVM | 0.0432 | 0.0110 | 0.1006 | 0.5824 | 0.0150 | 0.0020 | 0.0446 | 0.5366 | |
| MLR | 0.0485 | 0.0094 | 0.0970 | 0.6417 | 0.0166 | 0.0015 | 0.0392 | 0.6119 | |
Results of the applied models for West Africa
| Training | Validation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Country | Model | MAD | MSE | RMSE |
| MAD | MSE | RMSE |
|
| Nigeria | ANN | 0.0374 | 0.0035 | 0.0592 | 0.7964 | 0.0295 | 0.0031 | 0.0560 | 0.7206 |
| ANFIS | 0.0330 | 0.0029 | 0.0537 | 0.8958 | 0.0247 | 0.0016 | 0.0400 | 0.7699 | |
| SVM | 0.0387 | 0.0071 | 0.0844 | 0.7947 | 0.0288 | 0.0032 | 0.0562 | 0.4316 | |
| MLR | 0.0514 | 0.0088 | 0.0936 | 0.7312 | 0.0351 | 0.0041 | 0.0643 | 0.3015 | |
| Senegal | ANN | 0.0284 | 0.0055 | 0.0737 | 0.8334 | 0.0195 | 0.0009 | 0.0308 | 0.6285 |
| ANFIS | 0.0189 | 0.0017 | 0.0408 | 0.9492 | 0.0187 | 0.0008 | 0.0287 | 0.6765 | |
| SVM | 0.0335 | 0.0063 | 0.0791 | 0.8089 | 0.0185 | 0.0009 | 0.0292 | 0.6563 | |
| MLR | 0.0363 | 0.0061 | 0.0783 | 0.8128 | 0.0187 | 0.0009 | 0.0292 | 0.6647 | |
Results of the applied models for South Africa
| Training | Validation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Country | Model | MAD | MSE | RMSE |
| MAD | MSE | RMSE |
|
| Namibia | ANN | 0.0505 | 0.0076 | 0.0870 | 0.5786 | 0.0394 | 0.0044 | 0.0667 | 0.3827 |
| ANFIS | 0.0255 | 0.0024 | 0.0488 | 0.8884 | 0.0183 | 0.0012 | 0.0343 | 0.8059 | |
| SVM | 0.0406 | 0.0093 | 0.0965 | 0.5297 | 0.0255 | 0.0050 | 0.0705 | 0.2400 | |
| MLR | 0.0477 | 0.0091 | 0.0955 | 0.5467 | 0.0259 | 0.0048 | 0.0692 | 0.2556 | |
| South Africa | ANN | 0.0351 | 0.0050 | 0.0706 | 0.8924 | 0.0224 | 0.0018 | 0.0427 | 0.8553 |
| ANFIS | 0.0364 | 0.0040 | 0.0630 | 0.9355 | 0.0195 | 0.0011 | 0.0331 | 0.8846 | |
| SVM | 0.0375 | 0.0063 | 0.0796 | 0.9110 | 0.0209 | 0.0015 | 0.0388 | 0.8160 | |
| MLR | 0.0523 | 0.0096 | 0.0977 | 0.8689 | 0.0240 | 0.0022 | 0.0471 | 0.7225 | |
Results of the applied models for Central Africa
| Training | Validation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Country | Model | MAD | MSE | RMSE |
| MAD | MSE | RMSE |
|
| Gabon | ANN | 0.0567 | 0.0106 | 0.1006 | 0.8575 | 0.0447 | 0.0079 | 0.0888 | 0.5866 |
| ANFIS | 0.0490 | 0.0077 | 0.0878 | 0.9007 | 0.0411 | 0.0055 | 0.0741 | 0.6983 | |
| SVM | 0.0490 | 0.0120 | 0.1097 | 0.8142 | 0.0441 | 0.0103 | 0.1014 | 0.5289 | |
| MLR | 0.0835 | 0.0209 | 0.1193 | 0.6228 | 0.0700 | 0.0142 | 0.1445 | 0.4429 | |
| Cameroon | ANN | 0.0037 | 0.0006 | 0.0249 | 0.9042 | 0.0085 | 0.0022 | 0.0465 | 0.6531 |
| ANFIS | 0.0097 | 0.0005 | 0.0220 | 0.9225 | 0.0080 | 0.0012 | 0.0341 | 0.8200 | |
| SVM | 0.0089 | 0.0011 | 0.0339 | 0.8230 | 0.0104 | 0.0013 | 0.0354 | 0.7987 | |
| MLR | 0.0091 | 0.0011 | 0.0338 | 0.8231 | 0.0114 | 0.0012 | 0.0349 | 0.8041 | |
Fig. 10Performance comparison of the individual models based on R2 for (a) ANN, (b) ANFIS, (c) SVM and (d) MLR
Fig. 11Observed versus predicted daily-confirmed COVID-19 cases in the validation step for (a) complete dataset. (b) Zoom view
Results of the applied ensemble models based on ANN-E
| Training | Validation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Region | Country | MAD | MSE | RMSE | MAD | MSE | RMSE | ||
| North Africa | Morocco | 0.0289 | 0.0025 | 0.0500 | 0.9653 | 0.0206 | 0.0009 | 0.0300 | 0.9282 |
| Sudan | 0.0210 | 0.0012 | 0.0343 | 0.8338 | 0.0265 | 0.0023 | 0.0478 | 0.6299 | |
| South Africa | Namibia | 0.0277 | 0.0025 | 0.0497 | 0.8968 | 0.0204 | 0.0011 | 0.0330 | 0.7980 |
| South Africa | 0.0276 | 0.0027 | 0.0519 | 0.9366 | 0.0203 | 0.0011 | 0.0328 | 0.9219 | |
| East Africa | Uganda | 0.0032 | 0.0000 | 0.0050 | 0.9958 | 0.0031 | 0.0000 | 0.0064 | 0.8314 |
| Rwanda | 0.0265 | 0.0047 | 0.0686 | 0.9207 | 0.0111 | 0.0003 | 0.0185 | 0.8059 | |
| West Africa | Nigeria | 0.0248 | 0.0014 | 0.0368 | 0.9118 | 0.0315 | 0.0026 | 0.0510 | 0.7926 |
| Senegal | 0.0168 | 0.0016 | 0.0394 | 0.9526 | 0.0185 | 0.0008 | 0.0288 | 0.6756 | |
| Central Africa | Gabon | 0.0485 | 0.0088 | 0.0939 | 0.8954 | 0.0404 | 0.0058 | 0.0761 | 0.6550 |
| Cameroon | 0.0042 | 0.0002 | 0.0145 | 0.9674 | 0.0073 | 0.0002 | 0.0155 | 0.9616 | |
Results of the applied ensemble models based on SVM-E
| Training | Validation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Region | Country | MAD | MSE | RMSE | MAD | MSE | RMSE | ||
| North Africa | Morocco | 0.0299 | 0.0028 | 0.0527 | 0.9615 | 0.0202 | 0.0009 | 0.0302 | 0.9276 |
| Sudan | 0.0251 | 0.0024 | 0.0488 | 0.8408 | 0.0208 | 0.0011 | 0.0336 | 0.6142 | |
| South Africa | Namibia | 0.0277 | 0.0025 | 0.0498 | 0.8978 | 0.0181 | 0.0011 | 0.0328 | 0.7980 |
| South Africa | 0.0289 | 0.0028 | 0.0531 | 0.9383 | 0.0206 | 0.0010 | 0.0323 | 0.9180 | |
| East Africa | Uganda | 0.0058 | 0.0002 | 0.0146 | 0.9645 | 0.0040 | 0.0001 | 0.0087 | 0.6943 |
| Rwanda | 0.0295 | 0.0020 | 0.0452 | 0.9157 | 0.0204 | 0.0008 | 0.0283 | 0.8139 | |
| West Africa | Nigeria | 0.0331 | 0.0027 | 0.0520 | 0.8930 | 0.0286 | 0.0016 | 0.0406 | 0.7845 |
| Senegal | 0.0205 | 0.0022 | 0.0466 | 0.9337 | 0.0193 | 0.0009 | 0.0293 | 0.6629 | |
| Central Africa | Gabon | 0.0506 | 0.0072 | 0.7175 | 0.8999 | 0.0429 | 0.0055 | 0.0849 | 0.7175 |
| Cameroon | 0.0083 | 0.0002 | 0.0133 | 0.9728 | 0.0097 | 0.0004 | 0.0189 | 0.9429 | |
Fig. 12Performance comparison of all models for (a) North Africa, (b) East Africa, (c) West Africa, (d) South Africa and (e) Central Africa