| Literature DB >> 34025047 |
Tiago Tiburcio da Silva1, Rodrigo Francisquini1, Maric C V Nascimento1.
Abstract
In 2020, Brazil was the leading country in COVID-19 cases in Latin America, and capital cities were the most severely affected by the outbreak. Climates vary in Brazil due to the territorial extension of the country, its relief, geography, and other factors. Since the most common COVID-19 symptoms are related to the respiratory system, many researchers have studied the correlation between the number of COVID-19 cases with meteorological variables like temperature, humidity, rainfall, etc. Also, due to its high transmission rate, some researchers have analyzed the impact of human mobility on the dynamics of COVID-19 transmission. There is a dearth of literature that considers these two variables when predicting the spread of COVID-19 cases. In this paper, we analyzed the correlation between the number of COVID-19 cases and human mobility, and meteorological data in Brazilian capitals. We found that the correlation between such variables depends on the regions where the cities are located. We employed the variables with a significant correlation with COVID-19 cases to predict the number of COVID-19 infections in all Brazilian capitals and proposed a prediction method combining the Ensemble Empirical Mode Decomposition (EEMD) method with the Autoregressive Integrated Moving Average Exogenous inputs (ARIMAX) method, which we called EEMD-ARIMAX. After analyzing the results poor predictions were further investigated using a signal processing-based anomaly detection method. Computational tests showed that EEMD-ARIMAX achieved a forecast 26.73% better than ARIMAX. Moreover, an improvement of 30.69% in the average root mean squared error (RMSE) was noticed when applying the EEMD-ARIMAX method to the data normalized after the anomaly detection.Entities:
Keywords: ARIMAX; COVID-19; EEMD; anomaly; human mobility data; meteorological data
Year: 2021 PMID: 34025047 PMCID: PMC8130621 DOI: 10.1016/j.eswa.2021.115190
Source DB: PubMed Journal: Expert Syst Appl ISSN: 0957-4174 Impact factor: 6.954
Fig. 1Decomposed IMFs and residual obtained by EEMD considering the number of COVID-19 cases in São Paulo.
Fig. 2Flowchart of the proposed EEMD-ARIMAX method.
Spearman correlation between the number of COVID-19 cases and the meteorological data.
Spearman correlation between the number of COVID-19 cases and the human mobility data.
Variables per Brazilian capital which showed some level of correlation with the number of COVID-19 cases and were considered in the proposed models.
| Region | City-Federative unit | Meteorological variables | Mobility variables |
|---|---|---|---|
| North | Belém-PA | - | - |
| Boa Vista-RR | - | RR, GP, TS, WO | |
| Macapá-AP | - | - | |
| Manaus-AM | - | - | |
| Palmas-TO | Rain, Max Temp, Min Temp, Hum | RR, GP, PA, TS, WO, RE | |
| Porto Velho-RO | Min Temp | - | |
| Rio Branco-AC | - | - | |
| Northeast | Aracaju-SE | Max Temp, Min Temp | - |
| Fortaleza-CE | Max Temp, Hum | RR, PA, TS, RE | |
| João Pessoa-PB | Max Temp, Min Temp | - | |
| Maceió-AL | Rain, Max Temp, Min Temp | RE | |
| Natal-RN | Max Temp | - | |
| Recife-PE | Max Temp, Hum | PA, RE | |
| Salvador-BA | Max Temp, Min Temp | - | |
| São Luis-MA | Rain, Max Temp | TS, RE | |
| Teresina-PI | Rain, Max Temp, Min Temp, Hum | RR, PA, TS, WO, RE | |
| Midwest | Brasilia-DF | Rain, Min Temp, Hum | GP |
| Campo Grande-MS | - | RR, GP, PA, TS, WO | |
| Cuiabá-MT | Max Temp, Min Temp | RR, GP, PA, TS, WO, RE | |
| Goiânia-GO | Rain, Hum | RR, GP, PA, TS, WO, RE | |
| Southeast | Belo Horizonte-MG | - | WO |
| Rio de Janeiro-RJ | Min Temp | - | |
| São Paulo-SP | Min Temp | - | |
| Vitória-ES | - | GP, WO | |
| South | Curitiba-PR | - | - |
| Florianópolis-SC | - | RR, GP, PA, TS, WO, RE | |
| Porto Alegre-RS | - | RR, GP, TS, WO, RE |
Results achieved by ARIMAX and EEMD-ARIMAX methods.
| Region | City-Federative unit | ARIMAX | EEMD-ARIMAX | ||||
|---|---|---|---|---|---|---|---|
| ME | RMSE | MAE | ME | RMSE | MAE | ||
| North | Belém-PA (1,0,1) | 3.213 | 139.121 | 100.688 | −5.891 | 89.189 | 61.905 |
| Boa Vista-RR (0,1,1) | 5.092 | 235.997 | 123.145 | −2.248 | 159.335 | 97.518 | |
| Macapá-AP (3,0,2) | 0.229 | 185.697 | 79.845 | −7.852 | 148.587 | 69.708 | |
| Manaus-AM (2,1,3) | 9.106 | 213.732 | 152.327 | −12.994 | 142.140 | 102.373 | |
| Palmas-TO (2,0,2) | -0.223 | 62.519 | 37.374 | −0.168 | 53.240 | 34.604 | |
| Porto Velho-RO (2,1,3) | 6.074 | 186.198 | 104.206 | 0.266 | 162.095 | 98.116 | |
| Rio Branco-AC (0,1,2) | 1.088 | 46.482 | 28.736 | −1.487 | 30.006 | 18.009 | |
| Northeast | Aracaju-SE (0,1,1) | 1.413 | 163.284 | 91.061 | −0.183 | 107.398 | 58.659 |
| Fortaleza-CE (1,0,1) | 7.275 | 255.173 | 139.918 | 0.023 | 150.085 | 98.026 | |
| João Pessoa-PB (3,0,2) | 6.512 | 104.106 | 73.007 | −0.629 | 61.758 | 42.498 | |
| Maceió-AL (2,1,2) | 0.646 | 89.629 | 54.596 | 0.010 | 61.337 | 41.793 | |
| Natal-RN (1,0,3) | 4.485 | 209.895 | 104.098 | −12.078 | 183.497 | 99.912 | |
| Recife-PE (3,0,2) | 3.028 | 144.336 | 82.521 | −6.969 | 114.020 | 69.529 | |
| Salvador-BA (0,1,3) | 3.456 | 329.488 | 197.839 | −2.474 | 214.520 | 134.791 | |
| São Luis-MA (2,0,3) | 2.091 | 54.378 | 32.419 | −2.704 | 32.342 | 19.566 | |
| Teresina-PI (2,0,3) | 0.703 | 78.607 | 59.522 | −4.655 | 56.288 | 43.008 | |
| Midwest | Brasilia-DF (0,1,4) | 5.992 | 272.017 | 173.067 | 3.756 | 176.154 | 111.602 |
| Campo Grande-MS (0,1,4) | 4.305 | 130.909 | 65.215 | −7.399 | 87.561 | 50.282 | |
| Cuiabá-MT (1,0,4) | 1.487 | 58.115 | 28.884 | 1.521 | 35.695 | 19.566 | |
| Goiânia-GO (4,1,1) | 7.194 | 262.247 | 165.356 | −11.852 | 209.587 | 150.245 | |
| Southeast | Belo Horizonte-MG (2,1,3) | 6.596 | 273.975 | 183.085 | −1.762 | 186.489 | 123.697 |
| Rio de Janeiro-RJ (2,1,3) | 12.532 | 444.872 | 294.399 | −15.907 | 318.717 | 227.619 | |
| São Paulo-SP (0,1,5) | 18.479 | 988.765 | 660.780 | 4.203 | 775.817 | 494.870 | |
| Vitória-ES (1,0,2) | 3.598 | 62.274 | 42.943 | 1.440 | 41.234 | 27.729 | |
| South | Curitiba-PR (5,1,0) | 0.843 | 127.775 | 76.223 | −9.406 | 103.601 | 61.407 |
| Florianópolis-SC (0,1,3) | 26.907 | 230.129 | 80.612 | 15.483 | 210.716 | 86.626 | |
| Porto Alegre-RS (0,1,1) | 25.704 | 373.925 | 140.315 | −23.187 | 282.510 | 140.701 | |
Fig. 3Model errors of Goiânia - GO and the significance threshold.
Fig. 4Observed variation versus accentuated variation in all 27 capitals of Brazil on August 17, 2020.
Fig. 5Accentuated daily variations in the number of COVID-19 cases in Goiânia - GO and the anomaly threshold.
Fig. 6Percentage of days which EEMD-ARIMAX significantly mispredicted and corresponded to an anomaly.
Fig. 7Observed Daily Cases versus Normalized Daily Cases.
Results achieved by ARIMAX and EEMD-ARIMAX using normalized data.
| Region | City-Federative unit | ARIMAX | EEMD-ARIMAX | ||||
|---|---|---|---|---|---|---|---|
| ME | RMSE | MAE | ME | RMSE | MAE | ||
| North | Belém-PA (4,1,1) | 3.496 | 88.611 | 64.383 | 0.823 | 55.300 | 40.659 |
| Boa Vista-RR (1,0,1) | 7.276 | 228.034 | 115.729 | −15.266 | 165.307 | 100.733 | |
| Macapá-AP (2,1,3) | 2.783 | 129.141 | 56.598 | −0.702 | 79.429 | 39.340 | |
| Manaus-AM (4,1,1) | 0.796 | 18.994 | 12.861 | 0.005 | 12.406 | 8.771 | |
| Palmas-TO (3,1,2) | 1.301 | 49.561 | 35.493 | 1.416 | 35.471 | 26.719 | |
| Porto Velho-RO (2,1,3) | −0.193 | 151.779 | 87.690 | −0.024 | 131.221 | 19.299 | |
| Rio Branco-AC (2,0,2) | 0.287 | 44.530 | 29.891 | −1.546 | 28.245 | 17.304 | |
| Northeast | Aracaju-SE (0,1,1) | 2.672 | 103.176 | 65.999 | −2.111 | 75.438 | 48.003 |
| Fortaleza-CE (1,0,1) | 7.836 | 179.518 | 103.165 | 0.758 | 122.469 | 76.730 | |
| João Pessoa-PB (0,1,4) | 2.929 | 83.694 | 57.921 | 0.827 | 45.635 | 32.355 | |
| Maceió-AL (0,1,5) | 1.645 | 65.674 | 43.626 | −1.429 | 42.589 | 29.490 | |
| Natal-RN (0,1,3) | 2.701 | 155.989 | 83.457 | −12.897 | 130.216 | 72.329 | |
| Recife-PE (0,1,5) | 2.483 | 105.509 | 63.195 | −4.679 | 80.054 | 50.157 | |
| Salvador-BA (4,1,1) | 3.053 | 195.621 | 121.726 | −7.470 | 155.119 | 102.551 | |
| São Luis-MA (0,1,4) | 1.747 | 42.258 | 31.661 | 1.173 | 25.417 | 18.514 | |
| Teresina-PI (3,1,2) | 2.676 | 59.647 | 42.921 | 2.135 | 47.579 | 35.588 | |
| Midwest | Brasilia-DF (3,1,2) | 4.041 | 135.460 | 87.941 | −1.909 | 91.309 | 57.297 |
| Campo Grande-MS (3,1,2) | 3.944 | 90.317 | 55.276 | −0.974 | 62.449 | 40.751 | |
| Cuiabá-MT (2,1,3) | 1.410 | 54.245 | 37.052 | −0.109 | 35.409 | 24.553 | |
| Goiânia-GO (3,1,2) | 3.249 | 136.126 | 88.814 | −11.139 | 100.058 | 73.351 | |
| Southeast | Belo Horizonte-MG (3,1,2) | 4.806 | 156.051 | 109.193 | 1.246 | 114.420 | 81.809 |
| Rio de Janeiro-RJ (0,1,5) | 8.852 | 292.669 | 192.758 | −22.889 | 230.824 | 159.742 | |
| São Paulo-SP (0,1,5) | 12.976 | 628.305 | 420.853 | −57.823 | 497.835 | 314.477 | |
| Vitória-ES (3,1,2) | 1.569 | 56.394 | 41.890 | 3.439 | 44.391 | 32.097 | |
| South | Curitiba-PR (2,1,3) | 2.342 | 99.194 | 61.001 | −7.094 | 80.935 | 51.362 |
| Florianópolis-SC (0,1,3) | 18.719 | 184.337 | 76.301 | −8.676 | 144.855 | 68.046 | |
| Porto Alegre-RS (0,1,1) | 25.068 | 325.657 | 129.257 | −2.233 | 272.542 | 150.659 | |
Fig. 8RMSE of original data versus RMSE of normalized data.
Part 1 of the list of symbols and notations used in this paper.
| Symbol | Description |
|---|---|
| time series | |
| number of ensembles in EEMD | |
| number of IMFs to be extracted from | |
| variable that specifies an ensemble in a given iteration of EEMD | |
| number of meteorological and human mobility variables | |
| meteorological and human mobility variables | |
| time series obtained from | |
| standard deviation of | |
| standard deviation of | |
| a relatively small number which relates | |
| upper envelope of | |
| lower envelope of | |
| time series which correspond to average between | |
| time series obtained by the operation | |
| residual values found by applying EEMD to | |
| estimated residual values | |
| time series | |
| Spearman correlation coefficient |
Part 2 of the list of symbols and notations used in this paper.
| Symbol | Description |
|---|---|
| number of autoregressive terms in ARIMAX | |
| number of nonseasonal differences needed for stationarity in ARIMAX | |
| number of lagged forecast errors in ARIMAX | |
| constant of the ARIMAX | |
| error terms of the ARIMAX, for | |
| number of days in the dataset | |
| observed number of COVID-19 cases on day | |
| predicted number of COVID-19 cases on day | |
| error in the prediction of the number of COVID-19 cases on day | |
| absolute value of the difference between 1 and | |
| set | |
| spectrum of | |
| eigenvalues of graph Laplacian | |
| time series obtained by applying the inverse Fourier transform in | |