| Literature DB >> 33281306 |
Abstract
The Covid-19 pandemic is the most important health disaster that has surrounded the world for the past eight months. There is no clear date yet on when it will end. By now, more than 31 million people have been infected worldwide. Predicting the Covid-19 trend has become a challenging issue. In this study, data of COVID-19 between 20/01/2020 and 18/09/2020 for USA, Germany and Global was obtained from World Health Organization. Dataset consist of weekly confirmed cases and weekly cumulative confirmed cases for 35 weeks. Then the distribution of the data was examined using the most up-to-date Covid-19 weekly case data and its parameters were obtained according to the statistical distributions. Furthermore, time series prediction model using machine learning was proposed to obtain the curve of disease and forecast the epidemic tendency. Linear regression, multi-layer perceptron, random forest and support vector machines (SVM) machine learning methods were used. The performances of the methods were compared according to the RMSE, APE, MAPE metrics and it was seen that SVM achieved the best trend. According to estimates, the global pandemic will peak at the end of January 2021 and estimated approximately 80 million people will be cumulatively infected.Entities:
Keywords: Covid-19; Machine Learning; Multi-layer perceptron; Statistical distribution; Support vector machine
Year: 2020 PMID: 33281306 PMCID: PMC7698672 DOI: 10.1016/j.chaos.2020.110512
Source DB: PubMed Journal: Chaos Solitons Fractals ISSN: 0960-0779 Impact factor: 5.944
Descriptive statistics of weekly confirmed cases
| Mean | Std.Dev. | Minimum | Maximum | Skewness | Kurtosis | |
|---|---|---|---|---|---|---|
| 881.504 | 714.238 | 1.928 | 2.177.544 | 0,30 | -1,35 | |
| 29.274 | 21.516 | 0 | 66.963 | -0,01 | -1,05 | |
| 988 | 1.172 | 0 | 4.615 | 2,03 | 3,98 |
Fig. 1Histogram of weekly cases for global, Germany and USA
Goodness of fit test for weekly global cases
| Distribution | AD | P |
|---|---|---|
| Normal | 1,259 | <0,005 |
| Lognormal | 2,860 | <0,005 |
| Exponential | 2,594 | <0,003 |
| 2-Parameter Exponential | 1,665 | 0,012 |
| Weibull | 2,039 | <0,010 |
| 3-Parameter Weibull | 1,667 | <0,005 |
| Smallest Extreme Value | 1,564 | <0,010 |
| Largest Extreme Value | 1,153 | <0,010 |
| Gamma | 1,798 | <0,005 |
| Logistic | 1,232 | <0,005 |
Fig. 2Probability plot for weekly global cases
Estimates of distribution parameters for weekly global cases
| Distribution | Location | Shape | Scale | Threshold |
|---|---|---|---|---|
| 881.504 | - | 714.238 | - | |
| 12,80563 | - | 1,93553 | - | |
| - | - | 881.504 | - | |
| - | - | 905.445 | -23.941,9 | |
| - | 0,83125 | 820.779 | - | |
| - | 1,01121 | 910.043 | -24.993 | |
| 1.241.170 | - | 668.737 | - | |
| 542.319 | - | 580.244 | - | |
| - | 0,68580 | 1.285.360 | - | |
| 844.683 | - | 430.713 | - |
Fig. 3Probability plot for weekly cases in Germany
Fig. 4Probability plot for weekly cases in USA
Comparison of the methods.
| Methods | Metric | Global | Germany | USA |
|---|---|---|---|---|
| Random Forest | MAE | 269.274,0518 | 955,1736 | 42.496,719 |
| MAPE | 2,0726 | 0,4477 | 1,5608 | |
| RMSE | 340.926,4251 | 1.387,0147 | 53.864,6539 | |
| Linear Regression | MAE | 17.081,1363 | 224,0337 | 11.745,2812 |
| MAPE | 0,1853 | 0,1125 | 0,3331 | |
| RMSE | 21.816,6988 | 324,0253 | 16.508,4263 | |
| MLP | MAE | 139.330,4846 | 752,3291 | 19.819,1675 |
| MAPE | 0,8179 | 0,381 | 0,6497 | |
| RMSE | 223.638,9972 | 832.688,26 | 26.713,8094 | |
| SVM | MAE | 19.771,7317 | 191,0731 | 5.852,0147 |
| MAPE | 0,1247 | 0,0918 | 0,1406 | |
| RMSE | 25.825,8366 | 329,196 | 9.531,6776 |
Fig. 5Prediction of weekly cumulative global cases
Fig. 6Prediction of weekly cumulative cases for Germany
Fig. 7Prediction of weekly cumulative cases for USA