| Literature DB >> 34764569 |
Koushlendra Kumar Singh1, Suraj Kumar1, Prachi Dixit2, Manish Kumar Bajpai3.
Abstract
Corona Virus Disease 2019 (COVID19) has emerged as a global medical emergency in the contemporary time. The spread scenario of this pandemic has shown many variations. Keeping all this in mind, this article is written after various studies and analysis on the latest data on COVID19 spread, which also includes the demographic and environmental factors. After gathering data from various resources, all data is integrated and passed into different Machine Learning Models in order to check its appropriateness. Ensemble Learning Technique, Random Forest, gives a good evaluation score on the tested data. Through this technique, various important factors are recognized and their contribution to the spread is analyzed. Also, linear relationships between various features are plotted through the heat map of Pearson Correlation matrix. Finally, Kalman Filter is used to estimate future spread of SARS-Cov-2, which shows good results on the tested data. The inferences from the Random Forest feature importance and Pearson Correlation gives many similarities and few dissimilarities, and these techniques successfully identify the different contributing factors. The Kalman Filter gives a satisfying result for short term estimation, but not so good performance for long term forecasting. Overall, the analysis, plots, inferences and forecast are satisfying and can help a lot in fighting the spread of the virus. © Springer Science+Business Media, LLC, part of Springer Nature 2020.Entities:
Keywords: COVID19; Kalman filter; Pearson correlation; Random Forest
Year: 2020 PMID: 34764569 PMCID: PMC7676285 DOI: 10.1007/s10489-020-01948-1
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.086
Fig. 2Heat Map of Pearson Coefficients for COVID-19 Spread
Temperature and Humidity data of 15 States of India
| S. No | State | Average min Temp | Average max Temp | Average Humidity |
|---|---|---|---|---|
| 01 | Andhra Pradesh | 27 | 31 | 73 |
| 02 | Delhi | 28 | 37 | 24 |
| 03 | Gujarat | 28 | 41 | 31 |
| 04 | Haryana | 20 | 33 | 32 |
| 05 | Jammu & Kashmir | 7 | 19 | 62 |
| 06 | Karnataka | 24 | 35 | 33 |
| 07 | Madhya Pradesh | 25 | 39 | 21 |
| 08 | Maharashtra | 30 | 34 | 57 |
| 09 | Punjab | 20 | 33 | 32 |
| 10 | Rajasthan | 24 | 35 | 29 |
| 11 | Tamil Nadu | 26 | 32 | 69 |
| 12 | Telengana | 29 | 40 | 27 |
| 13 | Uttar Pradesh | 27 | 37 | 24 |
| 14 | West Bengal | 14 | 37 | 66 |
| 15 | Kerala | 35 | 27 | 71 |
COVID-19 data of 15 states of India
| Date | State | Confirmed Cases | Cases 1 day ago | Growth in 1 day | Growth in 3 days | Growth in 5 days | Growth in 7 days | Growth rate for 1 day | Growth rate for 3 days | Growth rate for 5 days | Growth rate for 7 days |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 04–21 | Andhra Pradesh | 757 | 722 | 35 | 154 | 223 | 284 | 4.84 | 25.53 | 41.76 | 60.04 |
| 04–21 | Delhi | 2081 | 2003 | 78 | 374 | 503 | 571 | 3.89 | 21.90 | 31.87 | 37.81 |
| 04–21 | Gujarat | 2066 | 1851 | 215 | 794 | 1195 | 1449 | 11.61 | 62.42 | 137.19 | 234.84 |
| 04–21 | Haryana | 254 | 233 | 21 | 29 | 49 | 55 | 9.01 | 12.88 | 23.90 | 27.63 |
| 04–21 | Jammu & Kashmir | 368 | 350 | 18 | 40 | 68 | 98 | 5.14 | 12.19 | 22.66 | 36.29 |
| 04–21 | Karnataka | 415 | 395 | 20 | 44 | 100 | 157 | 5.06 | 11.85 | 31.74 | 60.85 |
| 04–21 | Kerala | 408 | 402 | 6 | 12 | 20 | 29 | 1.49 | 3.03 | 5.15 | 7.65 |
| 04–21 | Madhya Pradesh | 1540 | 1485 | 55 | 185 | 420 | 810 | 3.70 | 13.65 | 37.5 | 110.95 |
| 04–21 | Maharashtra | 4669 | 4203 | 466 | 1346 | 1750 | 2332 | 11.08 | 40.50 | 59.95 | 99.78 |
| 04–21 | Punjab | 245 | 219 | 26 | 43 | 59 | 69 | 11.87 | 21.28 | 31.72 | 39.20 |
| 04–21 | Rajasthan | 1576 | 1478 | 98 | 347 | 553 | 697 | 6.63 | 28.23 | 54.05 | 79.29 |
| 04–21 | Tamil Nadu | 1520 | 1477 | 43 | 197 | 278 | 347 | 2.91 | 14.89 | 22.384 | 29.58 |
| 04–21 | Telengana | 919 | 873 | 46 | 128 | 221 | 295 | 5.26 | 16.18 | 31.661 | 47.275 |
| 04–21 | Uttar Pradesh | 1294 | 1176 | 118 | 325 | 521 | 637 | 10.03 | 33.53 | 67.39 | 96.95 |
| 04–21 | West Bengal | 392 | 339 | 53 | 105 | 161 | 202 | 15.63 | 36.58 | 69.69 | 106.31 |
Fig. 1COVID-19 Spread Scenario of different Indian States
Fig. 3Scaled Importance of features in COVID-19 spread using Random Forest
Mean Average Error in Validation of prediction model state wise
| S. No | State | MAE(1 day) | MAE(7 day) | MAE(15 days) |
|---|---|---|---|---|
| 01 | Andhra Pradesh | 12.00 | 46.42 | 55.336 |
| 02 | Delhi | 7.00 | 42.42 | 1599.27 |
| 03 | Gujarat | 35.00 | 137.42 | 195.46 |
| 04 | Haryana | 24.00 | 151.00 | 68.27 |
| 05 | Jammu &Kashmir | 1.00 | 24.71 | 131.6 |
| 06 | Karnataka | 25.00 | 45.71 | 38.8 |
| 07 | Madhya Pradesh | 26.00 | 141.42 | 162.2 |
| 08 | Maharashtra | 128.00 | 337.14 | 3527.06 |
| 09 | Punjab | 133.00 | 814.14 | 440.80 |
| 10 | Rajasthan | 29.00 | 46.85 | 340.93 |
| 11 | Tamil Nadu | 372.00 | 1297.71 | 1214.20 |
| 12 | Telengana | 1.00 | 52.57 | 145.27 |
| 13 | Uttar Pradesh | 7.00 | 38.57 | 245.40 |
| 14 | West Bengal | 16.00 | 82.71 | 375.20 |
| 15 | Kerala | 2.00 | 35.57 | 38.80 |
Fig. 4Validation Results of Prediction model for different states
Fig. 5Validation Results of Prediction model for USA
The dataset of the COVID-19 used in the paper [34]
| Date (D/M/Y) | Confirmed | Date (D/M/Y) | Confirmed | Date (D/M/Y) | Confirmed |
|---|---|---|---|---|---|
| 21/1/2020 | 278 | 31/1/2020 | 9720 | 10/2/2020 | 40,554 |
| 22/1/2020 | 309 | 1/2/2020 | 11,821 | 11/2/2020 | 42,708 |
| 23/1/2020 | 571 | 2/2/2020 | 14,411 | 12/2/2020 | 44,730 |
| 24/1/2020 | 830 | 3/2/2020 | 17,283 | 13/2/2020 | 46,550 |
| 25/1/2020 | 1297 | 4/2/2020 | 20,471 | 14/2/2020 | 48,548 |
| 26/1/2020 | 1985 | 5/2/2020 | 24,363 | 15/2/2020 | 50,054 |
| 27/1/2020 | 2741 | 6/2/2020 | 28,060 | 16/2/2020 | 51,174 |
| 28/1/2020 | 4537 | 7/2/2020 | 31,211 | 17/2/2020 | 70,635 |
| 29/1/2020 | 5997 | 8/2/2020 | 34,598 | 18/2/2020 | 72,528 |
| 30/1/2020 | 7736 | 9/2/2020 | 37,251 |
The predicted and actual confirmed cases of China
| Date(D/M/Y) | Actual Confirmed | Predicted Confirmed |
|---|---|---|
| 11/2/2020 | 42,708 | 44,032 |
| 12/2/2020 | 44,730 | 45,461 |
| 13/2/2020 | 46,550 | 47,105 |
| 14/2/2020 | 48,548 | 48,632 |
| 15/2/2020 | 50,054 | 50,513 |
| 16/2/2020 | 51,174 | 51,128 |
| 17/2/2020 | 70,635 | 54,219 |
| 18/2/2020 | 72,528 | 59,703 |
Comparison of Result obtained from various models on the same dataset
| Method | Mean Average Error |
|---|---|
| ANN | 5413 |
| KNN | 7671 |
| SVR | 5354 |
| ANFIS | 5523 |
| PSO | 4559 |
| GA | 4963 |
| ABC | 6066 |
| FPA | 4379 |
| FPASSA | 4271 |
| Proposed |
Fig. 6.prediction results for next 30 days for different states of India