| Literature DB >> 35789572 |
Arnabi Bej1, Ujjwal Maulik1, Anasua Sarkar1.
Abstract
Probabilistic Regression is a statistical technique and a crucial problem in the machine learning domain which employs a set of machine learning methods to forecast a continuous target variable based on the value of one or multiple predictor variables. COVID-19 is a virulent virus that has brought the whole world to a standstill. The potential of the virus to cause inter human transmission makes the world a dangerous place. This article predicts the upcoming circumstances of the Corona virus to subside its action. We have performed Conditional GAN regression to anticipate the subsequent COVID-19 cases of five countries. The GAN variant CGAN is used to design the model and predict the COVID-19 cases for 3 months ahead with least error for the dataset provided. Each country is examined individually, due to their variation in population size, tradition, medical management and preventive measures. The analysis is based on confirmed data, as provided by the World Health Organization. This paper investigates how conditional Generative Adversarial Networks (GANs) can be used to accurately exhibit intricate conditional distributions. GANs have got spectacular achievement in producing convoluted high-dimensional data, but work done on their use for regression problems is minimal. This paper exhibits how conditional GANs can be employed in probabilistic regression. It is shown that conditional GANs can be used to evaluate a wide range of various distributions and be competitive with existing probabilistic regression models.Entities:
Keywords: COVID-19; Conditional generative adversarial networks (CGAN); Deep learning; Forecasting; Time series
Year: 2022 PMID: 35789572 PMCID: PMC9244013 DOI: 10.1007/s42979-022-01225-7
Source DB: PubMed Journal: SN Comput Sci ISSN: 2661-8907
Fig. 1Transmission and mitigation of COVID-19
Fig. 2GAN architecture
Fig. 3CGAN architecture
Choosing data used for experiment
| Country | Confirmed cases | Death cases |
|---|---|---|
| India | 1.1.2020–06.04.2021 | 1.1.2020–06.04.2021 |
| Germany | 1.1.2020–06.04.2021 | 1.1.2020–06.04.2021 |
| USA | 1.1.2020–06.04.2021 | 1.1.2020–06.04.2021 |
| Italy | 1.1.2020–06.04.2021 | 1.1.2020–06.04.2021 |
| Spain | 1.1.2020–06.04.2021 | 1.1.2020–06.04.2021 |
Fig. 4Proposed methodology framework
Fig. 5Generator network structure
Fig. 6Discriminator network structure
Fig. 7India new cases
Fig. 8India new deaths
Fig. 9Italy new cases
Fig. 10Italy new deaths
Fig. 11Germany new cases
Fig. 12Germany new deaths
Fig. 13USA new cases
Fig. 14USA new deaths
Fig. 15Spain new cases
Fig. 16Spain new deaths
RMSE values
| Country | New cases | Cumulative cases | New deaths | Cumulative deaths |
|---|---|---|---|---|
| India | 557.4 | 448.49 | 163.35 | 503.44 |
| Germany | 665.65 | 231.3 | 412.6 | 242.06 |
| USA | 709.35 | 542.8 | 587.54 | 142.43 |
| Italy | 734.7 | 681.7 | 308.5 | 221.4 |
| Spain | 603.56 | 126.3 | 253.8 | 203.4 |
P values
| Country | New cases | Cumulative cases | New deaths | Cumulative deaths |
|---|---|---|---|---|
| India | 0.02 | 0.007 | 0.008e−2 | 2.15e−1 |
| Germany | 0.003 | 2.43e−1 | 0.0016 | 3.3e−2 |
| USA | 3.6e−2 | 2.6e−1 | 2.87e−1 | 3.05e−2 |
| Italy | 2.18e−2 | 1.28e−1 | 3.41e−1 | 2.11e−2 |
| Spain | 0.014 | 2.6e−2 | 3.2e−1 | 2.5e−2 |
Mean absolute percentage error
| Country | New cases | Cumulative cases | New deaths | Cumulative deaths |
|---|---|---|---|---|
| India | 2.38 | 2.49 | 1.26 | 0.66 |
| Germany | 0.54 | 2.40 | 0.84 | 3.42 |
| USA | 0.84 | 2.64 | 0.40 | 3.52 |
| Italy | 0.59 | 2.18 | 3.33 | 2.38 |
| Spain | 1.94 | 3.87 | 3.04 | 1.91 |
R2 score, the coefficient of determination
| Country | New cases | Cumulative cases | New deaths | Cumulative deaths |
|---|---|---|---|---|
| India | 0.87 | 0.77 | 0.89 | 0.84 |
| Germany | 0.768 | 0.8 | 0.9 | 0.96 |
| USA | 0.75 | 0.86 | 0.78 | 0.98 |
| Italy | 0.83 | 0.75 | 0.93 | 0.88 |
| Spain | 0.89 | 0.94 | 0.84 | 0.95 |
Explained variance score
| Country | New cases | Cumulative cases | New deaths | Cumulative deaths |
|---|---|---|---|---|
| India | 0.54 | 0.60 | 0.61 | 0.62 |
| Germany | 0.68 | 0.84 | 0.56 | 0.64 |
| USA | 0.51 | 0.90 | 0.76 | 0.50 |
| Italy | 0.73 | 0.51 | 0.633 | 0.60 |
| Spain | 0.63 | 0.85 | 0.71 | 0.85 |
Performance measures from existing regression algorithms for New Cases in India
| Regressor | Expected variance | MSE | RMSE | |
|---|---|---|---|---|
| Logistic regression | – | – | 41,690.592 | 204.182 |
| Lasso | 0.0 | − 0.255 | 0.124 | 0.352 |
| Ridge regression | 0.977 | 0.976 | 0.002 | 0.048 |
| ElasticNet | 0.193 | − 0.009 | 0.099 | 0.316 |
| Proposed | 0.950 | 0.943 | 0.006 | 0.075 |
True and depicted daily new cases comparison
| Country | Date | Reported data | Predicted data |
|---|---|---|---|
| India | 06.6.2021 | 100,636 | 100,321 |
| 07.6.2021 | 86,498 | 86,212 | |
| 08.6.2021 | 92,596 | 92,321 | |
| 09.6.2021 | 93,463 | 93,112 | |
| 10.6.2021 | 92,291 | 92,345 | |
| Germany | 06.6.2021 | 1964 | 1912 |
| 07.6.2021 | 1444 | 1421 | |
| 08.6.2021 | 2253 | 2289 | |
| 09.6.2021 | 3275 | 3264 | |
| 10.6.2021 | 2747 | 2732 | |
| USA | 06.6.2021 | 5395 | 5356 |
| 07.6.2021 | 15,496 | 15,400 | |
| 08.6.2021 | 13,013 | 13,000 | |
| 09.6.2021 | 18,647 | 18,670 | |
| 10.6.2021 | 14,545 | 14,500 | |
| Italy | 06.6.2021 | 2275 | 2289 |
| 07.6.2021 | 1270 | 1265 | |
| 08.6.2021 | 1894 | 1800 | |
| 09.6.2021 | 2198 | 2450 | |
| 10.6.2021 | 2078 | 2043 | |
| Spain | 06.6.2021 | 3 | 10 |
| 07.6.2021 | 9542 | 9456 | |
| 08.6.2021 | 3504 | 3605 | |
| 09.6.2021 | 4427 | 4489 | |
| 10.6.2021 | 14,004 | 14,390 |