| Literature DB >> 35529266 |
Sweeti Sah1, B Surendiran1, R Dhanalakshmi2, Sachi Nandan Mohanty3, Fayadh Alenezi4, Kemal Polat5.
Abstract
Due to the proliferation of COVID-19, the world is in a terrible condition and human life is at risk. The SARS-CoV-2 virus had a significant impact on public health, social issues, and financial issues. Thousands of individuals are infected on a regular basis in India, which is one of the populations most seriously impacted by the pandemic. Despite modern medical and technical technology, predicting the spread of the virus has been extremely difficult. Predictive models have been used by health systems such as hospitals, to get insight into the influence of COVID-19 on outbreaks and possible resources, by minimizing the dangers of transmission. As a result, the main focus of this research is on building a COVID-19 predictive analytic technique. In the Indian dataset, Prophet, ARIMA, and stacked LSTM-GRU models were employed to forecast the number of confirmed and active cases. State-of-the-art models such as the recurrent neural network (RNN), gated recurrent unit (GRU), long short-term memory (LSTM), linear regression, polynomial regression, autoregressive integrated moving average (ARIMA), and Prophet were used to compare the outcomes of the prediction. After predictive research, the stacked LSTM-GRU model forecast was found to be more consistent than existing models, with better prediction results. Although the stacked model necessitates a large dataset for training, it aids in creating a higher level of abstraction in the final results and the maximization of the model's memory size. The GRU, on the other hand, assists in vanishing gradient resolution. The study findings reveal that the proposed stacked LSTM and GRU model outperforms all other models in terms of R square and RMSE and that the coupled stacked LSTM and GRU model outperforms all other models in terms of R square and RMSE. This forecasting aids in determining the future transmission paths of the virus.Entities:
Mesh:
Year: 2022 PMID: 35529266 PMCID: PMC9070409 DOI: 10.1155/2022/1556025
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.809
Various other data sources for COVID-19.
| Data source | Type of data | Link |
|---|---|---|
| WHO | This dataset is provided by WHO (World Health Organization), providingup-to-date data across the world. This is the official website to get the COVID-19 case live updates and get information about outbreaks |
|
| Kaggle | Kaggle is the largest data science community to provide an open-source dataset globally. Every day, the count of new cases increases around the world. This collection contains data from India's states and also union territories on a daily basis |
|
| Johns Hopkins University | Johns Hopkins University's Center for Systems Science and Engineering team maintains the dashboard and dataset (CSSE). Since January 22, 2020, it has been posting data on confirmed cases and deaths for all nations. Every day, JHU updates its data many times. This information comes from governments and national and subnational agencies all across the world; a complete list of data sources for each country may be seen on the Johns Hopkins GitHub page. It also makes its data available to the general public there [ |
|
| C. R. Well's GitHub | Open-source, writable linked data repository, developed by Thomson Reuters and Refinitiv and is used as a central knowledge graph database [ |
|
| China CDC (CCDC) | This dataset shows the daily count of cases particularly in China consisting of confirmed, asymptomatic, and recoveries |
|
| DataHub | DataHub provides time series data on COVID-19 cases. The data is in .csv format and is updated on a daily basis. It comes from this upstream repository, which is managed by the fantastic team at Johns Hopkins University's Center for Systems Science and Engineering (CSSE), who have been collecting data from all around the world since the beginning |
|
| Conference of State Bank Supervisors | This provides a country-level map of COVID cases in the U.S., which is revised hourly |
|
| U.S. CDC | This provides the cases in U.S. Consumers, financial institutions, and fellow regulators will get timely information on actions taken to support communities during the COVID-19 epidemic from state regulators. This website will be updated with all public CSBS updates related to COVID-19 |
|
| Italy Ministry of Health | This provides the COVID-19 cases in Italy |
|
| U.S. National Institutes of Health | This provides overall COVID-19 cases in the U.S. |
|
| W. Zeng's website | This website provides the global COVID-19 cases country-wise |
|
| COVID-19 Radiography Database | This provides COVID-19, 45k scholarly articles, and its family. This publicly available dataset is made available to the world's researchers to use recent developments in natural language processing (NLP) and other A.I. approaches to bring about fresh intuition in support of the ongoing fight against this disease |
|
| A. G. Chung's GitHub—Actualmed Initiative | It includes an X-ray image dataset of the chest initiative of COVID-19 |
|
| Georgia State University's Panacea Lab | This chatter in Twitter provides the different linguistic datasets. The data obtained from the stream includes all languages, but English, Spanish, and French have the highest incidence |
|
| NCBI GeneBank | This provides the sequence of SARS-CoV-2 |
|
| The GISAID Initiative | This is the global proposal revealing all types of cold data |
|
| China National GeneBank | This provides the sequence database of COVID-19 |
|
| EMBL-EBI | This provides the sequence of COVID outbreak isolates and record |
|
Figure 1Per day statistics of India (considering all cured cases, death cases, confirmed cases, and new cases), where the x-axis is “date” and y-axis represents “number of cases.”
Figure 2Per day statistics of India with marker values showing no. of confirmed, cured, and death cases.
Figure 3Per day statistics for new cases in India.
Figure 4Model of the data analysis phase.
Figure 5Bar graph showing the percentage of confirmed/cured/death cases per state (India).
Figure 6Top five states of India (cases of COVID-19).
Implementation detail of models used.
| Dataset used | COVID-19 Indian dataset from Kaggle (up to 12th Dec 2020) |
|---|---|
| Models used | Prophet, ARIMA, stacked LSTM-GRU |
| Language | Python |
| System software | Co-Lab |
| Libraries used | Pandas, NumPy, Sklearn, Matplotlib, FbProphet, ARIMA, LSTM, Sequential, Dense, Math, DateTime |
Figure 7Working model of Prophet.
Figure 8Prediction through Prophet for a count of confirmed cases (for the next 20 days), where the x-axis is “ds” and y-axis is “y” which is the target.
Figure 9Prediction through Prophet for the number of new cases (for the next 20 days), where x-axis is “ds” and y-axis is “y” which is target.
Figure 10Working model of ARIMA.
Figure 11Prediction through ARIMA for the confirmed cases (for the next 20 days), where the x-axis is “dates” and the y-axis is “total cases.”
Figure 12Prediction through ARIMA for the new cases (for the next 20 days), where the x-axis is “dates” and the y-axis is “total cases.”
Figure 13Model of stacked LSTM-GRU.
Figure 14Prediction through stacked LSTM-GRU neural network for new cases (for the next 20 days), where the x-axis is “days” and the y-axis is “number of cases.”
Figure 15Prediction curve for confirmed cases, where the x-axis shows “days” and y-axis shows the “number of cases.”
Statistical values for forecasted COVID-19 new confirmed cases in India.
| Date | Forecasted COVID-19 new confirmed cases |
|---|---|
| 11/12/2020 | 29,340 |
| 12/12/2020 | 26,819 |
| 13/12/2020 | 31,100 |
| 14/12/2020 | 27,546 |
| 15/12/2020 | 22,051 |
| 16/12/2020 | 26,111 |
| 17/12/2020 | 24,000 |
| 18/12/2020 | 22,700 |
| 19/12/2020 | 21,354 |
| 20/12/2020 | 29,456 |
| 21/12/2020 | 34,567 |
| 22/12/2020 | 23,965 |
| 23/12/2020 | 24,700 |
| 24/12/2020 | 23,765 |
| 25/12/2020 | 24,567 |
| 26/12/2020 | 27,912 |
| 27/12/2020 | 18,722 |
| 28/12/2020 | 16,400 |
| 29/12/2020 | 19,500 |
| 30/12/2020 | 21,800 |
| 31/12/2020 | 21,121 |
Performance metrics for various models showing comparison with the state-of-the-art models.
| Performance metrics | RNN | GRU | LSTM | Linear regression | Polynomial regression | ARIMA | Prophet | LSTM-GRU |
|---|---|---|---|---|---|---|---|---|
|
| 0.30 | 0.74 | 0.05 | 0.01 | 0.31 | 0.56 | 0.46 | 0.74 |
| RMSE | 120.35 | 94.558 | 134.505 | 284809.4 | 149117.8 | 1260 | 568.58 | 69.92 |
Evaluation metrics.
|
|
| Where |
| RMSE |
| Where |