| Literature DB >> 33041534 |
Rahele Kafieh1, Narges Saeedizadeh1, Roya Arian1, Zahra Amini1, Nasim Dadashi Serej1, Atefeh Vaezi2, Shaghayegh Haghjooy Javanmard3.
Abstract
The coronavirus COVID-19 is affecting 213 countries and territories around the world. Iran was one of the first affected countries by this virus. Isfahan, as the third most populated province of Iran, experienced a noticeable epidemic. The prediction of epidemic size, peak value, and peak time can help policymakers in correct decisions. In this study, deep learning is selected as a powerful tool for forecasting this epidemic in Isfahan. A combination of effective Social Determinant of Health (SDH) and the occurrences of COVID-19 data are used as spatiotemporal input by using time-series information from different locations. Different models are utilized, and the best performance is found to be for a tailored type of long short-term memory (LSTM). This new method incorporates the mutual effect of all classes (confirmed/ death / recovered) in the prediction process. The future trajectory of the outbreak in Isfahan is forecasted with the proposed model. The paper demonstrates the positive effect of adding SDHs in pandemic prediction. Furthermore, the effectiveness of different SDHs is discussed, and the most effective terms are introduced. The method expresses high ability in both short- and long- term forecasting of the outbreak. The model proves that in predicting one class (like the number of confirmed cases), the effect of other accompanying numbers (like death and recovered cases) cannot be ignored. In conclusion, the superiorities of this model (particularity the long term predication ability) turn it into a reliable tool for helping the health decision-makers.Entities:
Keywords: COVID-19; Deep learning; Isfahan; Predication
Year: 2020 PMID: 33041534 PMCID: PMC7534756 DOI: 10.1016/j.chaos.2020.110339
Source DB: PubMed Journal: Chaos Solitons Fractals ISSN: 0960-0779 Impact factor: 5.944
Fig. 1Spread of confirmed cases in Isfahan.
Description of data types.
| Order | variable | Location | Source | Description |
|---|---|---|---|---|
| 1 | Occurrences of COVID-19 Data | Selected countries (Global data) | John Hopkins University | Daily number of Confirmed/ Death / Recovered people |
| Isfahan Province (Local data) | Isfahan University of Medical Sciences | |||
| 2 | Social Determinants of Health (SDH) | Selected countries (Global data) | Population, Yearly Change, Net Change, Density, Land Area (Km²), Migrants (net), Fertility, Age, Urban_percentage, World Share, Quarantine, Schools, Restrictions, Hospital Bed, sex0, sex14, sex25, sex54, sex65plus, Sex Ratio, GDP, Smoking, day_from_jan_first | |
| Isfahan Province (Local data) | Isfahan University of Medical Sciences | |||
| 3 | Occurrences of SARS Data | Selected countries (Global data) | Daily number of Confirmed/ Death / Recovered people |
Population (2020), Yearly Change (yearly population change), Net Change (net change of the population), Density (P/Km²), Land Area (Km²), Migrants (net migrants of the countries), Fertility (fertility or the growth rate), Age (median lifespan), Urban percentage (urban population), World Share (the population contributed to the world's share), The date for Quarantine, Schools, Restrictions, Hospital Bed (per 1000 people), sex0 (age from 0 to 14years), sex14 (age from 14 to 54 years), sex54 (age from 54 to 65 years), sex65plus (age upper 65 years), sex Ratio, GDP (Gross Domestic Product), Smoking (Percent of smoker population, 2016).
Fig. 2COVID-19 disease prediction models.
Fig. 3The overall structure of a neuron and a MLP model.
Fig. 4The overall structure of a 1D CNN.
The best selected architecture for each model.
| Model | Layers | Filters | Batch normalization/ dropout | Activation function |
|---|---|---|---|---|
| RF | — | n_estimators = 300 | — | — |
| XGBoost | — | n_estimators = 300 | — | — |
| LGBM | 1 LGBM Regressor | n_estimators=200 | — | — |
| MLP | 6 | 128, 128, 256, 256, 256, 1 | — | relu |
| 1D CNN | 4CNN +1 Flatten + 2 Dense | 32, 128, 128, 128, 50, 1 | — | relu |
| stacked LSTM | 2 LSTM+ 2 Dense | 64,32,32,1 | — | relu |
| Multiple Input Series | 2 LSTM+ 2 Dense | 64,32,32,1 | — | relu |
| Multiple Parallel Series | 2 LSTM+ 2 Dense | 64,32,32,3 | — | relu |
Results of different models for the confirmed group in Global Data.
| Model | Adding SDH featuresMAPE% | Only LagMAPE% |
|---|---|---|
| RF | 4.78 | 4.42 |
| XGBoost | 5 | 7.8 |
| LGBM | 6.49 | 9.68 |
| MLP | 1.19 | 1.7 |
| 1D CNN | 1.27 | 2.25 |
| LSTM | 1.36 | 2.41 |
| Multiple Input Series | 1.23 | 1.88 |
| Multiple Parallel Series | 1.12 | 1.76 |
Fig. 5Effectiveness of SDHs. (a) Heatmap by correlation method. (b) MAPE values by CAM method, (c) Heatmap by CAM method.
Fig. 6(a) Cumulative forecasting of confirmed, death, and recovered values for local data using the proposed method, (b) daily values of predicted confirmed cases, (c) long term prediction of the model.