| Literature DB >> 33994895 |
Novanto Yudistira1, Sutiman Bambang Sumitro2, Alberth Nahas3,4, Nelly Florida Riama4.
Abstract
Determinant factors which contribute to the prediction should take into account multivariate analysis for capturing coarse-to-fine contextual information. From the preliminary descriptive analysis, it shows that environmental factor such as UV (ultraviolet) is one of the essential factors that should be considered to observe the COVID-19 epidemic drivers. Moreover, there are education, government, morphological, health, economic, and behavioral factors contributing to the growth of COVID-19. Besides descriptive analysis, in this research, multivariate analysis is considered to provide comprehensive explanations about factors contributing to pandemic dynamics. To achieve rich explanations, visual attribution of explainable Convolution-LSTM is utilized to see high contributing factors responsible for the growth of daily COVID-19 cases. Our model consists of 1 D CNN in the first layer to capture local relationships among variables followed by LSTM layers to capture local dependencies over time. It produces the lowest prediction errors compared to the other existing models. This permits us to employ gradient-based visual attribution for generating saliency maps for each time dimension and variable. These are then used for explaining which variables throughout which period of the interval is contributing for a given time-series prediction, likewise as explaining that during that time intervals were the joint contribution of most vital variables for that prediction. The explanations are useful for stakeholders to make decisions during and post pandemics. The explainable Convolution-LSTMcode is available here: https://github.com/cbasemaster/time-series-attribution.Entities:
Keywords: COVID-19; LSTM prediction; Multivariate; Visual explanation
Year: 2021 PMID: 33994895 PMCID: PMC8103767 DOI: 10.1016/j.asoc.2021.107469
Source DB: PubMed Journal: Appl Soft Comput ISSN: 1568-4946 Impact factor: 6.725
59 factors used in this research ranging from environmental, government, economical, to behavioral factors during 2020-03-22 to 2020-09-11.
| Number | Factor | Category | Value |
|---|---|---|---|
| 1 | retail_and_recreation_percent_change_from_baseline | Behavioral/Mobility | Percentage |
| 2 | grocery_and_pharmacy_percent_change_from_baseline | Behavioral/Mobility | Percentage |
| 3 | parks_percent_change_from_baseline | Behavioral/Mobility | Percentage |
| 4 | transit_stations_percent_change_from_baseline | Behavioral/Mobility | Percentage |
| 5 | workplaces_percent_change_from_baseline | Behavioral/Mobility | Percentage |
| 6 | residential_percent_change_from_baseline | Behavioral/Mobility | Percentage |
| 7 | total_cases | COVID-19 | Number of people |
| 8 | new_cases | COVID-19 | Number of people |
| 9 | new_cases_smoothed | COVID-19 | Number of people smoothed |
| 10 | total_deaths | COVID-19 | Number of people |
| 11 | new_deaths | COVID-19 | Number of people |
| 12 | new_deaths_smoothed | COVID-19 | Number of people smoothed |
| 13 | total_cases_per_million | COVID-19 | Number of people per million |
| 14 | new_cases_per_million | COVID-19 | Number of people per million |
| 15 | new_cases_smoothed_per_million | COVID-19 | Number of people smoothed per million |
| 16 | total_deaths_per_million | COVID-19 | Number of people per million |
| 17 | new_deaths_per_million | COVID-19 | Number of people per million |
| 18 | new_deaths_smoothed_per_million | COVID-19 | Number of people per thousand per million |
| 19 | new_tests | COVID-19 | Number of people |
| 20 | total_tests | COVID-19 | Number of people |
| 21 | total_tests_per_thousand | COVID-19 | Number of people per thousand |
| 22 | new_tests_per_thousand | COVID-19 | Number of people per thousand |
| 23 | new_tests_smoothed | COVID-19 | Number of people smoothed |
| 24 | new_tests_smoothed_per_thousand | COVID-19 | Number of people per thousand |
| 25 | tests_per_case | COVID-19 | Percentage |
| 26 | positive_rate | COVID-19 | Percentage |
| 27 | tests_units | COVID-19 | People tested or not |
| 28 | stringency_index | Government | 0–100 |
| 29 | Population | Demography | Number of people within country |
| 30 | population_density | Demography | Number of people within country per km 2 |
| 31 | median_age | Demography | Age |
| 32 | aged_65_older | Demography | Percentage |
| 33 | aged_70_older | Demography | Percentage |
| 34 | gdp_per_capita | Economic | Gross domestic product per capita (Purchasing Power Parity) |
| 35 | extreme_poverty | Economic | Percentage |
| 36 | cardiovascular_death_rate | Health | The annual number of deaths from cardiovascular diseases per 100000 people |
| 37 | diabetes_prevalence | Health | Percentage |
| 38 | female_smokers | Health | Percentage |
| 39 | male_smokers | Health | Percentage |
| 40 | handwashing_facilities | Health Facilities | Number of hand washing facilities |
| 41 | hospital_beds_per_thousand | Health Facilities | Number of hospital beds per thousand |
| 42 | life_expectancy | Health | Age |
| 43 | human_development_index | Education | 0–1 |
| 44 | UVIEF (cloud-free UV index) | Environmental | 0–17 |
| 45 | UVIEFerr (cloud-free erythemal UV index smoothed) | Environmental | kJ/m2 |
| 46 | UVDEF (cloud-free erythemal UV dose) | Environmental | kJ/m2 |
| 47 | UVDEFerr (cloud-free erythemal UV dose smoothed) | Environmental | kJ/m2 |
| 48 | UVDEC (cloud-modified erythemal UV dose) | Environmental | kJ/m2 |
| 49 | UVDECerr (cloud-modified Vitamin-D UV dose smoothed) | Environmental | kJ/m2 |
| 50 | UVDVF (cloud-free vitamin-D UV dose) | Environmental | kJ/m2 |
| 51 | UVDVFerr (cloud-free vitamin-D UV dose smoothed) | Environmental | kJ/m2 |
| 52 | UVDVC (cloud-modified vitamin-D UV dose) | Environmental | kJ/m2 |
| 53 | UVDVCerr (cloud-modified vitamin-D UV dose smoothed) | Environmental | kJ/m2 |
| 54 | UVDDF (cloud-free dna-damage) | Environmental | kJ/m2 |
| 55 | UVDDFerr (cloud-free dna-damage smoothed) | Environmental | kJ/m2 |
| 56 | UVDDC (cloud-modified dna-damage) | Environmental | kJ/m2 |
| 57 | UVDDCerr (cloud-modified dna-damage smoothed) | Environmental | kJ/m2 |
| 58 | CMF (average cloud modification factor) | Environmental | – |
| 59 | Ozone (local solar noon ozone column) | Environmental | DU (Dobson Unit) |
Fig. 1(a) The growth of cumulative confirmed cases in northern subtropical (blue), tropical (green), and southern subtropical (red) countries (b) The growth of cumulative recovered cases in northern subtropical (blue), tropical (green), and southern subtropical (red) countries (c) The growth of cumulative death cases in northern subtropical (blue), tropical (green), and southern subtropical (red) countries (d) Daily mean UV Index dynamics over time of northern subtropical (blue), tropical (green) and southern subtropical countries (red).
Fig. 2(a) Human mobility dynamics in Brazil (b) Weekly mean confirmed cases in Brazil (c) Human mobility dynamics in Malaysia (d) Weekly mean confirmed cases in Malaysia (e) Human mobility dynamics in Indonesia (f) Weekly mean confirmed cases in Indonesia.
Fig. 3Time lagged cross-correlation between weekly mean confirmed cases and human activities in Jakarta region.
Fig. 4Pearson correlation of weekly mean confirmed, recovered, death cases and weekly mean human activities of all countries with off set of −2.
Parameters of proposed Conv–LSTM.
| Layer | Value |
|---|---|
| Input layer | (length, features) |
| Convolution layer | 59 filters, size |
| LSTM layer 1 | (input, hidden unit) |
| LSTM layer 2 | (input, hidden unit) |
| FC layer | (input,output) |
| Activation function | Sigmoid |
| Output layer | (length, features) |
Fig. 5Convolution–LSTM architecture and its visual attribution via Gradcam.
Contribution of features to Conv–LSTM prediction in RMSE.
| Italy | Sweden | Indonesia | Norway | |
|---|---|---|---|---|
| Without environmental features | 0.001 | 0.017 | 0.026 | |
| Without mobility (behavioral) features | 0.002 | 0.009 | 0.028 | |
| All features | 0.009 | 0.008 |
Prediction accuracy (RMSE) of new cases of COVID-19 in Italy, Sweden, Indonesia, and Norway.
| Italy | Sweden | Indonesia | Norway | |
|---|---|---|---|---|
| 1D CNN 1 layer | 0.052 | 0.134 | 0.085 | 0.069 |
| LSTM 1 layer | 0.001 | 0.013 | 0.013 | 0.014 |
| Conv–LSTM |
Fig. 6Actual and reconstruction of Indonesia.
Fig. 7Actual and reconstruction of Norway.
Fig. 8Actual and reconstruction of Italy.
Fig. 9Actual and reconstruction of Sweden.
Fig. 10Time and feature attention corresponding to a prediction for new daily COVID-19 cases in Italy.
Fig. 11Temporal aggregation of attribution in Italy (x-axis is variables and -axis is aggregation score).
Fig. 12Time and feature attention corresponding to a prediction for new daily COVID-19 cases in Sweden.
Fig. 13Temporal aggregation of attribution in Sweden (x-axis is variables and -axis is aggregation score).
Fig. 14feature attention corresponding to a prediction for new daily COVID-19 cases in Norway.
Fig. 15Temporal aggregation of attribution in Italy (x-axis is variables and -axis is aggregation score).
Fig. 16Time and feature attention corresponding to a prediction for new daily COVID-19 cases in Indonesia.
Fig. 17Temporal aggregation of attribution in Indonesia (x-axis is variables and -axis is aggregation score).