| Literature DB >> 34291025 |
Shi Chen1,2, Rajib Paul1,2, Daniel Janies3, Keith Murphy1, Tinghao Feng4, Jean-Claude Thill2,5.
Abstract
Background: Mathematical models are powerful tools to study COVID-19. However, one fundamental challenge in current modeling approaches is the lack of accurate and comprehensive data. Complex epidemiological systems such as COVID-19 are especially challenging to the commonly used mechanistic model when our understanding of this pandemic rapidly refreshes. Objective: We aim to develop a data-driven workflow to extract, process, and develop deep learning (DL) methods to model the COVID-19 epidemic. We provide an alternative modeling approach to complement the current mechanistic modeling paradigm. Method: We extensively searched, extracted, and annotated relevant datasets from over 60 official press releases in Hubei, China, in 2020. Multivariate long short-term memory (LSTM) models were developed with different architectures to track and predict multivariate COVID-19 time series for 1, 2, and 3 days ahead. As a comparison, univariate LSTMs were also developed to track new cases, total cases, and new deaths.Entities:
Keywords: COVID-19; deep learning; epidemic; modeling; multivariate
Mesh:
Year: 2021 PMID: 34291025 PMCID: PMC8287417 DOI: 10.3389/fpubh.2021.661615
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1Schematic data mining and multivariate deep learning (long short-term memory LSTM) workflow for COVID-19 modeling.
Figure 2Comparison of model loss (multivariate E1D1 vs. E2D2).
Multivariate and Univariate Model Performance Comparison Between Different LSTM Architectures based on Mean Absolute Error (MAE).
| New cases | 919 (6.19) | 470 (3.17) | 1,383 (9.32) | 687 (4.63) | 1,168 (7.87) | 1,039 (7.00) |
| Total cases | 842 (1.24) | 3,555 (5.22) | 5,905 (8.67) | 2,686 (3.94) | 3,914 (5.74) | 7,866 (11.54) |
| New deaths | 11 (4.55) | 8 (3.31) | 18 (7.44) | 0 (0.00) | 15 (6.20) | 20 (8.26) |
| New discharges | 594 (23.11) | 320 (12.45) | 184 (7.16) | 416 (16.19) | 302 (11.75) | 255 (9.92) |
| Hospital severe | 670 (7.21) | 215 (2.31) | 84 (0.90) | 669 (7.20) | 299 (3.22) | 144 (1.55) |
| Hospital critical | 130 (5.22) | 58 (2.33) | 55 (2.21) | 209 (8.39) | 78 (3.13) | 18 (0.72) |
| Total deaths | 1,579 (35.00) | 1,144 (25.35) | 898 (19.90) | 1,431 (31.72) | 1,084 (24.02) | 873 (19.35) |
| Total discharges | 689 (1.07) | 626 (0.97) | 27 (0.04) | 2,304 (3.57) | 1,526 (2.37) | 3,611 (5.60) |
| Total tracked | 17,487 (6.17) | 5,669 (2.00) | 20,359 (7.19) | 398 (0.14) | 17,917 (6.33) | 25,764 (9.10) |
| Total monitored | 2,213 (2.85) | 2,889 (3.72) | 2,486 (3.20) | 1,930 (2.48) | 4,426 (5.70) | 5,460 (7.03) |
| New cases univariate | 1,235 (8.32) | 63 (0.42) | 394 (2.65) | 877 (5.91) | 190 (1.28) | 60 (0.40) |
| Total cases univariate | 5,720 (8.40) | 705 (1.03) | 3,640 (5.34) | 2,814 (4.13) | 1,042 (1.32) | 1,920 (2.82) |
| New deaths univariate | 21 (8.68) | 2 (0.83) | 5 (2.07) | 14 (5.79) | 5 (2.07) | 4 (1.65) |
The first 10 variables were in the same multivariate LSTM model. The last three univariate LSTMs were different models based on their respective univariate inputs. Numbers in parentheses represented relative values as percentage.