| Literature DB >> 33907761 |
Yanli Zhang-James, Jonathan Hess, Asif Salekin, Dongliang Wang, Samuel Chen, Peter Winkelstein, Christopher P Morley, Stephen V Faraone.
Abstract
The global pandemic of coronavirus disease 2019 (COVID-19) has killed almost two million people worldwide and over 400 thousand in the United States (US). As the pandemic evolves, informed policy-making and strategic resource allocation relies on accurate forecasts. To predict the spread of the virus within US counties, we curated an array of county-level demographic and COVID-19-relevant health risk factors. In combination with the county-level case and death numbers curated by John Hopkins university, we developed a forecasting model using deep learning (DL). We implemented an autoencoder-based Seq2Seq model with gated recurrent units (GRUs) in the deep recurrent layers. We trained the model to predict future incident cases, deaths and the reproductive number, R . For most counties, it makes accurate predictions of new incident cases, deaths and R values, up to 30 days in the future. Our framework can also be used to predict other targets that are useful indices for policymaking, for example hospitalization or the occupancy of intensive care units. Our DL framework is publicly available on GitHub and can be adapted for other indices of the COVID-19 spread. We hope that our forecasts and model can help local governments in the continued fight against COVID-19.Entities:
Year: 2021 PMID: 33907761 PMCID: PMC8077584 DOI: 10.1101/2021.04.14.21255507
Source DB: PubMed Journal: medRxiv
Figure 1.Sequence-to-sequence learning framework1
Figure 2.Test set results of cases and deaths.
Figure 3.A. Comparison of ARE distribution of our model with nice additional models that reported to CDC, including the ensemble model. B. Median AREs for each team/model were plotted over four-week ahead forecasts.
*For model names and team information, see CDC’s COVID-19 forecast model descriptions: https://github.com/cdcepi/COVID-19-Forecasts/blob/master/COVID-19_Forecast_Model_Descriptions.md
Comparisons of county-level case forecast from different teams and models. Total numbers of counties that are present in all models are listed for each week’s forecast. RE, median relative errors. ARE, median absolute relative error.
| −0.02 | 0.36 | p<.0001 | −0.15 | 0.44 | p<.0001 | −0.21 | 0.50 | p<.0001 | −0.23 | 0.55 | p<.0001 | |
| 0.02 | 0.24 | p=.070 | −0.13 | 0.31 | p<.0001 | −0.19 | 0.42 | p=.006 | −0.15 | 0.49 | p=.387 | |
| 0.17 | 0.34 | p<.0001 | −0.02 | 0.35 | p=.829 | −0.11 | 0.44 | p=.195 | −0.08 | 0.50 | p=.871 | |
| 0.01 | 0.39 | p<.0001 | −0.06 | 0.55 | p<.0001 | −0.06 | 0.66 | p<.0001 | 0.06 | 0.76 | p<.0001 | |
| −0.38 | 0.40 | p<.0001 | −0.45 | 0.47 | p<.0001 | −0.46 | 0.50 | p<.0001 | −0.47 | 0.53 | p=.008 | |
| 0.06 | 0.30 | p<.0001 | 0.00 | 0.41 | p<.0001 | −0.05 | 0.48 | p=.066 | −0.08 | 0.55 | p<.0001 | |
| 0.05 | 0.22 | p<.0001 | −0.05 | 0.28 | p<.0001 | −0.04 | 0.38 | p<.0001 | 0.04 | 0.45 | p=.001 | |
| 0.06 | 0.25 | p=.317 | −0.11 | 0.32 | p=.003 | −0.18 | 0.44 | p=.417 | −0.15 | 0.51 | p=.552 | |
| −0.09 | 0.25 | NA | −0.21 | 0.35 | NA | −0.21 | 0.45 | NA | −0.11 | 0.50 | NA | |
| 0.07 | 0.31 | p<.0001 | −0.10 | 0.40 | p<.0001 | −0.14 | 0.50 | p<.0001 | −0.10 | 0.58 | p<.0001 | |
| −0.17 | 0.31 | p<.0001 | −0.27 | 0.41 | p<.0001 | −0.25 | 0.46 | p=.019 | −0.13 | 0.48 | p<.0001 | |
| −0.18 | 0.26 | p<.0001 | −0.24 | 0.36 | p=.282 | −0.21 | 0.42 | p=.319 | −0.05 | 0.48 | p<.0001 | |
| 0.00 | 0.29 | p<.0001 | −0.07 | 0.35 | p=.788 | −0.04 | 0.42 | p=.536 | 0.13 | 0.50 | p=.067 | |
| −0.19 | 0.36 | p<.0001 | −0.25 | 0.47 | p<.0001 | −0.28 | 0.56 | p<.0001 | −0.15 | 0.64 | p<.0001 | |
| −0.49 | 0.50 | p<.0001 | −0.50 | 0.51 | p<.0001 | −0.47 | 0.51 | p<.0001 | −0.37 | 0.50 | p=.013 | |
| −0.22 | 0.33 | p<.0001 | −0.26 | 0.41 | p<.0001 | −0.22 | 0.48 | p<.0001 | −0.09 | 0.52 | p=.914 | |
| −0.25 | 0.28 | p<.0001 | −0.30 | 0.37 | p=.053 | −0.23 | 0.40 | p=.005 | −0.04 | 0.45 | p<.0001 | |
| −0.16 | 0.27 | p<.0001 | −0.22 | 0.38 | p=.004 | −0.18 | 0.46 | p=.017 | −0.02 | 0.53 | p=.450 | |
| −0.12 | 0.20 | NA | −0.14 | 0.35 | NA | −0.11 | 0.43 | NA | 0.04 | 0.52 | NA | |
| 0.00 | 0.33 | p<.0001 | −0.06 | 0.40 | p<.0001 | −0.05 | 0.51 | p<.0001 | 0.10 | 0.59 | p<.0001 | |
Interclass correlation coefficients (ICC(2,1)) between the predicated and actual cases for all models.
| 0.97(F=67.87, p<.0001) | 0.92(F=25.60, p<.0001) | 0.82(F=10.24, p<.0001) | 0.71(F=5.95, p<.0001) | |
| 0.98(F=84.19, p<.0001) | 0.94(F=30.15, p<.0001) | 0.83(F=10.91, p<.0001) | 0.67(F=5.13, p<.0001) | |
| 0.93(F=29.80, p<.0001) | 0.80(F=9.25, p<.0001) | 0.60(F=4.01, p<.0001) | 0.42(F=2.47, p<.0001) | |
| 0.93(F=28.18, p<.0001) | 0.83(F=10.98, p<.0001) | 0.64(F=4.57, p<.0001) | 0.54(F=3.34, p<.0001) | |
| 0.95(F=43.42, p<.0001) | 0.90(F=20.69, p<.0001) | 0.80(F=9.14, p<.0001) | 0.65(F=4.75, p<.0001) | |
| 0.96(F=55.26, p<.0001) | 0.95(F=42.28, p<.0001) | 0.87(F=14.87, p<.0001) | 0.73(F=6.33, p<.0001) | |
| 0.98(F=83.62, p<.0001) | 0.96(F=45.66, p<.0001) | 0.86(F=13.58, p<.0001) | 0.76(F=7.27, p<.0001) | |
| 0.97(F=64.10, p<.0001) | 0.90(F=18.33, p<.0001) | 0.74(F=6.63, p<.0001) | 0.59(F=3.91, p<.0001) | |
| 0.97(F=65.34, p<.0001) | 0.87(F=15.22, p<.0001) | 0.71(F=5.84, p<.0001) | 0.56(F=3.58, p<.0001) | |
| 0.93(F=30.08, p<.0001) | 0.90(F=18.10, p<.0001) | 0.78(F=8.21, p<.0001) | 0.61(F=4.17, p<.0001) | |
| 0.92(F=24.08, p<.0001) | 0.73(F=6.60, p<.0001) | 0.55(F=3.49, p<.0001) | 0.55(F=3.43, p<.0001) | |
| 0.94(F=31.66, p<.0001) | 0.81(F=9.53, p<.0001) | 0.72(F=6.21, p<.0001) | 0.74(F=6.62, p<.0001) | |
| 0.92(F=25.39, p<.0001) | 0.76(F=7.22, p<.0001) | 0.58(F=3.77, p<.0001) | 0.56(F=3.52, p<.0001) | |
| 0.96(F=51.02, p<.0001) | 0.90(F=18.13, p<.0001) | 0.84(F=11.59, p<.0001) | 0.82(F=10.27, p<.0001) | |
| 0.87(F=14.63, p<.0001) | 0.77(F=7.69, p<.0001) | 0.67(F=5.11, p<.0001) | 0.72(F=6.18, p<.0001) | |
| 0.96(F=47.14, p<.0001) | 0.92(F=25.00, p<.0001) | 0.84(F=11.65, p<.0001) | 0.81(F=9.47, p<.0001) | |
| 0.93(F=30.31, p<.0001) | 0.79(F=8.77, p<.0001) | 0.66(F=4.85, p<.0001) | 0.69(F=5.51, p<.0001) | |
| 0.93(F=28.56, p<.0001) | 0.79(F=8.59, p<.0001) | 0.64(F=4.57, p<.0001) | 0.64(F=4.61, p<.0001) | |
| 0.96(F=53.56, p<.0001) | 0.79(F=8.78, p<.0001) | 0.58(F=3.74, p<.0001) | 0.50(F=3.02, p<.0001) | |
| 0.84(F=11.33, p<.0001) | 0.80(F=9.06, p<.0001) | 0.81(F=9.56, p<.0001) | 0.87(F=14.62, p<.0001) | |
The ICC(2,1) reported in this table are only for counties that are present in all team/models at each forecast, as those in Table 1.