| Literature DB >> 33316596 |
Abstract
COVID-19 pandemic had expanded to the US since early 2020 and has caused nationwide economic loss and public health crisis. Until now, although the US has the most confirmed cases in the world and are still experiencing an increasing pandemic, several states insisted to re-open business activities and colleges while announced strict control measures. To provide a quantitative reference for official strategies, predicting the near future trend based on finer spatial resolution data and presumed scenarios are urgently needed. In this study, the first attempted COVID-19 case predicting model based on county-level demographic, environmental, and mobility data was constructed with multiple machine learning techniques and a hybrid framework. Different scenarios were also applied to selected metropolitan counties including New York City, Cook County in Illinois, Los Angeles County in California, and Miami-Dade County in Florida to assess the impact from lockdown, Phase I, and Phase III re-opening. Our results showed that, for selected counties, the mobility decreased substantially after the lockdown but kept increasing with an apparent weekly pattern, and the weekly pattern of mobility and infections implied high infections during the weekend. Meanwhile, our model was successfully built up, and the scenario assessment results indicated that, compared with Phase I re-opening, a 1-week and a 2-week lockdown could reduce 4%-29% and 15%-55% infections, respectively, in the future week, while 2-week Phase III re-opening could increase 16%-80% infections. We concluded that the mandatory orders in metropolitan counties such lockdown should last longer than one week, the effect could be observed. The impact of lockdown or re-opening was also county-dependent and varied with the local pandemic. In future works, we expect to involve a longer period of data, consider more county-dependent factors, and employ more sophisticated techniques to decrease the modeling uncertainty and apply it to counties nationally and other countries.Entities:
Keywords: County-level; Forecasting; Lockdown; Pandemic; Re-opening
Mesh:
Year: 2020 PMID: 33316596 PMCID: PMC7837279 DOI: 10.1016/j.scitotenv.2020.144151
Source DB: PubMed Journal: Sci Total Environ ISSN: 0048-9697 Impact factor: 7.963
Selected variables, data sources, periods and types used in this study.
| Variable | Source | Data period | Data type |
|---|---|---|---|
| Cumulative cases | New York Times | 2020/01/01–2020/05/31 | Continuous |
| Daily increased cases | New York Times | 2020/01/01–2020/05/31 | Continuous |
| Population density | USDA ERA | 2019 | Continuous |
| Labor force rate | USDA ERA | 2019 | Continuous |
| Unemployment rate | USDA ERA | 2019 | Continuous |
| Household median income | USDA ERA | 2019 | Continuous |
| Metro system or not | USDA ERA | 2013 | Categorical |
| Maximum and minimum temperature | gridMET | 2020/01/01–2020/05/31 | Continuous |
| Maximum and minimum relative humidity | gridMET | 2020/01/01–2020/05/31 | Continuous |
| Precipitation | gridMET | 2020/01/01–2020/05/31 | Continuous |
| Surface downwelling solar radiation | gridMET | 2020/01/01–2020/05/31 | Continuous |
| Wind speed | gridMET | 2020/01/01–2020/05/31 | Continuous |
| Retail and recreation percent change from baseline | Google | 2020/02/15–2020/05/31 | Continuous |
| Grocery and pharmacy percent change from baseline | Google | 2020/02/15–2020/05/31 | Continuous |
| Parks percent change from baseline | Google | 2020/02/15–2020/05/31 | Continuous |
| Transit stations percent change from baseline | Google | 2020/02/15–2020/05/31 | Continuous |
| Workplaces percent change from baseline | Google | 2020/02/15–2020/05/31 | Continuous |
| Residential percent change from baseline | Google | 2020/02/15–2020/05/31 | Continuous |
| Mobility and Engagement Index (MEI) | Federal Reserve Bank of Dallas | 2020/01/03–2020/05/31 | Continuous |
| Statewide stay-at-home order or not | New York Times | 2020/01/01–2020/05/31 | Categorical |
| Weekday | – | 2020/01/01–2020/05/31 | Categorical |
| Weekend or not | – | 2020/01/01–2020/05/31 | Categorical |
New York Times, https://github.com/jeffcore/covid-19-usa-by-state/tree/d2fa4b2596889bac4687cdabb97a3967eb541392.
United States Department of Agriculture, Economic Research Service, https://www.ers.usda.gov/data-products/county-level-data-sets/.
gridMET (Abatzoglou, 2013), https://github.com/jbayham/gridMETr.
Google, COVID-19 Community Mobility, https://www.google.com/covid19/mobility/.
Federal Reserve Bank of Dallas, https://www.dallasfed.org/research/mei.
New York Times, https://www.nytimes.com/interactive/2020/us/states-reopen-map-coronavirus.html.
Fig. 1Predicting work flow based on ensemble deep learning technique.
Fig. 2Temporal variation of averaged daily incidence rate (per 100,000) and Mobility and Engagement Index (MEI) from February 15 to June 12 for selected metropolitan counties (n = 172).
Fig. 3Averages of community mobility change for selected counties (n = 172) and level-1 metropolitan counties during lockdown.
Modeling performance of basic learners and GLM-hybrid results.
| Index | Model | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Training samples | Test samples | GLM hybrid | ||||||||||||||||
| EN | PCR | PLSR | KNN | RT | RF | GBM | ANN | EN | PCR | PLSR | KNN | RT | RF | GBM | ANN | |||
| R2 | N1D | 0.89 | 0.81 | 0.89 | 0.81 | 0.86 | 0.89 | 0.87 | 0.89 | 0.89 | 0.85 | 0.89 | 0.86 | 0.85 | 0.88 | 0.89 | 0.89 | 0.91 |
| N4D | 0.94 | 0.84 | 0.93 | 0.87 | 0.91 | 0.94 | 0.96 | 0.93 | 0.92 | 0.87 | 0.92 | 0.90 | 0.89 | 0.94 | 0.95 | 0.94 | 0.96 | |
| N7D | 0.93 | 0.84 | 0.93 | 0.87 | 0.94 | 0.95 | 0.97 | 0.96 | 0.91 | 0.86 | 0.91 | 0.89 | 0.89 | 0.93 | 0.95 | 0.94 | 0.97 | |
| RMSE | N1D | 97 | 125 | 98 | 122 | 116 | 107 | 101 | 96 | 110 | 128 | 111 | 122 | 129 | 122 | 108 | 110 | 105 |
| N4D | 80 | 109 | 79 | 99 | 90 | 83 | 60 | 70 | 88 | 113 | 90 | 101 | 108 | 90 | 72 | 78 | 66 | |
| N7D | 74 | 106 | 73 | 95 | 71 | 76 | 52 | 59 | 95 | 118 | 96 | 105 | 106 | 93 | 70 | 79 | 62 | |
| MAE | N1D | 38 | 60 | 39 | 56 | 42 | 40 | 37 | 38 | 39 | 59 | 39 | 56 | 41 | 40 | 38 | 41 | 39 |
| N4D | 32 | 55 | 33 | 48 | 29 | 28 | 25 | 29 | 33 | 55 | 34 | 49 | 31 | 28 | 26 | 30 | 28 | |
| N7D | 32 | 54 | 32 | 46 | 26 | 24 | 22 | 27 | 34 | 56 | 34 | 48 | 28 | 25 | 23 | 29 | 27 | |
| R2 | N1D | 1.00 | 0.93 | 1.00 | 0.94 | 0.99 | 0.99 | 1.00 | 1.00 | 1.00 | 0.95 | 1.00 | 0.95 | 0.99 | 0.99 | 1.00 | 1.00 | 1.00 |
| N4D | 1.00 | 0.94 | 1.00 | 0.94 | 0.98 | 0.98 | 1.00 | 1.00 | 1.00 | 0.95 | 1.00 | 0.95 | 0.99 | 0.99 | 1.00 | 1.00 | 1.00 | |
| N7D | 1.00 | 0.93 | 0.99 | 0.92 | 0.99 | 0.99 | 1.00 | 1.00 | 1.00 | 0.95 | 0.99 | 0.95 | 0.99 | 0.99 | 1.00 | 1.00 | 1.00 | |
| RMSE | N1D | 407 | 3160 | 534 | 3107 | 1153 | 2240 | 300 | 280 | 447 | 3091 | 541 | 3112 | 1196 | 2672 | 276 | 261 | 189 |
| N4D | 538 | 3194 | 834 | 3172 | 1481 | 2204 | 339 | 514 | 556 | 3091 | 860 | 3203 | 1302 | 2591 | 263 | 499 | 187 | |
| N7D | 544 | 3251 | 1140 | 3309 | 1497 | 2013 | 403 | 768 | 461 | 3126 | 1183 | 3297 | 1742 | 2611 | 332 | 606 | 258 | |
| MAE | N1D | 155 | 1894 | 222 | 1699 | 260 | 605 | 77 | 140 | 158 | 1858 | 219 | 1686 | 268 | 617 | 75 | 133 | 103 |
| N4D | 227 | 1933 | 350 | 1763 | 323 | 601 | 110 | 238 | 226 | 1876 | 346 | 1746 | 310 | 634 | 106 | 236 | 100 | |
| N7D | 221 | 1956 | 477 | 1838 | 358 | 600 | 149 | 341 | 181 | 1900 | 474 | 1806 | 383 | 654 | 145 | 265 | 118 | |
EN: elastic net model; PCR: principal components regression model; PLSR: partial least squares regression model; KNN: k-nearest neighbors regression model; RT: regression tree model; RF: random forests model; GBM: gradient boosted tree models; ANN: 2-layer artificial neural network model; GLM: general linear model.
RMSE: root mean square error; MAE: mean absolute error; N1D: future-1-day average; N4D: future-4-day average; N7D: future-7-day average.
Fig. 4Reported cases and predicted cases from June 2 with 95% confidence intervals based on June 1 and historic data for New York City and other top 12 counties with the most cumulative cases until May 31.
Fig. 5Modeled next-7-day (N7D) results under Phase I re-opening, lockdown, and Phase III re-opening lasting 1 week and 2 weeks for selected level-1 metropolitan counties.