| Literature DB >> 34226811 |
Jose Olmo1,2, Marcos Sanso-Navarro1.
Abstract
This paper proposes an ensemble predictor for the weekly increase in the number of confirmed COVID-19 cases in the city of New York at zip code level. Within a Bayesian model averaging framework, the baseline is a Poisson regression for count data. The set of covariates includes autoregressive terms, spatial effects, and demographic and socioeconomic variables. Our results for the second wave of the coronavirus pandemic show that these regressors are more significant to predict the number of new confirmed cases as the pandemic unfolds. Both pointwise and interval forecasts exhibit strong predictive ability in-sample and out-of-sample.Entities:
Keywords: Bayesian model averaging; COVID‐19; Poisson regression; prediction models; spatial effects
Year: 2021 PMID: 34226811 PMCID: PMC8242800 DOI: 10.1111/pirs.12615
Source DB: PubMed Journal: Pap Reg Sci ISSN: 1056-8190
FIGURE 1Weekly new confirmed COVID‐19 cases in New York City, 31 March 2020 – 2 February 2021
Weekly new confirmed COVID‐19 cases in modified zip code tabulation areas (ZCTAs, N=177): Descriptive statistics
| Period | Total | Mean | Standard deviation | Minimum | Maximum |
|---|---|---|---|---|---|
| 8 Sep–15 Sep | 1,992 | 11.254 | 10.251 | 0 | 66 |
| 29 Sep–6 Oct | 3,520 | 19.887 | 25.033 | 0 | 187 |
| 20 Oct–27 Oct | 3,891 | 21.983 | 16.340 | 0 | 79 |
| 10 Nov–17 Nov | 8,533 | 48.209 | 32.627 | 0 | 164 |
| 1 Dec–8 Dec | 10,277 | 58.062 | 63.942 | 0 | 400 |
| 22 Dec–29 Dec | 22,123 | 124.989 | 87.499 | 6 | 435 |
| 12 Jan–19 Jan | 33,392 | 188.655 | 126.137 | 3 | 624 |
Weekly new confirmed COVID‐19 cases and potential determinants: Description of variables and data sources
| Variable | Description | Sources |
|---|---|---|
| cases | Weekly new confirmed COVID‐19 cases | DOH |
| lcases | New confirmed cases during the previous week | DOH |
| lincid | Rate of new confirmed cases per 100,000 people during the previous week | DOH, ACS |
| lwcases | Average number of new confirmed cases during the previous week in contiguous ZCTAs | DOH |
| lwincid | Average rate of new confirmed cases per 100,000 people during the previous week in contiguous ZCTAs | DOH, ACS |
| popul | Total population | ACS |
| hhsize | Average household size | ACS |
| male | Percentage of male population | ACS |
| black | Black or African American, as per cent of total population | ACS |
| hispanic | Hispanic or Latino, as per cent of total population | ACS |
| age | Median age | ACS |
| over64 | Percentage of population 65 years and over | ACS |
| enroll | Percentage of population 3 years and over enrolled in school | ACS |
| income | Per capita income in the past 12 months | ACS |
| selfemp | Percentage of households with self‐employment income | ACS |
| empl | Percentage of civilian noninstitutionalized population 18 to 64 years in the labor force that are employed | ACS |
| fulltime | Percentage of civilian noninstitutionalized population 18 to 64 years that worked full‐time, year round | ACS |
| pubtrans | Percentage of workers 16 years and over that use public transportation (excl. taxicab) | ACS |
| travtime | Mean travel time to work, in minutes | ACS |
| retail | Percentage of workers 16 years and over in retail trade | ACS |
| transp | Percentage of workers 16 years and over in transportation and warehousing, and utilities | ACS |
| eduheal | Percentage of workers 16 years and over in educational services, and health care and social assistance | ACS |
| entert | Percentage of workers 16 years and over in arts, entertainment, and recreation, and accommodation and food services | ACS |
| occup | Percentage of housing units that are occupied | ACS |
| rooms | Median number of rooms | ACS |
| internet | Percentage of households with an Internet subscription | ACS |
| nocov | Percentage of civilian noninstitutionalized population 19 to 64 years with no health insurance coverage | ACS |
Note: Data sources are the American Community Survey, 5‐year estimates 2014‐2018 (ACS), and the New York City Department of Health and Mental Hygiene (DOH).
Poisson regression: Weekly new confirmed COVID‐19 cases
| 8 Sep–15 Sep | 29 Sep–6 Oct | 20 Oct–27 Oct | 10 Nov–17 Nov | 1 Dec–8 Dec | 22 Dec–29 Dec | 12 Jan–19 Jan | |
|---|---|---|---|---|---|---|---|
| lcases | 0.033*** | ‐0.005* | 0.003 | 0.005*** | 0.005*** | 0.003*** | 3.46e‐04* |
| (0.007) | (0.003) | (0.003) | (0.001) | (5.02e‐04) | (2.91e‐04) | (1.76e‐04) | |
| lincid | ‐0.004 | 0.014*** | 0.002 | 0.001*** | 0.001*** | ‐3.74e‐05 | 0.001*** |
| (0.004) | (0.002) | (0.002) | (3.54e‐04) | (02.62e‐04) | (1.50e‐04) | (1.18e‐04) | |
| lwcases | ‐0.009 | 0.002 | 0.008*** | ‐0.002** | 0.003*** | 0.001*** | 3.78e‐04*** |
| (0.008) | (0.004) | (0.002) | (0.001) | (4.43e‐04) | (1.85e‐04) | (1.03e‐04) | |
| lwincid | 0.021*** | 0.001 | ‐0.007** | 0.005*** | 4.09e‐04 | 4.63e‐04*** | 2.66e‐04** |
| (0.006) | (0.004) | (0.003) | (0.001) | (2.86e‐04) | (1.43e‐04) | (1.06e‐04) | |
| popul | 1.49e‐05*** | 1.38e‐05*** | 1.75e‐05*** | 1.15e‐05*** | 9.91e‐06*** | 7.96e‐06*** | 1.18e‐05*** |
| (3.03e‐06) | (2.44e‐06) | (2.28e‐06) | (1.54e‐06) | (1.40e‐06) | (9.27e‐07) | (9.51e‐07) | |
| hhsize | ‐0.839*** | ‐0.176 | ‐0.322*** | ‐0.279*** | ‐0.583*** | ‐0.066 | ‐0.096*** |
| (0.148) | (0.114) | (0.104) | (0.074) | (0.068) | (0.044) | (0.035) | |
| male | ‐0.041** | ‐0.014 | 0.018* | ‐0.012 | 0.035*** | ‐0.005 | ‐0.002 |
| (0.017) | (0.013) | (0.011) | (0.008) | (0.007) | (0.005) | (0.004) | |
| black | ‐0.003 | ‐0.006*** | ‐0.008*** | ‐0.006*** | ‐0.015*** | ‐0.005** | ‐0.001** |
| (0.002) | (0.002) | (0.002) | (0.001) | (0.001) | (0.001) | (0.001) | |
| hispanic | 0.006** | ‐0.001 | 0.001 | 0.004*** | ‐0.005*** | 0.001* | 2.40e‐04 |
| (0.002) | (0.002) | (0.002) | (0.001) | (0.001) | (0.001) | (5.49e‐04) | |
| age | ‐0.045*** | ‐0.064*** | ‐0.037*** | ‐0.023*** | ‐0.069*** | ‐0.017*** | ‐0.018*** |
| (0.014) | (0.011) | (0.010) | (0.007) | (0.006) | (0.004) | (0.003) | |
| over64 | ‐1.04e‐05 | 4.49e‐05*** | 2.56e‐05* | 2.75e‐05*** | ‐6.55e‐07 | 2.58e‐05*** | 3.69e‐05*** |
| (1.98e‐05) | (1.59e‐05) | (1.40e‐05) | (9.02e‐06) | (9.01e‐06) | (6.28e‐06) | (4.74e‐06) | |
| enroll | ‐0.002 | ‐0.016* | ‐0.008 | ‐0.007 | ‐0.065*** | ‐0.019*** | ‐0.004 |
| (0.012) | (0.010) | (0.008) | (0.006) | (0.006) | (0.004) | (0.003) | |
| income | ‐9.47e‐06*** | ‐5.95e‐06** | 2.41e‐06 | ‐1.06e‐06 | ‐2.23e‐06 | ‐2.24e‐06** | ‐1.05e‐06 |
| (3.31e‐06) | (2.62e‐06) | (2.19e‐06) | (1.51e‐06) | (1.51e‐06) | (1.01e‐06) | (8.28e‐07) | |
| selfemp | 0.045*** | 0.029*** | 0.010 | 0.034*** | ‐0.011** | 0.007** | 0.010*** |
| (0.011) | (0.009) | (0.008) | (0.005) | (0.005) | (0.003) | (0.003) | |
| empl | 0.026 | 0.027** | ‐0.007 | 0.032*** | ‐0.023*** | ‐0.001 | ‐0.004 |
| (0.016) | (0.013) | (0.012) | (0.008) | (0.007) | (0.005) | (0.0045) | |
| fulltime | 0.003 | ‐0.003 | ‐1.93e‐04 | 0.006 | ‐0.007* | 0.001 | 0.002 |
| (0.010) | (0.008) | (0.007) | (0.005) | (0.004) | (0.003) | (0.002) | |
| pubtrans | ‐0.004 | ‐0.006* | ‐0.006** | 3.46e‐04 | 0.022*** | ‐2.42e‐04 | ‐0.003*** |
| (0.004) | (0.003) | (0.003) | (0.002) | (0.002) | (0.001) | (0.001) | |
| travtime | ‐4.70e‐06 | 6.16e‐06 | ‐4.51e‐05 | 3.30e‐05 | ‐3.77e‐04*** | 1.79e‐05 | 6.99e‐05*** |
| (9.38e‐05) | (7.58e‐05) | (7.17e‐05) | (4.75e‐05) | (4.85e‐05) | (3.06e‐05) | (2.46e‐05) | |
| retail | 0.015 | 0.012 | 0.032** | 0.050*** | 0.018** | ‐0.001 | 0.003 |
| (0.018) | (0.015) | (0.013) | (0.009) | (0.008) | (0.006) | (0.004) | |
| transp | 0.056*** | 0.034** | 0.066*** | 0.041*** | 0.132*** | 0.034*** | 0.039*** |
| (0.017) | (0.014) | (0.012) | (0.008) | (0.008) | (0.005) | (0.004) | |
| eduheal | 0.018* | 0.031*** | 0.011 | 0.025*** | 0.044*** | 0.013*** | 0.006** |
| (0.010) | (0.008) | (0.007) | (0.005) | (0.004) | (0.003) | (0.002) | |
| entert | ‐0.016 | 0.036*** | 0.036*** | 0.023*** | ‐0.008 | 0.009** | 0.015*** |
| (0.015) | (0.012) | (0.010) | (0.007) | (0.006) | (0.004) | (0.003) | |
| occup | 0.005 | 0.002 | 0.002 | ‐0.005 | ‐0.009** | 0.004 | 0.015*** |
| (0.008) | (0.006) | (0.005) | (0.003) | (0.004) | (0.003) | (0.002) | |
| rooms | 0.203*** | 0.039 | 0.103* | 0.157*** | 0.602*** | 0.113*** | ‐0.023 |
| (0.073) | (0.060) | (0.053) | (0.036) | (0.035) | (0.023) | (0.019) | |
| internet | 0.009 | 0.004 | ‐0.001 | ‐0.009*** | ‐0.013*** | ‐0.010*** | ‐0.002 |
| (0.007) | (0.006) | (0.005) | (0.003) | (0.003) | (0.002) | (0.002) | |
| nocov | ‐0.003 | ‐0.021*** | ‐0.018*** | ‐0.006 | ‐0.016*** | ‐0.001 | ‐7.84e‐06 |
| (0.010) | (0.007) | (0.007) | (0.004) | (0.004) | (0.003) | (0.002) | |
| constant | 1.541 | 1.320 | 2.843** | ‐0.084 | 7.737*** | 4.516*** | 3.092*** |
| (1.950) | (1.602) | (1.372) | (0.921) | (0.871) | (0.606) | (0.468) | |
| Log likelihood | ‐601.626 | ‐589.701 | ‐597.097 | ‐764.797 | ‐2,741.613 | ‐926.591 | ‐1,064.428 |
Note: The number of observations is 177. Standard errors in parentheses. *** p < 0.01, ** p < 0.05, * p < 0.10.
Bayesian model averaging: Posterior inclusion probabilities
| 8 Sep–15 Sep | 29 Sep–6 Oct | 20 Oct–27 Oct | 10 Nov–17 Nov | 1 Dec–8 Dec | 22 Dec–29 Dec | 12 Jan–19 Jan | |
|---|---|---|---|---|---|---|---|
| lcases | 0.274 | 1.000 | 0.064 | 0.090 | 1.000 | 0.055 | 0.067 |
| lincid | 0.999 | 1.000 | 0.993 | 0.980 | 0.103 | 1.000 | 1.000 |
| lwcases | 0.038 | 0.377 | 0.975 | 0.274 | 1.000 | 0.992 | 1.000 |
| lwincid | 0.040 | 0.244 | 0.809 | 1.000 | 1.000 | 1.000 | 0.543 |
| popul | 0.181 | 1.000 | 0.999 | 1.000 | 0.999 | 1.000 | 1.000 |
| hhsize | 0.158 | 0.053 | 0.084 | 0.141 | 0.105 | 0.055 | 0.292 |
| male | 0.066 | 0.043 | 0.959 | 0.128 | 0.053 | 0.244 | 0.162 |
| black | 0.990 | 1.000 | 0.638 | 0.467 | 0.808 | 1.000 | 0.982 |
| hispanic | 0.189 | 1.000 | 0.121 | 0.719 | 0.917 | 0.057 | 0.621 |
| age | 0.926 | 1.000 | 0.680 | 0.093 | 0.985 | 0.999 | 1.000 |
| over64 | 0.999 | 0.992 | 0.846 | 0.076 | 1.000 | 1.000 | 1.000 |
| enroll | 0.044 | 0.091 | 0.121 | 0.272 | 0.475 | 0.973 | 0.990 |
| income | 0.993 | 0.894 | 0.084 | 0.910 | 0.218 | 0.214 | 0.046 |
| selfemp | 0.151 | 0.080 | 0.332 | 0.719 | 0.440 | 0.113 | 0.094 |
| empl | 0.045 | 0.037 | 0.095 | 0.152 | 0.042 | 0.676 | 0.909 |
| fulltime | 0.048 | 0.264 | 0.139 | 0.246 | 0.083 | 0.149 | 0.061 |
| pubtrans | 0.122 | 0.072 | 0.105 | 0.064 | 0.767 | 0.571 | 0.046 |
| travtime | 0.047 | 0.109 | 0.066 | 0.749 | 0.832 | 0.110 | 0.164 |
| retail | 0.041 | 0.041 | 0.164 | 0.039 | 0.042 | 0.475 | 0.985 |
| transp | 0.049 | 0.074 | 0.699 | 0.052 | 0.998 | 1.000 | 1.000 |
| eduheal | 0.093 | 0.998 | 0.862 | 0.089 | 0.832 | 0.998 | 0.058 |
| entert | 0.052 | 0.083 | 0.261 | 0.347 | 0.861 | 1.000 | 1.000 |
| occup | 0.038 | 0.095 | 0.047 | 0.040 | 0.122 | 0.043 | 1.000 |
| rooms | 0.055 | 0.335 | 0.102 | 0.049 | 0.359 | 0.087 | 0.092 |
| internet | 0.041 | 0.147 | 0.078 | 0.065 | 0.339 | 0.266 | 0.051 |
| nocov | 0.052 | 0.043 | 0.095 | 0.996 | 0.057 | 0.462 | 1.000 |
Note: The number of observations is 177. The sampling method consists of a random walk Metropolis‐Hastings algorithm combined with a random swap. A benchmark and uniform priors have been established, respectively, for parameters and models.
FIGURE 2Bayesian model averaging: Posterior density functions for the coefficients of selected regressors, 12 Jan – 19 Jan
Predictive performance: Root mean square errors
| Relative, in‐sample | |||||||
|---|---|---|---|---|---|---|---|
| 8 Sep–15 Sep | 29 Sep–6 Oct | 20 Oct–27 Oct | 10 Nov–17 Nov | 1 Dec–8 Dec | 22 Dec–29 Dec | 12 Jan–19 Jan | |
| Benchmark | 542.735 | 942.967 | 766.300 | 1136.260 | 637.715 | 565.335 | 602.540 |
| Poisson | 1.068 | 1.094 | 0.886 | 1.064 | 1.145 | 0.364 | 0.558 |
| BMA | 1.080 | 1.205 | 0.951 | 1.147 | 1.206 | 0.381 | 0.538 |
| HPM | 1.093 | 1.243 | 0.928 | 1.149 | 1.204 | 0.386 | 0.530 |
| BPM | 1.102 | 1.213 | 0.922 | 1.141 | 1.224 | 0.377 | 0.535 |
Predictive performance: Empirical coverage probabilities
| In‐sample | |||||||
|---|---|---|---|---|---|---|---|
| 8 Sep–15 Sep | 29 Sep–6 Oct | 20 Oct–27 Oct | 10 Nov–17 Nov | 1 Dec–8 Dec | 22 Dec–29 Dec | 12 Jan–19 Jan | |
| Benchmark | 0.96 | 0.960 | 0.938 | 0.977 | 0.96 | 0.955 | 0.932 |
| Poisson | 0.915 | 0.926 | 0.904 | 0.910 | 0.915 | 0.915 | 0.887 |
| BMA | 0.926 | 0.910 | 0.910 | 0.904 | 0.910 | 0.915 | 0.887 |
| HPM | 0.921 | 0.915 | 0.915 | 0.898 | 0.910 | 0.921 | 0.881 |
| BPM | 0.921 | 0.910 | 0.910 | 0.915 | 0.921 | 0.932 | 0.893 |
FIGURE 3Predictive performance: Choropleth maps of observed and predicted (BPM) weekly new confirmed COVID‐19 cases in selected periods