| Literature DB >> 25369315 |
Abstract
We propose the use of Google online search data for nowcasting and forecasting the number of food stamps recipients. We perform a large out-of-sample forecasting exercise with almost 3000 competing models with forecast horizons up to 2 years ahead, and we show that models including Google search data statistically outperform the competing models at all considered horizons. These results hold also with several robustness checks, considering alternative keywords, a falsification test, different out-of-samples, directional accuracy and forecasts at the state-level.Entities:
Mesh:
Year: 2014 PMID: 25369315 PMCID: PMC4219814 DOI: 10.1371/journal.pone.0111894
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Bloomberg story-count for “food stamps” worldwide (left plot); Google standardized volume of news related to “food stamps” worldwide (right plot).
Google data are registered trademarks of Google Inc., used with permission.
Figure 2Original an cleaned food stamp data at the US national level.
Sample: 1988M10 - 2011M5.
Figure 3GI for the keywords “food stamps”.
Sample: 2004M1 - 2011M5. Google data are registered trademarks of Google Inc., used with permission.
Unit root tests.
|
|
|
|
| |
|
|
|
|
| |
|
| 1.34 | 1.85 | −3.26 [1997M4] | −4.06[1996M4,2008M8] |
|
| 0.85* | 1.07* | −3.94[2007M9] | −5.22[1992M11,200810] |
|
| 1.60 | 2.94* | −4.53*[2008M7] | −5.60*[2006M8,2008M8] |
|
| 1.17* | 1.58 | −5.52*[2007M4] | −6.48*[2007M3,2009M9] |
|
| 5.16* | 7.86* | −3.10[1998M10] | −3.72[1992M5,1999M3] |
|
|
| |||
|
|
|
|
| |
|
|
|
|
| |
|
| 0.02 |
| 31.55* | / |
|
| 15.05* | / | 3.94* | / |
|
| NA | NA | 1.41 |
|
|
| NA | NA | 6.46* | / |
|
| 20.20* | / | 8.06* | / |
Unit root tests: RUR = Range Unit Root test by [33]; FB = Forward-Backward RUR test by [33]; LS = unit root test with breaks by [32] - the estimated break dates are reported in brackets. The second step for the periodic unit root tests by [35] and [34] is performed only if the first step did not reject the null hypothesis of a periodic unit root. P-values smaller than 0.05 are in bold font. * Significant at the 5%, level.
P-values of sequential tests for weak exogeneity.
| Wald test | Toda-Yamamoto | |||
| 1st step | 2nd step | 1st step | 2nd step | |
|
|
|
| 0.05 | / |
|
|
|
|
| 0.07 |
|
| 0.58 | / | 0.79 | / |
|
|
|
| 0.26 | / |
|
|
|
|
|
|
P-values of sequential tests for weak exogeneity: standard Wald test and Wald test using the approach by [37]. P-values smaller than 0.05 are in bold font.
Single-equation and multivariate cointegration tests with and without structural break(s).
| Single-equation cointegration tests | ||||
| Engle and Granger (1987) | Gregory and Hansen (1996) | Hatemi (2008) | ||
| No breaks | 1 (endogenous) break | up to 2 (endogenous) breaks | ||
|
|
|
|
|
|
| −3.83 | −4.82 | 2009M11 | −5.29 | 2006M1 2009M1 |
Single-equation and multivariate cointegration tests with and without structural break(s). The null hypothesis for all tests is the absence of cointegration. All the tests considered the case of no deterministic trend in the data and an intercept in the cointegration equation (CE), centered seasonal dummies outside the CE, while the number of lags is chosen using the Schwartz criterion. The tests allowing for break(s) considered the case of a level shift. * Significant at the 5% level.
Estimated coefficients in the equation of food stamps recipients (left block) and misspecification and stability tests (right block).
|
|
|
|
|
|
| log(Food stamps(-1)) | 0.59 | 5.40 | Ljung-Box(12) | 0.52 |
| log(Food stamps(-2)) | 0.30 | 2.31 | Ljung-Box(24) | 0.65 |
| log(Food stamps(-3)) | 0.29 | 2.22 | Ljung-Box(12) res. sq. | 0.79 |
| log(Food stamps(-4)) | -0.23 | -2.25 | Ljung-Box(24) res. sq. | 0.79 |
| log(Unemployment rate) | 0.02 | 3.13 | ARCH(12) | 0.89 |
| log(GI - Food Stamps) | 0.01 | 3.96 | ARCH(24) | 0.98 |
| log(GI - Jobs) | 0.02 | 2.03 | Jarque-Bera |
|
| constant | 0.87 | 4.63 | RESET | 0.56 |
| S1 | −0.02 | −5.74 | BDS (dim = 2) | 0.12 |
| S2 | −0.02 | −8.07 | BDS (dim = 6) |
|
| S3 | −0.01 | −4.37 | OLS-CUSUM | 0.99 |
| S4 | −0.02 | −4.44 | Rec-CUSUM | 0.06 |
| S5 | −0.01 | −3.43 | OLS-MOSUM | 0.51 |
| S6 | −0.02 | −4.04 | Rec-MOSUM | 0.39 |
| S7 | −0.01 | −3.95 | Andrews max-F |
|
| S8 | −0.01 | −3.89 | Andrews exp-F | 0.22 |
| S9 | −0.01 | −4.64 | Andrews ave-F | 0.09 |
| S10 | −0.01 | −3.69 | Optimal n. breakpoints (BIC) | 0 |
| S11 | −0.01 | −4.36 | Optimal n. breakpoints (LWZ) | 0 |
Estimated coefficients in the equation of food stamps recipients (left block) and misspecification and stability tests (right block). Sample: 2004M1- 2011M05. P-values smaller than 0.05 are in bold font.
Misspecification tests: the [46] statistics for testing the absence of autocorrelation up to order in the models' residuals and residuals squared; the Lagrange multiplier test for Auto-Regressive Conditional Heteroskedasticity (ARCH) in the residuals proposed by [47]; the [48] test for checking whether a time series is normally distributed; the REgression Specification Error Test (RESET) proposed by [49], which is a general test for incorrect functional form, omitted variables, and correlation between the regressors and the error term; the BDS test by [50] to test whether the residuals are independent and identically distributed (iid) and which is robust against a variety of possible deviations from independence, including linear dependence, non-linear dependence, or chaos.
Stability tests: the test for parameter instability by [51] which is based on the CUmulative SUM of the recursive residuals (Rec-CUSUM); [52] suggested to modify the previous structural change test and use the cumulative sums of the common OLS residuals (OLS-CUSUM). [53] proposed a structural change test which analyzes moving sums of residuals (MOSUM) instead of cumulative sums. We remark that a unifying view of the previous structural change tests within a generalized M-fluctuation test framework was proposed by [54] and [55]. [56] was the first to suggest an F-test for structural change when the break point is known: [57] and [58] extended the Chow test by computing the F statistics for all potential break points and suggested three different test statistics, the sup-F, the ave-F and the exp-F, which are based on Wald, Lagrange Multiplier or Likelihood Ratio statistics respectively, in a very general class of models fitted by Generalized Method of Moments. See [59] for a review and a step-by-step description of stability tests using R software. Besides, [60], following [61], suggested to find the optimal number of breakpoints by optimizing the Bayesian Information Criterion (BIC) and the modified BIC by [62] (LWZ, 1997).
Models used for nowcasting and forecasting: Linear and Periodic models.
| LAGS | Additional regressors | Time Samples | Data Transformations | Total models | ||||||||||||||||||
| NO REG. |
|
|
|
|
|
|
|
| 1988M10-2011M5 | 2004M1-2011M5 |
| log( |
|
| (sum by row) | |||||||
| Regressors: |
|
|
|
|
|
|
|
|
|
|
|
|
| |||||||||
| sim./lagged |
|
|
|
|
|
|
| |||||||||||||||
| AR(p) | up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| ARMA(p, q) | up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| AR(p) | up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||
| + seasonal | up to 12 | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| constants | up to 12 | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| up to 12 | yes | yes | yes | yes | yes | yes | 12 | |||||||||||||||
| PAR(p) | up to 3 | yes | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | yes | 1 | ||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | yes | 3 | ||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | yes | 1 | ||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | yes | 3 | ||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | 1 | |||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | 1 | |||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | 1 | |||||||||||||||
| PAR(p) | up to 3 | yes | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||
| + periodic | up to 3 | yes | yes | yes | yes | yes | yes | 3 | ||||||||||||||
| trends | up to 3 | yes | yes | yes | yes | yes | yes | 3 | ||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | yes | 1 | ||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | yes | 1 | ||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | 1 | |||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | 1 | |||||||||||||||
| up to 1 | yes | yes | yes | yes | yes | yes | 1 | |||||||||||||||
| PAR(p) | up to 3 | yes | yes | yes | yes | yes | yes | yes | 3 | |||||||||||||
| + ARCH(1) | up to 3 | yes | yes | yes | yes | yes | yes | 3 | ||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | yes | 3 | ||||||||||||||
| up to 3 | yes | yes | yes | yes | yes | yes | yes | 3 | ||||||||||||||
| PEC | 1,12 | yes | yes | yes | yes | 1 | ||||||||||||||||
| PEC | 1,12 | yes | yes | yes | yes | yes | 1 | |||||||||||||||
| PEC | 1,12 | yes | yes | yes | yes | yes | 1 | |||||||||||||||
| PEC | 1,12 | yes | yes | yes | yes | yes | yes | 1 | ||||||||||||||
| PEC | 1,12 | yes | yes | yes | yes | 1 | ||||||||||||||||
Models used for nowcasting and forecasting: Multivariate models, Nonlinear models and Random Walk with drift.
| Additional regressors | Time Samples | Data Transformations | Row Total | ||||||||||||||||||||||
| MODELS | LAGS | NO REG. |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1988M10-2011M5 | 2004M1-2011M5 |
| log |
|
| models |
| VAR | 1–7 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VAR | 1–7 | yes | yes | yes | yes | yes | 2 | ||||||||||||||||||
| VAR | 1–6 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VAR | 1–6 | yes | yes | yes | yes | yes | 2 | ||||||||||||||||||
| BVAR | 1–7 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| BVAR | 1–6 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| BVAR | 1,12 | yes | yes | yes | yes | yes | yes | 1 | |||||||||||||||||
| BVAR | 1–7 | yes | yes | yes | yes | yes | yes | 2 | |||||||||||||||||
| BVAR | 1–6 | yes | yes | yes | yes | yes | yes | 2 | |||||||||||||||||
| BVAR | 1,12 | yes | yes | yes | yes | yes | yes | yes | yes | 2 | |||||||||||||||
| BVAR | 1–7 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| BVAR | 1–6 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| BVAR | 1,12 | yes | yes | yes | yes | yes | yes | 1 | |||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | yes | 2 | ||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | yes | 2 | ||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | yes | 2 | ||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| VEC | 1,12 | yes | yes | yes | yes | 1 | |||||||||||||||||||
| SETAR | up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||||
| LSTAR | up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||||
| AAR | up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||||
| NNET | up to 12 | yes | yes | yes | yes | yes | yes | yes | 12 | ||||||||||||||||
| RW | yes | yes | yes | yes | 1 | ||||||||||||||||||||
|
| |||||||||||||||||||||||||
Number of models with Google data out of the Top 100 models according to the RMSE.
| Nowcasting 1 s.a. | Nowcasting 2 s.a. | Forecasting 12 s.a. | Forecasting 24 s.a. | |
| RMSE | 41 | 90 | 92 | 91 |
Ranking of the best models within each class according to the RMSE.
| Type of | Nowcasting | Nowcasting | Forecasting | Forecasting | |
| models | 1 s.a. | 2 s.a. | 12 s.a. | 24 s.a. | |
|
| AR w/GI | 17 | 81 | 127 | 177 |
|
| AR w/o GI | 5 | 75 | 75 | 128 |
| ARMA w/GI | 138 | 51 | 74 | 153 | |
| ARMA w/o GI | 1 | 113 | 87 | 123 | |
| AR + s.d. w/GI | 17 | 1 | 1 | 1 | |
| AR + s.d. w/o GI | 14 | 38 | 180 | 180 | |
|
| PAR w/GI | 2530 | 2364 | 17 | 41 |
|
| PAR w/o GI | 444 | 948 | 690 | 822 |
| PAR+p.t. w/GI | 2632 | 2623 | 959 | 145 | |
| PAR+p.t. w/o GI | 391 | 613 | 377 | 463 | |
| PAR-ARCH w/GI | 2635 | 2514 | 555 | 159 | |
| PAR-ARCH w/o GI | 1138 | 1459 | 610 | 836 | |
| PEC w/GI | 2538 | 2547 | 53 | 44 | |
| PEC w/o GI | 1783 | 2442 | 72 | 703 | |
|
| VAR w/GI | 236 | 441 | 2053 | 2462 |
|
| VAR w/o GI | 293 | 345 | 61 | 229 |
| VEC w/GI | 102 | 194 | 856 | 1518 | |
| VEC w/o GI | 209 | 367 | 257 | 627 | |
| BVAR w/GI | 7 | 370 | 515 | 411 | |
| BVAR w/o GI | 197 | 907 | 925 | 1301 | |
|
| SETAR | Not converged | Not converged | Not converged | Not converged |
|
| LSTAR | 716 | 1144 | 410 | 137 |
| NNET | 1359 | 1595 | 923 | 797 | |
| AAR | 383 | 704 | 82 | 40 | |
|
| RW | 2562 | 2585 | 1847 | 1183 |
Top 10 models in terms of RMSE - baseline case. Nowcasting: 1 step and 2 steps ahead.
| 1 STEP ahead (baseline case) | 2 STEPS ahead (baseline case) | ||
| Top 10 models | RMSE | Top 10 models | RMSE |
| ARMA(10,10) dlog 1988 | 159024 | AR(3)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 211508 |
| ARMA(10,10) + UR dlog 1988 | 160819 | AR(8)+S.D.+GI(F.S.) lev 2004 | 211784 |
| ARMA(12,12) dlog 1988 | 161311 | AR(7)+ S.D.+GI(F.S.) lev 2004 | 211843 |
| ARMA(11,11) + UR diff 1988 | 162494 | AR(2)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 212644 |
| AR(12) + IC(sim+lags) dlog 1988 | 164194 | AR(7)+S.D.+GI(J.&F.S.) lev 2004 | 212878 |
| ARMA(12,12) + IC diff 1988 | 165172 | AR(4)+ S.D.+GI(F.S.) lev 2004 | 214086 |
| BVAR(1,12) FS+GI(F.S.) lev 2004 | 165369 | AR(4)+S.D.+GI(J.&F.S.) lev 2004 | 215379 |
| BVAR(1,12) FS+UR+IC+GI(F.S.) lev 2004 | 165531 | AR(8)+S.D.+GI(J.&F.S.) lev 2004 | 215468 |
| ARMA(12,12) + UR dlog 1988 | 166215 | AR(5)+ S.D.+GI(F.S.) lev 2004 | 216076 |
| ARMA(11,11) dlog 1988 | 167503 | AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 216667 |
In each row, the following information is reported: the model, the number of lags, (eventual) exogenous regressors, the data transformation, the first year of the estimation sample.
Top 10 models in terms of RMSE - baseline case. Forecasting: 12 steps and 24 steps ahead.
| 12 STEPS ahead (baseline case) | 24 STEPS ahead (baseline case) | ||
| Top 10 models | RMSE | Top 10 models | RMSE |
| AR(2)+S.D.+UR+IC+GI(J.) log 2004 | 1495400 | AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 3775883 |
| AR(5)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 1527588 | AR(5)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 3777359 |
| AR(3)+S.D.+UR+IC+GI(J.) log 2004 | 1534364 | AR(2)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 3830094 |
| AR(4)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 1544779 | AR(2)+S.D.+UR+IC+GI(J.) log 2004 | 3839694 |
| AR(2)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 1565497 | AR(7)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 3861489 |
| AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 1576811 | AR(4)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 3887615 |
| AR(6)+S.D.+UR+IC+GI(J.) log 2004 | 1593775 | AR(6)+S.D.+UR+IC+GI(J.) log 2004 | 3914935 |
| AR(7)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 1595086 | AR(3)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 3939222 |
| AR(3)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 1595117 | AR(7)+S.D.+UR+IC+GI(J.) log 2004 | 3973551 |
| AR(8)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 1608689 | AR(8)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | 3999943 |
In each row, the following information is reported: the model, the number of lags, (eventual) exogenous regressors, the data transformation, the first year of the estimation sample.
Number of models included in the MCS, at the 90% confidence level, using the and statistics and the MSE loss function.
| 1 step | 2 step | 12 steps | 24 step | |||||
|
|
|
|
|
|
|
|
| |
|
| 683 | 6 | 119 | 2 | 11 | 87 | 37 | 20 |
|
| 334 | 2 | 102 | 2 | 11 | 79 | 37 | 20 |
|
| 7 | 0 | 0 | 0 | 0 | 3 | 0 | 0 |
Figure 4GIs for the keywords “Supplemental Nutrition Assistance Program”, “snap program”, and “food stamps”.
Sample: 2004M1 - 2011M5. Search keywords are not case sensitive. Google data are registered trademarks of Google Inc., used with permission.
Number of nowcasting and forecasting models selected in the MCS at the 90% confidence level, using the statistic and the MSE loss function, as well as number of selected models using the “false” Google Index.
| N. 1 step | N. 2 steps | F. 12 steps | F. 24 steps | |
| Models selected | 614 | 122 | 37 | 36 |
| Models using the “false” Google Index | 29 | 2 | 0 | 0 |
Top 10 models in terms of RMSE - different out-of-sample periods. Nowcasting: 1 step ahead.
|
|
|
|
|
|
| ARMA(10,10) dlog 1988 | BVAR(1,12) FS+UR+IC+GI(F.S.) lev 2004 |
| ARMA(10,10) + UR dlog 1988 | BVAR(7) FS+UR+IC+GI(F.S.) lev 2004 |
| ARMA(12,12) dlog 1988 | AR(4)+S.D.+GI(F.S.) lev 2004 |
| ARMA(11,11)+UR diff 1988 | AR(7)+S.D.+GI(F.S.) lev 2004 |
| AR(12) + IC (sim + lags) dlog 1988 | AR(5)+S.D.+GI(F.S.) lev 2004 |
| ARMA(12,12) + IC diff 1988 | AR(8)+S.D.+GI(F.S.) lev 2004 |
| BVAR(1,12) FS+GI(F.S.) lev 2004 | AR(6)+S.D.+GI(F.S.) lev 2004 |
| BVAR(1,12) FS+UR+IC+GI(F.S.) lev 2004 | ARMA(11,11)+UR diff 1988 |
| ARMA(12,12) + UR dlog 1988 | ARMA(12,12)+GI(J.&F.S.) lev 2004 |
| ARMA(11,11) dlog 1988 | AR(6)+S.D.+GI(J.&F.S.) log 2004 |
|
|
|
|
|
|
| ARMA(10,10) dlog 1988 | AR(8)+S.D.+IC lev 2004 |
| ARMA(10,10) + UR dlog 1988 | AR(9)+S.D.+IC lev 2004 |
| ARMA(12,12) dlog 1988 | AR(7)+S.D.+IC lev 2004 |
| ARMA(11,11) + UR diff 1988 | AR(7)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(12) + IC (sim + lags) dlog 1988 | AR(10)+S.D.+IC lev 2004 |
| ARMA(12,12) + IC diff 1988 | AR(8)+S.D.+UR+IC+GI(J.) log 2004 |
| BVAR(1,12) FS+GI(F.S.) lev 2004 | AR(6)+S.D.+GI(J.) log 2004 |
| BVAR(1,12) FS+UR+IC+GI(F.S.) lev 2004 | AR(4)+S.D.+IC lev 2004 |
| ARMA(12,12) + UR dlog 1988 | AR(5)+S.D.+IC lev 2004 |
| ARMA(11,11) dlog 1988 | AR(10)+S.D.+UR+IC+GI(J.) log 2004 |
Baseline case (left column) and the two cases including the 2008 recession (top right column) and the expansion starting in 2009 (low right column).
Top 10 models in terms of RMSE - different out-of-sample periods. Forecasting: 24 steps ahead.
| 24 STEPS ahead (baseline case) | 24 STEPS ahead (expansion 2009) |
| Top 10 models | Top 10 models |
| AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(2)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(5)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(6)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(2)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(3)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(2)+S.D.+UR+IC+GI(J.) log 2004 | AR(5)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(7)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(7)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(4)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(8)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(6)+S.D.+UR+IC+GI(J.) log 2004 | AR(9)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(3)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(11)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(7)+S.D.+UR+IC+GI(J.) log 2004 | AR(4)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(8)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 |
Baseline case (left column) and the case including the expansion starting in 2009 (low right column).
Number of nowcasting and forecasting models selected in the MCS at the 90% confidence level, using the statistic and the MSE loss function, as well as number of selected Google based models.
| Recession 2008 | Expansion 2009 | ||
| Nowcasting 1 step | Models selected | 173 | 82 |
| Google based models | 101 | 68 | |
| Nowcasting 2 steps | Models selected | 101 | 51 |
| Google based models | 89 | 42 | |
| Forecasting 12 steps | Models selected | 22 | 5 |
| Google based models | 22 | 5 | |
| Forecasting 24 steps | Models selected | NA | 13 |
| Google based models | NA | 13 |
Top 10 models in terms of RMSE - different out-of-sample periods. Nowcasting: 2 steps ahead.
|
|
|
|
|
|
| AR(3)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(8)+S.D.+GI(F.S.) lev 2004 |
| AR(8)+S.D.+GI(F.S.) lev 2004 | AR(7)+ S.D.+GI(F.S.) lev 2004 |
| AR(7)+ S.D.+GI(F.S.) lev 2004 | AR(10)+ S.D.+GI(F.S.) lev 2004 |
| AR(2)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(4)+ S.D.+GI(F.S.) lev 2004 |
| AR(7)+S.D.+GI(J.&F.S.) lev 2004 | AR(9)+ S.D.+GI(F.S.) lev 2005 |
| AR(4)+ S.D.+GI(F.S.) lev 2004 | AR(5)+ S.D.+GI(F.S.) lev 2006 |
| AR(4)+S.D.+GI(J.&F.S.) lev 2004 | AR(6)+ S.D.+GI(F.S.) lev 2007 |
| AR(8)+S.D.+GI(J.&F.S.) lev 2004 | AR(11)+ S.D.+GI(F.S.) lev 2007 |
| AR(5)+ S.D.+GI(F.S.) lev 2004 | ARMA(12,12)+GI(J.&F.S.) log 2004 |
| AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(6)+S.D.+GI(J.&F.S.) log 2004 |
|
|
|
|
|
|
| AR(3)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(8)+S.D.+IC lev 2004 |
| AR(8)+S.D.+GI(F.S.) lev 2004 | AR(7)+S.D.+IC lev 2004 |
| AR(7)+ S.D.+GI(F.S.) lev 2004 | AR(10)+S.D.+IC lev 2004 |
| AR(2)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(7)+S.D.+UR+IC+GI(J.&F.S.) log 2004 |
| AR(7)+S.D.+GI(J.&F.S.) lev 2004 | AR(9)+S.D.+IC lev 2004 |
| AR(4)+ S.D.+GI(F.S.) lev 2004 | AR(2)+S.D.+UR+IC+GI(F.S.) lev 2004 |
| AR(4)+S.D.+GI(J.&F.S.) lev 2004 | AR(2)+S.D.+UR+IC+GI(J.&F.S.) log 2004 |
| AR(8)+S.D.+GI(J.&F.S.) lev 2004 | AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 |
| AR(5)+ S.D.+GI(F.S.) lev 2004 | AR(3)+S.D.+UR+IC+GI(F.S.) log 2004 |
| AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(5)+S.D.+IC lev 2004 |
Baseline case (left column) and the two cases including the 2008 recession (top right column) and the expansion starting in 2009 (low right column).
Top 10 models in terms of RMSE - different out-of-sample periods. Forecasting: 12 step ahead.
|
|
|
|
|
|
| AR(2)+S.D.+UR+IC+GI(J.) log 2004 | PAR(1)+UR+IC+GI(J.) log 2004 |
| AR(5)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(2)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(3)+S.D.+UR+IC+GI(J.) log 2004 | AR(3)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(4)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | PAR(1)+UR+IC+GI(J.) lev 2004 |
| AR(2)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | PAR(1)+UR+IC+GI(J.&F.S.) lev 2004 |
| AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | PAR(1)+UR+IC+GI(J.&F.S.) log 2004 |
| AR(6)+S.D.+UR+IC+GI(J.) log 2004 | AR(5)+S.D.+UR+IC+GI(J.&F.S.) log 2004 |
| AR(7)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(6)+S.D.+UR+IC+GI(J.) log 2004 |
| AR(3)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(4)+S.D.+UR+IC+GI(J.&F.S.) log 2004 |
| AR(8)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(5)+S.D.+UR+IC+GI(J.) log 2004 |
|
|
|
|
|
|
| AR(2)+S.D.+UR+IC+GI(J.) log 2004 | PAR(1)+UR+IC+GI(J.&F.S.) log 2004 |
| AR(5)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(4)+S.D.+GI(J.&F.S.) log 2004 |
| AR(3)+S.D.+UR+IC+GI(J.) log 2004 | AR(5)+S.D.+GI(J.&F.S.) log 2004 |
| AR(4)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(6)+S.D.+GI(J.&F.S.) log 2004 |
| AR(2)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | PAR(1)+UR+IC+GI(J.&F.S.) lev 2004 |
| AR(6)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(7)+S.D.+GI(J.&F.S.) log 2004 |
| AR(6)+S.D.+UR+IC+GI(J.) log 2004 | AR(4)+S.D.+UR+IC+GI(J.&F.S.) log 2004 |
| AR(7)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(5)+S.D.+UR+IC+GI(J.&F.S.) log 2004 |
| AR(3)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | PAR(1)+UR+IC+GI(J.) log 2004 |
| AR(8)+S.D.+UR+IC+GI(J.&F.S.) log 2004 | AR(2)+S.D.+UR+IC+GI(J.) log 2004 |
Baseline case (left column) and the two cases including the 2008 recession (top right column) and the expansion starting in 2009 (low right column).
Directional accuracy of forecasts: number of models with 100% correct predictions for the direction of change.
| N. 1 step | N. 2 steps | F. 12 steps | F. 24 steps | |
| N. of models | 1 | 179 | 1096 | 1252 |
| Google based models | 1 | 101 | 715 | 815 |
Figure 5Yearly changes for the food stamps data and the GI for “food stamps”.
Sample: 2004M1 - 2011M5. The turning points for each series is highlighted by a vertical line of the same color. Google data are registered trademarks of Google Inc., used with permission.
Directional accuracy for : number of first-best and second-best models, together with their percentage of correct predictions for the sign of .
| N. 1 step | N. 2 steps | F. 12 steps | F. 24 steps | ||||||
|
|
|
|
|
|
|
|
| ||
| (1st best) | Models total | 6 | 82% | 3 | 80% | 1 | 79% | 2 | 81% |
| Google based models | 1 | 82% | 0 | / | 0 | / | 1 | 81% | |
| (2nd best) | Models total | 12 | 80% | 7 | 78% | 2 | 77% | 6 | 78% |
| Google based models | 1 | 80% | 4 | 78% | 0 | / | 5 | 78% | |
US state level forecasts.
| US state | 1 step | 2 steps | 12 steps | 24 steps | Census | Density (inhabitants |
| Population | per square mile) | |||||
| Alabama | 38 | 100 | 89 | 87 | 4822023 | 92 |
| Alaska | 45 | 61 | 62 | 38 | 731449 | 1 |
| Arizona | 44 | 36 | 4 | 20 | 6392017 | 56 |
| Arkansas | 0 | 9 | 0 | 26 | 2915918 | 55 |
| California | 34 | 68 | 84 | 72 | 37253956 | 228 |
| Colorado | 1 | 3 | 4 | 53 | 5029196 | 48 |
| Connecticut | 57 | 48 | 17 | 94 | 3574097 | 645 |
| Delaware | 24 | 52 | 42 | 40 | 897934 | 361 |
| District of Columbia | 2 | 1 | 41 | 14 | 601723 | 8805 |
| Florida | 0 | 13 | 13 | 42 | 18801310 | 286 |
| Georgia | 35 | 35 | 58 | 75 | 9687653 | 163 |
| Hawaii | 20 | 30 | 35 | 35 | 1360301 | 124 |
| Idaho | 35 | 36 | 11 | 10 | 1567582 | 19 |
| Illinois | 38 | 68 | 87 | 67 | 12830632 | 222 |
| Indiana | 30 | 48 | 22 | 8 | 6483802 | 178 |
| Iowa | 34 | 60 | 38 | 23 | 3046355 | 54 |
| Kansas | 100 | 98 | 18 | 51 | 2853118 | 35 |
| Kentucky | 66 | 64 | 41 | 44 | 4339367 | 107 |
| Louisiana | 28 | 14 | 98 | 94 | 4533372 | 87 |
| Maine | 21 | 24 | 51 | 57 | 1328361 | 38 |
| Maryland | 64 | 70 | 43 | 60 | 5296486 | 427 |
| Massachusetts | 73 | 65 | 63 | 50 | 6349097 | 602 |
| Michigan | 64 | 65 | 62 | 66 | 9938444 | 103 |
| Minnesota | 16 | 8 | 45 | 65 | 4919479 | 57 |
| Mississippi | 7 | 3 | 35 | 39 | 2844658 | 59 |
| Missouri | 8 | 0 | 2 | 1 | 5595211 | 80 |
| Montana | 53 | 48 | 35 | 36 | 902195 | 6 |
| Nebraska | 1 | 0 | 9 | 95 | 1711263 | 22 |
| Nevada | 39 | 26 | 13 | 21 | 1998257 | 18 |
| New Hampshire | 60 | 14 | 41 | 86 | 1235786 | 132 |
| New Jersey | 57 | 72 | 84 | 87 | 8414350 | 965 |
| New Mexico | 43 | 43 | 46 | 58 | 1819046 | 15 |
| New York | 1 | 14 | 76 | 72 | 18976457 | 348 |
| North Carolina | 65 | 80 | 83 | 74 | 8049313 | 150 |
| North Dakota | 4 | 17 | 41 | 38 | 642200 | 9 |
| Ohio | 16 | 18 | 60 | 73 | 11353140 | 253 |
| Oklahoma | 57 | 65 | 6 | 14 | 3450654 | 49 |
| Oregon | 74 | 56 | 24 | 2 | 3421399 | 35 |
| Pennsylvania | 78 | 76 | 49 | 67 | 12281054 | 267 |
| Rhode Island | 28 | 46 | 76 | 93 | 1048319 | 679 |
| South Carolina | 65 | 70 | 41 | 42 | 4012012 | 125 |
| South Dakota | 43 | 55 | 53 | 95 | 754844 | 10 |
| Tennessee | 6 | 53 | 46 | 33 | 5689283 | 135 |
| Texas | 16 | 75 | 70 | 60 | 20851820 | 78 |
| Utah | 39 | 56 | 31 | 39 | 2233169 | 26 |
| Vermont | 18 | 33 | 29 | 20 | 608827 | 63 |
| Virginia | 73 | 74 | 69 | 40 | 7078515 | 165 |
| Washington | 47 | 44 | 29 | 49 | 5894121 | 83 |
| West Virginia | 1 | 12 | 23 | 24 | 1808344 | 75 |
| Wisconsin | 12 | 4 | 4 | 15 | 5363675 | 82 |
| Wyoming | 5 | 8 | 52 | 33 | 493782 | 5 |
Number of models using Google data out of the Top 100 models (according to the RMSE), 2010 census population data for each US state and population density (inhabitants per square mile, 2010).