| Literature DB >> 26001083 |
Jaroslav Pavlicek1, Ladislav Kristoufek2.
Abstract
The online activity of Internet users has repeatedly been shown to provide a rich information set for various research fields. We focus on job-related searches on Google and their possible usefulness in the region of the Visegrad Group--the Czech Republic, Hungary, Poland and Slovakia. Even for rather small economies, the online searches of inhabitants can be successfully utilized for macroeconomic predictions. Specifically, we study unemployment rates and their interconnection with job-related searches. We show that Google searches enhance nowcasting models of unemployment rates for the Czech Republic and Hungary whereas for Poland and Slovakia, the results are mixed.Entities:
Mesh:
Year: 2015 PMID: 26001083 PMCID: PMC4441379 DOI: 10.1371/journal.pone.0127084
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Unemployment rate in the Visegrad countries.
The group of countries is evidently quite heterogenous in the unemployment rates. The Hungarian rate starts at the lowest level but increases stably during the whole period. The Czech rate begins at quite low levels and decreases up to the outbreak of the financial crisis when the rate surges up until 2010, after which it remains quite stable. The Polish and Slovakian rates commence at very high levels of unemployment, which decrease again up until the outbreak of the crisis, after which they change trends, similarly to the Czech rate.
Summary statistics.
Jarque-Bera test with the null hypothesis of a symmetric distribution with no excess kurtosis is used here, p-values are reported in the brackets.
| Czech Rep. | Hungary | Poland | Slovakia | |
|---|---|---|---|---|
| average | 6.773 | 8.917 | 11.558 | 13.745 |
| median | 6.900 | 8.650 | 9.950 | 14.000 |
| SD | 1.179 | 1.872 | 3.987 | 2.491 |
| minimum | 4.000 | 5.800 | 6.300 | 8.500 |
| maximum | 9.100 | 11.800 | 21.000 | 19.700 |
| skewness | -0.644 | 0.014 | 0.948 | 0.160 |
| excess kurtosis | -0.207 | -1.507 | -0.419 | -0.067 |
| Jarque-Bera test | 8.498 | 11.364 | 18.855 | 0.536 |
|
| [< 0.05] | [< 0.01] | [< 0.01] | [> 0.10] |
| observations | 120 | 120 | 120 | 120 |
Fig 2Google search queries for the job-related terms in the Visegrad countries.
The patterns are again quite heterogenous, and the connection between the Google searches and the unemployment rates can be observed for the Czech and Hungarian rates. For the other two, the connection is not visible by the naked eye. Detailed treatment of the interconnections is given in the Results section of the text. Google data are registered trademarks of Google Inc., used with permission.
Stationarity testing.
Augmented Dickey-Fuller test (ADF) for a presence of unit root and KPSS test for stationarity are used. *, ** and *** stand for statistical significance at 10%, 5% and 1% levels, respectively. Number of lags for the tests is based on the Akaike Information Criterion (AIC) selection.
| Czech Rep. | Hungary | Poland | Slovakia | |
|---|---|---|---|---|
|
| ||||
| Unemployment | -1.6066 | -1.78611 | -2.6739* | -2.6438* |
| - first difference | -5.3860*** | -4.5134*** | -3.5267*** | -4.2349*** |
| -0.6897 | -1.3974 | -2.1745 | -0.8000 | |
| - logarithm | -0.6931 | -1.1728 | -2.3280 | -0.3644 |
| - difference | -11.4213*** | -10.8293*** | -11.1560*** | -11.8463*** |
| - logarithmic difference | -11.5094*** | -10.9022*** | -11.0750*** | -11.7591*** |
|
| ||||
| Unemployment | 0.5399** | 2.5995*** | 1.7946*** | 0.7507*** |
| - first difference | 0.1932 | 0.2848 | 0.6708** | 0.5673** |
| 0.8294*** | 1.9059*** | 1.3281*** | 1.1640*** | |
| - logarithm | 0.8193*** | 1.9375*** | 1.3596*** | 1.1737*** |
| - difference | 0.0889 | 0.1580 | 0.1358 | 0.1122 |
| - logarithmic difference | 0.0977 | 0.1480 | 0.1362 | 0.0967 |
Nowcasting summary (in-sample).
The whole analyzed period 01/2004-12/2013 is covered here. Model in Eq 2 is used here with varying maximum lag L. Joint significance of variables is a simple F-test based on heteroskedasticity and autocorrelation consistent (HAC) standard errors (p-values are reported in the brackets). Adjusted coefficient of determination controls for the number of independent variables used in the model.
| Czech Rep. | Hungary | Poland | Slovakia | ||
|---|---|---|---|---|---|
| Δ |
| 48.115 | 0.965 | 0.290 | 11.424 |
| [< 0.01] | [> 0.10] | [> 0.10] | [< 0.01] | ||
|
| 20.559 | 0.863 | 7.188 | 6.336 | |
| [< 0.01] | [> 0.10] | [< 0.01] | [< 0.01] | ||
|
| 10.361 | 2.120 | 8.332 | 1.871 | |
| [< 0.01] | [< 0.05] | [< 0.01] | [< 0.10] | ||
| Δlog |
| 9.284 | 2.945 | 7.472 | 5.454 |
| [< 0.01] | [< 0.05] | [< 0.01] | [< 0.01] | ||
|
| 5.944 | 3.815 | 5.929 | 3.638 | |
| [< 0.01] | [< 0.01] | [< 0.01] | [< 0.01] | ||
|
| 7.685 | 3.574 | 2.525 | 7.448 | |
| [< 0.01] | [< 0.01] | [< 0.01] | [< 0.01] | ||
|
|
| 0.177 | 0.022 | -0.009 | 0.044 |
|
| 0.288 | -0.003 | -0.016 | 0.044 | |
|
| 0.280 | 0.249 | 0.467 | 0.161 | |
|
|
| 0.318 | 0.076 | 0.144 | 0.163 |
|
| 0.367 | 0.118 | 0.328 | 0.205 | |
|
| 0.407 | 0.418 | 0.552 | 0.406 |
Nowcasting summary (out-of-sample).
The period between 01/2004 and 12/2011 is used for model fitting and the rest of the period between 01/2012 and 12/2013 is used for the forecasting comparison. Diebold-Mariano test described in the Methods section compares the “Google model” defined in Eq 2 to the base model defined in Eq 3 with a null hypothesis of no difference of forecasting accuracy versus the alternative of the “Google model” being more accurate.
| Czech Rep. | Hungary | Poland | Slovakia | ||
|---|---|---|---|---|---|
| Diebold-Mariano test |
| 1.326 | 2.2895 | 0.6467 | -0.744 |
| [< 0.10] | [< 0.05] | [> 0.10] | [> 0.10] | ||
|
| 0.979 | 1.5425 | 1.635 | -1.203 | |
| [> 0.10] | [< 0.10] | [< 0.10] | [> 0.10] | ||
|
| 2.229 | 0.3312 | -0.146 | -4.021 | |
| [< 0.05] | [> 0.10] | [> 0.10] | [> 0.10] |