| Literature DB >> 27872881 |
Nicola Luigi Bragazzi1, Susanna Bacigaluppi2, Chiara Robba3, Anna Siri4, Giovanna Canepa5, Francesco Brigo6.
Abstract
Google Trends (GT) was mined from 2004 to 2015, searching for West-Nile virus disease (WNVD) in Italy. GT-generated data were modeled as a time series and were analyzed using classical time series analyses. In particular, correlation between GT-based Relative Search Volumes (RSVs) related to WNVD and "real-world" epidemiological cases in the same study period resulted r=0.76 (p<0.0001) on a monthly basis and r=0.80 (p<0.0001) on a yearly basis. The partial autocorrelation analysis and the spectral analysis confirmed that a 1-year regular pattern could be detected. Correlation between GT-based RSVs related to WNVD yielded a r=0.54 (p<0.05) on a regional basis. Summarizing, GT-generated data concerning WNVD well correlated with epidemiology and could be exploited for complementing traditional surveillance.Entities:
Keywords: Google Trends; Infodemiology and infoveillance; West-Nile virus disease
Year: 2016 PMID: 27872881 PMCID: PMC5107683 DOI: 10.1016/j.dib.2016.10.022
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Digital interest for West-Nile virus disease in Italy at regional (A) and town (B) level, as captured by Google Trends.
Digital interest for West-Nile virus disease in Italy at regional and town level. Abbreviation: RSV (relative search volume).
| Sardinia | 100 | Cagliari | 100 |
| Emilia-Romagna | 60 | Bologna | 43 |
| Veneto | 47 | Padua | 38 |
| Friuli Venezia Giulia | 47 | Milan | 20 |
| Umbria | 32 | Rome | 20 |
Fig. 2(a) GT-based West-Nile virus disease related web-searches. (b) The wavelet power spectrum. The contour levels are chosen so that 75%, 50%, 25%, and 5% of the wavelet power is above each level, respectively. A statistically significant regular 1-year pattern can be detected. (c) The global wavelet power spectrum.
Fig. 3Google Trends-generated data concerning the West-Nile virus disease related web activities. Autocorrelation function values outside of the two-standard-error bands given by the black lines are statistically significant.
Fig. 4Partial auto-correlation of the Google Trends-generated data concerning the West-Nile virus disease related web activities. Partial autocorrelation function values outside of the two-standard-error bands given by the black lines are statistically significant.
Autocorrelation analysis of the Google Trends-generated data concerning West-Nile Virus disease related web activities.
| 1 | 0.456 | 0.082 | 30.589 | 1 | 0.000 |
| 2 | 0.069 | 0.082 | 31.284 | 2 | 0.000 |
| 3 | −0.087 | 0.082 | 32.400 | 3 | 0.000 |
| 4 | −0.185 | 0.082 | 37.536 | 4 | 0.000 |
| 5 | −0.198 | 0.081 | 43.471 | 5 | 0.000 |
| 6 | −0.199 | 0.081 | 49.504 | 6 | 0.000 |
| 7 | −0.169 | 0.081 | 53.899 | 7 | 0.000 |
| 8 | −0.129 | 0.080 | 56.467 | 8 | 0.000 |
| 9 | −0.030 | 0.080 | 56.610 | 9 | 0.000 |
| 10 | 0.139 | 0.080 | 59.653 | 10 | 0.000 |
| 11 | 0.348 | 0.080 | 78.764 | 11 | 0.000 |
| 12 | 0.366 | 0.079 | 100.104 | 12 | 0.000 |
| 13 | 0.207 | 0.079 | 107.001 | 13 | 0.000 |
| 14 | 0.026 | 0.079 | 107.113 | 14 | 0.000 |
| 15 | −0.107 | 0.078 | 108.980 | 15 | 0.000 |
| 16 | −0.159 | 0.078 | 113.131 | 16 | 0.000 |
Partial autocorrelation analysis of the Google Trends-generated data concerning the West-Nile virus disease related web activities.
| 1 | 0.456 | 0.083 |
| 2 | −0.176 | 0.083 |
| 3 | −0.056 | 0.083 |
| 4 | −0.136 | 0.083 |
| 5 | −0.074 | 0.083 |
| 6 | −0.118 | 0.083 |
| 7 | −0.081 | 0.083 |
| 8 | −0.088 | 0.083 |
| 9 | 0.005 | 0.083 |
| 10 | 0.113 | 0.083 |
| 11 | 0.242 | 0.083 |
| 12 | 0.113 | 0.083 |
| 13 | 0.019 | 0.083 |
| 14 | −0.017 | 0.083 |
| 15 | −0.025 | 0.083 |
| 16 | −0.007 | 0.083 |
Fig. 5Correlational analysis between the Google Trends-generated data concerning West-Nile virus disease related web activities and the real epidemiological cases.
Regression model of the Google Trends-generated data concerning West-Nile virus disease related web activities.
| Cases | 2.55 | 0.20 | 0.74 | 12.87 | 0.0000 |
| Month | 0.75 | 0.24 | 0.26 | 3.18 | 0.0018 |
| Year | 1.00 | 0.24 | 0.33 | 4.17 | 0.0001 |
| Subject area | |
|---|---|
| More specific subject area | |
| Type of data | |
| How data were acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility |