Literature DB >> 27872881

Infodemiological data of West-Nile virus disease in Italy in the study period 2004-2015.

Nicola Luigi Bragazzi1, Susanna Bacigaluppi2, Chiara Robba3, Anna Siri4, Giovanna Canepa5, Francesco Brigo6.   

Abstract

Google Trends (GT) was mined from 2004 to 2015, searching for West-Nile virus disease (WNVD) in Italy. GT-generated data were modeled as a time series and were analyzed using classical time series analyses. In particular, correlation between GT-based Relative Search Volumes (RSVs) related to WNVD and "real-world" epidemiological cases in the same study period resulted r=0.76 (p<0.0001) on a monthly basis and r=0.80 (p<0.0001) on a yearly basis. The partial autocorrelation analysis and the spectral analysis confirmed that a 1-year regular pattern could be detected. Correlation between GT-based RSVs related to WNVD yielded a r=0.54 (p<0.05) on a regional basis. Summarizing, GT-generated data concerning WNVD well correlated with epidemiology and could be exploited for complementing traditional surveillance.

Entities:  

Keywords:  Google Trends; Infodemiology and infoveillance; West-Nile virus disease

Year:  2016        PMID: 27872881      PMCID: PMC5107683          DOI: 10.1016/j.dib.2016.10.022

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data To the best of our knowledge, this is the first thorough quantitative analysis of West-Nile virus disease related web activities. The analyses presented in this data article show that Google Trends-generated data concerning the West-Nile virus disease well correlated with epidemiology in Italy. This analysis could be extended in other countries, in order to replicate the current findings in other settings and contexts. These data could be further mathematically and statistically refined for designing an approach for complementing traditional surveillance of the West-Nile virus disease.

Data

This paper contains infodemiological data concerning the West-Nile virus diseases related web-activities carried out in Italy from 2004 to 2015 (Fig. 1, Table 1). These data showed a cyclic regular pattern (Fig. 2, Fig. 3, Fig. 4, Table 2, Table 3), well correlating with epidemiological data (Fig. 5, Table 4).
Fig. 1

Digital interest for West-Nile virus disease in Italy at regional (A) and town (B) level, as captured by Google Trends.

Table 1

Digital interest for West-Nile virus disease in Italy at regional and town level. Abbreviation: RSV (relative search volume).

Interest at regional levelRSV (%)Interest at town levelRSV (%)
Sardinia100Cagliari100
Emilia-Romagna60Bologna43
Veneto47Padua38
Friuli Venezia Giulia47Milan20
Umbria32Rome20
Fig. 2

(a) GT-based West-Nile virus disease related web-searches. (b) The wavelet power spectrum. The contour levels are chosen so that 75%, 50%, 25%, and 5% of the wavelet power is above each level, respectively. A statistically significant regular 1-year pattern can be detected. (c) The global wavelet power spectrum.

Fig. 3

Google Trends-generated data concerning the West-Nile virus disease related web activities. Autocorrelation function values outside of the two-standard-error bands given by the black lines are statistically significant.

Fig. 4

Partial auto-correlation of the Google Trends-generated data concerning the West-Nile virus disease related web activities. Partial autocorrelation function values outside of the two-standard-error bands given by the black lines are statistically significant.

Table 2

Autocorrelation analysis of the Google Trends-generated data concerning West-Nile Virus disease related web activities.

LagAutocorrelationStandard deviationBox-Ljung statistics
ValueDegrees of freedomSig.
10.4560.08230.58910.000
20.0690.08231.28420.000
3−0.0870.08232.40030.000
4−0.1850.08237.53640.000
5−0.1980.08143.47150.000
6−0.1990.08149.50460.000
7−0.1690.08153.89970.000
8−0.1290.08056.46780.000
9−0.0300.08056.61090.000
100.1390.08059.653100.000
110.3480.08078.764110.000
120.3660.079100.104120.000
130.2070.079107.001130.000
140.0260.079107.113140.000
15−0.1070.078108.980150.000
16−0.1590.078113.131160.000
Table 3

Partial autocorrelation analysis of the Google Trends-generated data concerning the West-Nile virus disease related web activities.

LagPartial autocorrelationStandard deviation
10.4560.083
2−0.1760.083
3−0.0560.083
4−0.1360.083
5−0.0740.083
6−0.1180.083
7−0.0810.083
8−0.0880.083
90.0050.083
100.1130.083
110.2420.083
120.1130.083
130.0190.083
14−0.0170.083
15−0.0250.083
16−0.0070.083
Fig. 5

Correlational analysis between the Google Trends-generated data concerning West-Nile virus disease related web activities and the real epidemiological cases.

Table 4

Regression model of the Google Trends-generated data concerning West-Nile virus disease related web activities.

Indipendent variableCoefficientStandard errorrpartialtp-Value
Cases2.550.200.7412.870.0000
Month0.750.240.263.180.0018
Year1.000.240.334.170.0001

Experimental design, materials and methods

Google Trends (GT, a tool freely available at https://www.google.com/trends) was mined from 2004 to 2015, searching for West-Nile virus disease (WNVD). Epidemiological data were obtained and downloaded from the Epicentro Italian National Health Institute (ISS) website (accessible at http://www.epicentro.iss.it/problemi/westNile/Rizzo2011.asp) and from the IZSAM Caporale Teramo website (http://sorveglianza.izs.it/emergenze/west_nile/emergenze.html). GT-generated data were modeled as a time series and analyzed using classical time series analyses. In order to detect regular time patterns, spectral analysis was carried out using algorithms written in Matlab, freely available at http://paos.colorado.edu/research/wavelets/ [1]. Further, correlation between GT-based Relative Search Volumes (RSVs) related to WNVD and “real-world” epidemiological cases in the same study period was performed both on a monthly basis and on a yearly basis. Correlation between GT-based RSVs related to WNVD was also carried out on a regional basis. Autocorrelation and partial autocorrelation functions enable to compute the correlation of a time series with its own lagged values, respectively non controlling and controlling for the values at all shorter lags. Moreover, a regression model of the GT-generated data concerning WNVD-related web activities was performed. Autocorrelation and partial autocorrelation analyses, correlational analysis and regressions were performed using the commercial software Statistical Package for Social Science (SPSS, version 23.0, IL, USA) and the commercial software MedCalc Statistical Software version 16.4.3 (MedCalc Software bvba, Ostend, Belgium; https://www.medcalc.org; 2016). Figures with a p-value <0.05 were considered statistically significant.

Conflicts of interest

All authors declare no conflicts of interest.
Subject areaEpidemiology
More specific subject areaDigital epidemiology
Type of dataTable and graphs
How data were acquiredOutsourcing of Google Trends website and of the Italian National Health Institute (ISS) site concerning West-Nile virus disease
Data formatRaw, analyzed
Experimental factorsGoogle Trends search volumes were obtained through heat-maps
Experimental featuresValidation of Google Trends-based data with “real-world” data taken from the Italian National Health Institute (ISS) was performed by means of correlational analysis. Further, autocorrelation and partial autocorrelation analyses and regressions were carried out.
Data source locationItaly
Data accessibilityData are within this article
  7 in total

1.  An infodemiological investigation of the so-called "Fluad effect" during the 2014/2015 influenza vaccination campaign in Italy: Ethical and historical implications.

Authors:  Naim Mahroum; Abdulla Watad; Roberto Rosselli; Francesco Brigo; Valentina Chiesa; Anna Siri; Dana Ben-Ami Shor; Mariano Martini; Nicola Luigi Bragazzi; Mohammad Adawi
Journal:  Hum Vaccin Immunother       Date:  2018-02-15       Impact factor: 3.452

Review 2.  Neurology and the Internet: a review.

Authors:  Marcello Moccia; Francesco Brigo; Gioacchino Tedeschi; Simona Bonavita; Luigi Lavorgna
Journal:  Neurol Sci       Date:  2018-03-28       Impact factor: 3.307

3.  Assessing the Methods, Tools, and Statistical Approaches in Google Trends Research: Systematic Review.

Authors:  Amaryllis Mavragani; Gabriela Ochoa; Konstantinos P Tsagarakis
Journal:  J Med Internet Res       Date:  2018-11-06       Impact factor: 5.428

4.  Using social media listening and data mining to understand travellers' perspectives on travel disease risks and vaccine-related attitudes and behaviours.

Authors:  Catherine Bravo; Valérie Bosch Castells; Susann Zietek-Gutsch; Pierre-Antoine Bodin; Cliona Molony; Markus Frühwein
Journal:  J Travel Med       Date:  2022-03-21       Impact factor: 8.490

5.  Discrepancies Between Classic and Digital Epidemiology in Searching for the Mayaro Virus: Preliminary Qualitative and Quantitative Analysis of Google Trends.

Authors:  Mohammad Adawi; Nicola Luigi Bragazzi; Abdulla Watad; Kassem Sharif; Howard Amital; Naim Mahroum
Journal:  JMIR Public Health Surveill       Date:  2017-12-01

6.  Harnessing Big Data for Communicable Tropical and Sub-Tropical Disorders: Implications From a Systematic Review of the Literature.

Authors:  Vincenza Gianfredi; Nicola Luigi Bragazzi; Daniele Nucci; Mariano Martini; Roberto Rosselli; Liliana Minelli; Massimo Moretti
Journal:  Front Public Health       Date:  2018-03-21

7.  Data Mining and Content Analysis of the Chinese Social Media Platform Weibo During the Early COVID-19 Outbreak: Retrospective Observational Infoveillance Study.

Authors:  Jiawei Li; Qing Xu; Raphael Cuomo; Vidya Purushothaman; Tim Mackey
Journal:  JMIR Public Health Surveill       Date:  2020-04-21
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.