| Literature DB >> 28060809 |
Yue Teng1,2, Dehua Bi2,3, Guigang Xie2, Yuan Jin2,4, Yong Huang1,2, Baihan Lin5, Xiaoping An1,2, Dan Feng6, Yigang Tong1,2.
Abstract
We developed a dynamic forecasting model for Zika virus (ZIKV), based on real-time online search data from Google Trends (GTs). It was designed to provide Zika virus disease (ZVD) surveillance and detection for Health Departments, and predictive numbers of infection cases, which would allow them sufficient time to implement interventions. In this study, we found a strong correlation between Zika-related GTs and the cumulative numbers of reported cases (confirmed, suspected and total cases; p<0.001). Then, we used the correlation data from Zika-related online search in GTs and ZIKV epidemics between 12 February and 20 October 2016 to construct an autoregressive integrated moving average (ARIMA) model (0, 1, 3) for the dynamic estimation of ZIKV outbreaks. The forecasting results indicated that the predicted data by ARIMA model, which used the online search data as the external regressor to enhance the forecasting model and assist the historical epidemic data in improving the quality of the predictions, are quite similar to the actual data during ZIKV epidemic early November 2016. Integer-valued autoregression provides a useful base predictive model for ZVD cases. This is enhanced by the incorporation of GTs data, confirming the prognostic utility of search query based surveillance. This accessible and flexible dynamic forecast model could be used in the monitoring of ZVD to provide advanced warning of future ZIKV outbreaks.Entities:
Mesh:
Year: 2017 PMID: 28060809 PMCID: PMC5217860 DOI: 10.1371/journal.pone.0165085
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Time series plot of the daily GTs volumes from 12 February to 9 November 2016 (Yearly EPI Week 6 to 45).
Fig 2Time series plots of the number of reported ZIKV confirmed cases (A), suspected cases (B), total cases (C) and the accumulated Google search volumes (D) during the ZVD epidemic from 12 February to 9 November 2016 (Yearly EPI Week 6 to 45).
Fig 3Numbers of reported confirmed cases (A), suspected cases (B) and total cases (C) in the testing set compared with the simulation data by the advanced ARIMA (0, 1, 3) model for training set using the data of Google Trends as the external regressor. The red, blue and green solid lines represent the predicted, training and actual number of cases, respectively. The black dash lines represent the prediction of the linear baseline model. The blue region represents the 95% confidence interval predicted by the ARIMA (0, 1, 3) model.
Fig 4Forecasts of the number of ZIKV confirmed cases (A), suspected cases (B) and total cases (C) in worldwide between 27 October and 9 November 2016 (Yearly EPI Week 43 and 45) by the advanced ARIMA (0, 1, 3) model, which was improved by aggregating historical logs with the most-current 3 weeks’ data of Zika-related Google Trends as a estimating predictor to estimate ZVD cases. The red, blue and green solid lines represent the predicted, training and actual number of cases. The blue region represents the 95% confidence interval predicted by the ARIMA (0, 1, 3) model.